Federated learning with multi‐cohort real‐world data for predicting the progression from mild cognitive impairment to Alzheimer's disease

Date
2025
Language
American English
Found At
Wiley
Abstract

Introduction: This study assessed the feasibility of using federated learning (FL) on routinely collected electronic health records (EHRs) from multiple health-care institutions to predict the progression from mild cognitive impairment (MCI) to Alzheimer's disease (AD).

Methods: We analyzed EHR data from the OneFlorida+ consortium, simulating six sites, and used a long short-term memory (LSTM) model with a federated averaging (FedAvg) algorithm. A personalized FL approach was used to address between-site heterogeneity. Model performance was assessed using the area under the receiver operating characteristic curve (AUC) and feature importance techniques.
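The aggregation step of federated averaging can be illustrated with a minimal sketch. It assumes each simulated site has already trained its model locally and reports a flat list of parameters per layer along with its number of training patients; the names `fedavg`, `Weights`, `site_weights`, and `site_sizes` are hypothetical, and this is not the authors' implementation (no LSTM training or personalization is shown).

```python
from typing import Dict, List

# Hypothetical representation: layer name -> flat list of parameter values.
Weights = Dict[str, List[float]]


def fedavg(site_weights: List[Weights], site_sizes: List[int]) -> Weights:
    """Return the sample-size-weighted average of per-site parameters."""
    if not site_weights or len(site_weights) != len(site_sizes):
        raise ValueError("need one non-empty weight set and one size per site")
    total = sum(site_sizes)
    if total <= 0:
        raise ValueError("total sample size must be positive")

    averaged: Weights = {}
    for name in site_weights[0]:
        length = len(site_weights[0][name])
        # Each site's contribution is proportional to its sample count.
        averaged[name] = [
            sum(w[name][i] * n for w, n in zip(site_weights, site_sizes)) / total
            for i in range(length)
        ]
    return averaged
```

In a full training loop, the server would send the averaged parameters back to every site, each site would continue training on its own records, and the cycle would repeat for several rounds; only parameters, never patient records, leave a site.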

Results: Of 44,899 MCI patients, 6,391 progressed to AD. FL models achieved a 6% improvement in AUC compared to local models. Key predictive features included body mass index, vitamin B12, blood pressure, and others.
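For readers unfamiliar with the metric, the AUC used to compare FL and local models equals the probability that a randomly chosen progressor receives a higher predicted score than a randomly chosen non-progressor. The sketch below computes it directly from that definition; the function name `auc` and the toy inputs are illustrative assumptions, not data or code from the study.

```python
from typing import List


def auc(labels: List[int], scores: List[float]) -> float:
    """Pairwise (Mann-Whitney) AUC; ties between scores count as half a win."""
    pos = [s for y, s in zip(labels, scores) if y == 1]
    neg = [s for y, s in zip(labels, scores) if y == 0]
    if not pos or not neg:
        raise ValueError("need at least one positive and one negative label")

    wins = 0.0
    for p in pos:
        for n in neg:
            if p > n:
                wins += 1.0
            elif p == n:
                wins += 0.5
    return wins / (len(pos) * len(neg))
```

This pairwise form is quadratic in the number of patients, so it suits small illustrations; a rank-based computation would be preferable at the scale of a real cohort.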

Discussion: FL showed promise in predicting AD progression by integrating heterogeneous data across multiple institutions while preserving privacy. Despite limitations, it offers potential for future clinical applications.

Highlights: We applied long short-term memory and federated learning (FL) to predict mild cognitive impairment to Alzheimer's disease progression using electronic health record data from multiple institutions. FL improved prediction performance, with a 6% increase in area under the receiver operating characteristic curve compared to local models. We identified key predictive features, such as body mass index, vitamin B12, and blood pressure. FL shows effectiveness in handling data heterogeneity across multiple sites while ensuring data privacy. Personalized and pooled FL models generally performed better than global and local models.

Cite As
Pan J, Fan Z, Smith GE, Guo Y, Bian J, Xu J. Federated learning with multi-cohort real-world data for predicting the progression from mild cognitive impairment to Alzheimer's disease. Alzheimers Dement. 2025;21(4):e70128. doi:10.1002/alz.70128
Journal
Alzheimer's & Dementia
Source
PMC
Type
Article
Version
Final published version