Utilizing multimodal AI to improve genetic analyses of cardiovascular traits

dc.contributor.authorZhou, Yuchen
dc.contributor.authorCosentino, Justin
dc.contributor.authorYun, Taedong
dc.contributor.authorBiradar, Mahantesh I.
dc.contributor.authorShreibati, Jacqueline
dc.contributor.authorLai, Dongbing
dc.contributor.authorSchwantes-An, Tae-Hwi
dc.contributor.authorLuben, Robert
dc.contributor.authorMcCaw, Zachary
dc.contributor.authorEngmann, Jorgen
dc.contributor.authorProvidencia, Rui
dc.contributor.authorSchmidt, Amand Floriaan
dc.contributor.authorMunroe, Patricia
dc.contributor.authorYang, Howard
dc.contributor.authorCarroll, Andrew
dc.contributor.authorKhawaja, Anthony P.
dc.contributor.authorMcLean, Cory Y.
dc.contributor.authorBehsaz, Babak
dc.contributor.authorHormozdiari, Farhad
dc.contributor.departmentMedical and Molecular Genetics, School of Medicine
dc.date.accessioned2024-07-10T16:13:54Z
dc.date.available2024-07-10T16:13:54Z
dc.date.issued2024-03-20
dc.description.abstractElectronic health records, biobanks, and wearable biosensors contain multiple high-dimensional clinical data (HDCD) modalities (e.g., ECG, Photoplethysmography (PPG), and MRI) for each individual. Access to multimodal HDCD provides a unique opportunity for genetic studies of complex traits because different modalities relevant to a single physiological system (e.g., circulatory system) encode complementary and overlapping information. We propose a novel multimodal deep learning method, M-REGLE, for discovering genetic associations from a joint representation of multiple complementary HDCD modalities. We showcase the effectiveness of this model by applying it to several cardiovascular modalities. M-REGLE jointly learns a lower representation (i.e., latent factors) of multimodal HDCD using a convolutional variational autoencoder, performs genome wide association studies (GWAS) on each latent factor, then combines the results to study the genetics of the underlying system. To validate the advantages of M-REGLE and multimodal learning, we apply it to common cardiovascular modalities (PPG and ECG), and compare its results to unimodal learning methods in which representations are learned from each data modality separately, but the downstream genetic analyses are performed on the combined unimodal representations. M-REGLE identifies 19.3% more loci on the 12-lead ECG dataset, 13.0% more loci on the ECG lead I + PPG dataset, and its genetic risk score significantly outperforms the unimodal risk score at predicting cardiac phenotypes, such as atrial fibrillation (Afib), in multiple biobanks.
dc.eprint.versionPre-Print
dc.identifier.citationZhou Y, Cosentino J, Yun T, et al. Utilizing multimodal AI to improve genetic analyses of cardiovascular traits. Preprint. medRxiv. 2024;2024.03.19.24304547. Published 2024 Mar 20. doi:10.1101/2024.03.19.24304547
dc.identifier.urihttps://hdl.handle.net/1805/42082
dc.language.isoen_US
dc.publishermedRxiv
dc.relation.isversionof10.1101/2024.03.19.24304547
dc.rightsAttribution-NonCommercial-NoDerivatives 4.0 Internationalen
dc.rights.urihttp://creativecommons.org/licenses/by-nc-nd/4.0/
dc.sourcePMC
dc.subjectHigh-dimensional clinical data (HDCD)
dc.subjectElectronic health records
dc.subjectBiobanks
dc.subjectBiosensors
dc.titleUtilizing multimodal AI to improve genetic analyses of cardiovascular traits
dc.typeArticle
Files
Original bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
Zhou2024Utilizing-CCBYNCND.pdf
Size:
1.69 MB
Format:
Adobe Portable Document Format
License bundle
Now showing 1 - 1 of 1
No Thumbnail Available
Name:
license.txt
Size:
2.04 KB
Format:
Item-specific license agreed upon to submission
Description: