- Browse by Author
Browsing by Author "Hobbs, Brian D."
Now showing 1 - 3 of 3
Results Per Page
Sort Options
Item Unsupervised representation learning improves genomic discovery and risk prediction for respiratory and circulatory functions and diseases(medRxiv, 2023-08-29) Yun, Taedong; Cosentino, Justin; Behsaz, Babak; McCaw, Zachary R.; Hill, Davin; Luben, Robert; Lai, Dongbing; Bates, John; Yang, Howard; Schwantes-An, Tae-Hwi; Zhou, Yuchen; Khawaja, Anthony P.; Carroll, Andrew; Hobbs, Brian D.; Cho, Michael H.; McLean, Cory Y.; Hormozdiari, Farhad; Medical and Molecular Genetics, School of MedicineHigh-dimensional clinical data are becoming more accessible in biobank-scale datasets. However, effectively utilizing high-dimensional clinical data for genetic discovery remains challenging. Here we introduce a general deep learning-based framework, REpresentation learning for Genetic discovery on Low-dimensional Embeddings (REGLE), for discovering associations between genetic variants and high-dimensional clinical data. REGLE uses convolutional variational autoencoders to compute a non-linear, low-dimensional, disentangled embedding of the data with highly heritable individual components. REGLE can incorporate expert-defined or clinical features and provides a framework to create accurate disease-specific polygenic risk scores (PRS) in datasets which have minimal expert phenotyping. We apply REGLE to both respiratory and circulatory systems: spirograms which measure lung function and photoplethysmograms (PPG) which measure blood volume changes. Genome-wide association studies on REGLE embeddings identify more genome-wide significant loci than existing methods and replicate known loci for both spirograms and PPG, demonstrating the generality of the framework. Furthermore, these embeddings are associated with overall survival. Finally, we construct a set of PRSs that improve predictive performance of asthma, chronic obstructive pulmonary disease, hypertension, and systolic blood pressure in multiple biobanks. Thus, REGLE embeddings can quantify clinically relevant features that are not currently captured in a standardized or automated way.Item Unsupervised representation learning on high-dimensional clinical data improves genomic discovery and prediction(Springer Nature, 2024) Yun, Taedong; Cosentino, Justin; Behsaz, Babak; McCaw, Zachary R.; Hill, Davin; Luben, Robert; Lai, Dongbing; Bates, John; Yang, Howard; Schwantes-An, Tae-Hwi; Zhou, Yuchen; Khawaja, Anthony P.; Carroll, Andrew; Hobbs, Brian D.; Cho, Michael H.; McLean, Cory Y.; Hormozdiari, Farhad; Medical and Molecular Genetics, School of MedicineAlthough high-dimensional clinical data (HDCD) are increasingly available in biobank-scale datasets, their use for genetic discovery remains challenging. Here we introduce an unsupervised deep learning model, Representation Learning for Genetic Discovery on Low-Dimensional Embeddings (REGLE), for discovering associations between genetic variants and HDCD. REGLE leverages variational autoencoders to compute nonlinear disentangled embeddings of HDCD, which become the inputs to genome-wide association studies (GWAS). REGLE can uncover features not captured by existing expert-defined features and enables the creation of accurate disease-specific polygenic risk scores (PRSs) in datasets with very few labeled data. We apply REGLE to perform GWAS on respiratory and circulatory HDCD-spirograms measuring lung function and photoplethysmograms measuring blood volume changes. REGLE replicates known loci while identifying others not previously detected. REGLE are predictive of overall survival, and PRSs constructed from REGLE loci improve disease prediction across multiple biobanks. Overall, REGLE contain clinically relevant information beyond that captured by existing expert-defined features, leading to improved genetic discovery and disease prediction.Item Whole Genome Sequencing Analysis of Body Mass Index Identifies Novel African Ancestry-Specific Risk Allele(medRxiv, 2023-08-22) Zhang, Xinruo; Brody, Jennifer A.; Graff, Mariaelisa; Highland, Heather M.; Chami, Nathalie; Xu, Hanfei; Wang, Zhe; Ferrier, Kendra; Chittoor, Geetha; Josyula, Navya S.; Li, Xihao; Li, Zilin; Allison, Matthew A.; Becker, Diane M.; Bielak, Lawrence F.; Bis, Joshua C.; Boorgula, Meher Preethi; Bowden, Donald W.; Broome, Jai G.; Buth, Erin J.; Carlson, Christopher S.; Chang, Kyong-Mi; Chavan, Sameer; Chiu, Yen-Feng; Chuang, Lee-Ming; Conomos, Matthew P.; DeMeo, Dawn L.; Du, Margaret; Duggirala, Ravindranath; Eng, Celeste; Fohner, Alison E.; Freedman, Barry I.; Garrett, Melanie E.; Guo, Xiuqing; Haiman, Chris; Heavner, Benjamin D.; Hidalgo, Bertha; Hixson, James E.; Ho, Yuk-Lam; Hobbs, Brian D.; Hu, Donglei; Hui, Qin; Hwu, Chii-Min; Jackson, Rebecca D.; Jain, Deepti; Kalyani, Rita R.; Kardia, Sharon L. R.; Kelly, Tanika N.; Lange, Ethan M.; LeNoir, Michael; Li, Changwei; Marchand, Loic Le; McDonald, Merry-Lynn N.; McHugh, Caitlin P.; Morrison, Alanna C.; Naseri, Take; NHLBI Trans-Omics for Precision Medicine (TOPMed) Consortium; O'Connell, Jeffrey; O'Donnell, Christopher J.; Palmer, Nicholette D.; Pankow, James S.; Perry, James A.; Peters, Ulrike; Preuss, Michael H.; Rao, D. C.; Regan, Elizabeth A.; Reupena, Sefuiva M.; Roden, Dan M.; Rodriguez-Santana, Jose; Sitlani, Colleen M.; Smith, Jennifer A.; Tiwari, Hemant K.; Vasan, Ramachandran S.; Wang, Zeyuan; Weeks, Daniel E.; Wessel, Jennifer; Wiggins, Kerri L.; Wilkens, Lynne R.; Wilson, Peter W. F.; Yanek, Lisa R.; Yoneda, Zachary T.; Zhao, Wei; Zöllner, Sebastian; Arnett, Donna K.; Ashley-Koch, Allison E.; Barnes, Kathleen C.; Blangero, John; Boerwinkle, Eric; Burchard, Esteban G.; Carson, April P.; Chasman, Daniel I.; Chen, Yii-Der Ida; Curran, Joanne E.; Fornage, Myriam; Gordeuk, Victor R.; He, Jiang; Heckbert, Susan R.; Hou, Lifang; Irvin, Marguerite R.; Kooperberg, Charles; Minster, Ryan L.; Mitchell, Braxton D.; Nouraie, Mehdi; Psaty, Bruce M.; Raffield, Laura M.; Reiner, Alexander P.; Rich, Stephen S.; Rotter, Jerome I.; Shoemaker, M. Benjamin; Smith, Nicholas L.; Taylor, Kent D.; Telen, Marilyn J.; Weiss, Scott T.; Zhang, Yingze; Heard-Costa, Nancy; Sun, Yan V.; Lin, Xihong; Cupples, L. Adrienne; Lange, Leslie A.; Liu, Ching-Ti; Loos, Ruth J. F.; North, Kari E.; Justice, Anne E.; Biostatistics and Health Data Science, School of MedicineObesity is a major public health crisis associated with high mortality rates. Previous genome-wide association studies (GWAS) investigating body mass index (BMI) have largely relied on imputed data from European individuals. This study leveraged whole-genome sequencing (WGS) data from 88,873 participants from the Trans-Omics for Precision Medicine (TOPMed) Program, of which 51% were of non-European population groups. We discovered 18 BMI-associated signals (P < 5 × 10−9). Notably, we identified and replicated a novel low frequency single nucleotide polymorphism (SNP) in MTMR3 that was common in individuals of African descent. Using a diverse study population, we further identified two novel secondary signals in known BMI loci and pinpointed two likely causal variants in the POC5 and DMD loci. Our work demonstrates the benefits of combining WGS and diverse cohorts in expanding current catalog of variants and genes confer risk for obesity, bringing us one step closer to personalized medicine.