Browsing by Author "Cho, Michael H."
Item A high-resolution HLA reference panel capturing global population diversity enables multi-ancestry fine-mapping in HIV host response (Springer Nature, 2021) Luo, Yang; Kanai, Masahiro; Choi, Wanson; Li, Xinyi; Sakaue, Saori; Yamamoto, Kenichi; Ogawa, Kotaro; Gutierrez-Arcelus, Maria; Gregersen, Peter K.; Stuart, Philip E.; Elder, James T.; Forer, Lukas; Schönherr, Sebastian; Fuchsberger, Christian; Smith, Albert V.; Fellay, Jacques; Carrington, Mary; Haas, David W.; Guo, Xiuqing; Palmer, Nicholette D.; Chen, Yii-Der Ida; Rotter, Jerome I.; Taylor, Kent D.; Rich, Stephen S.; Correa, Adolfo; Wilson, James G.; Kathiresan, Sekar; Cho, Michael H.; Metspalu, Andres; Esko, Tonu; Okada, Yukinori; Han, Buhm; NHLBI Trans-Omics for Precision Medicine (TOPMed) Consortium; McLaren, Paul J.; Raychaudhuri, Soumya; Obstetrics and Gynecology, School of Medicine
Fine-mapping to plausible causal variation may be more effective in multi-ancestry cohorts, particularly in the MHC, which has population-specific structure. To enable such studies, we constructed a large (n = 21,546) HLA reference panel spanning five global populations based on whole-genome sequences. Despite population-specific long-range haplotypes, we demonstrated accurate imputation at G-group resolution (94.2%, 93.7%, 97.8% and 93.7% in admixed African (AA), East Asian (EAS), European (EUR) and Latino (LAT) populations). Applying HLA imputation to genome-wide association study data for HIV-1 viral load in three populations (EUR, AA and LAT), we obviated effects of previously reported associations from population-specific HIV studies and discovered a novel association at position 156 in HLA-B.
We pinpointed the MHC association to three amino acid positions (97, 67 and 156) marking three consecutive pockets (C, B and D) within the HLA-B peptide-binding groove, explaining 12.9% of trait variance.

Item Allele-specific control of rodent and human lncRNA KMT2E-AS1 promotes hypoxic endothelial pathology in pulmonary hypertension (American Association for the Advancement of Science, 2024) Tai, Yi-Yin; Yu, Qiujun; Tang, Ying; Sun, Wei; Kelly, Neil J.; Okawa, Satoshi; Zhao, Jingsi; Schwantes-An, Tae-Hwi; Lacoux, Caroline; Torrino, Stephanie; Al Aaraj, Yassmin; El Khoury, Wadih; Negi, Vinny; Liu, Mingjun; Corey, Catherine G.; Belmonte, Frances; Vargas, Sara O.; Schwartz, Brian; Bhat, Bal; Chau, B. Nelson; Karnes, Jason H.; Satoh, Taijyu; Barndt, Robert J.; Wu, Haodi; Parikh, Victoria N.; Wang, Jianrong; Zhang, Yingze; McNamara, Dennis; Li, Gang; Speyer, Gil; Wang, Bing; Shiva, Sruti; Kaufman, Brett; Kim, Seungchan; Gomez, Delphine; Mari, Bernard; Cho, Michael H.; Boueiz, Adel; Pauciulo, Michael W.; Southgate, Laura; Trembath, Richard C.; Sitbon, Olivier; Humbert, Marc; Graf, Stefan; Morrell, Nicholas W.; Rhodes, Christopher J.; Wilkins, Martin R.; Nouraie, Mehdi; Nichols, William C.; Desai, Ankit A.; Bertero, Thomas; Chan, Stephen Y.; Medicine, School of Medicine
Hypoxic reprogramming of vasculature relies on genetic, epigenetic, and metabolic circuitry, but the control points are unknown. In pulmonary arterial hypertension (PAH), a disease driven by hypoxia inducible factor (HIF)-dependent vascular dysfunction, HIF-2α promoted expression of neighboring genes, long noncoding RNA (lncRNA) histone lysine N-methyltransferase 2E-antisense 1 (KMT2E-AS1) and histone lysine N-methyltransferase 2E (KMT2E). KMT2E-AS1 stabilized KMT2E protein to increase epigenetic histone 3 lysine 4 trimethylation (H3K4me3), driving HIF-2α-dependent metabolic and pathogenic endothelial activity.
This lncRNA axis also increased HIF-2α expression across epigenetic, transcriptional, and posttranscriptional contexts, thus promoting a positive feedback loop to further augment HIF-2α activity. We identified a genetic association between rs73184087, a single-nucleotide variant (SNV) within a KMT2E intron, and disease risk in PAH discovery and replication patient cohorts and in a global meta-analysis. This SNV displayed allele (G)-specific association with HIF-2α, engaged in long-range chromatin interactions, and induced the lncRNA-KMT2E tandem in hypoxic (G/G) cells. In vivo, KMT2E-AS1 deficiency protected against PAH in mice, as did pharmacologic inhibition of histone methylation in rats. Conversely, forced lncRNA expression promoted more severe PH. Thus, the KMT2E-AS1/KMT2E pair orchestrates across convergent multi-ome landscapes to mediate HIF-2α pathobiology and represents a key clinical target in pulmonary hypertension.

Item NHLBI-CMREF Workshop Report on Pulmonary Vascular Disease Classification: JACC State-of-the-Art Review (Elsevier, 2021) Oldham, William M.; Hemnes, Anna R.; Aldred, Micheala A.; Barnard, John; Brittain, Evan L.; Chan, Stephen Y.; Cheng, Feixiong; Cho, Michael H.; Desai, Ankit A.; Garcia, Joe G.N.; Geraci, Mark W.; Ghiassian, Susan D.; Hall, Kathryn T.; Horn, Evelyn M.; Jain, Mohit; Kelly, Rachel S.; Leopold, Jane A.; Lindstrom, Sara; Modena, Brian D.; Nichols, William C.; Rhodes, Christopher J.; Sun, Wei; Sweatt, Andrew J.; Vanderpool, Rebecca R.; Wilkins, Martin R.; Wilmot, Beth; Zamanian, Roham T.; Fessel, Joshua P.; Aggarwal, Neil R.; Loscalzo, Joseph; Xiao, Lei; Medicine, School of Medicine
The National Heart, Lung, and Blood Institute and the Cardiovascular Medical Research and Education Fund held a workshop on the application of pulmonary vascular disease omics data to the understanding, prevention, and treatment of pulmonary vascular disease.
Experts in pulmonary vascular disease, omics, and data analytics met to identify knowledge gaps and formulate ideas for future research priorities in pulmonary vascular disease in line with National Heart, Lung, and Blood Institute Strategic Vision goals. The group identified opportunities to develop analytic approaches to multiomic datasets, to identify molecular pathways in pulmonary vascular disease pathobiology, and to link novel phenotypes to meaningful clinical outcomes. The committee suggested support for interdisciplinary research teams to develop and validate analytic methods, a national effort to coordinate biosamples and data, a consortium of preclinical investigators to expedite target evaluation and drug development, longitudinal assessment of molecular biomarkers in clinical trials, and a task force to develop a master clinical trials protocol for pulmonary vascular disease.

Item Sequencing of 53,831 diverse genomes from the NHLBI TOPMed Program (Springer Nature, 2021) Taliun, Daniel; Harris, Daniel N.; Kessler, Michael D.; Carlson, Jedidiah; Szpiech, Zachary A.; Torres, Raul; Gagliano Taliun, Sarah A.; Corvelo, André; Gogarten, Stephanie M.; Kang, Hyun Min; Pitsillides, Achilleas N.; LeFaive, Jonathon; Lee, Seung-Been; Tian, Xiaowen; Browning, Brian L.; Das, Sayantan; Emde, Anne-Katrin; Clarke, Wayne E.; Loesch, Douglas P.; Shetty, Amol C.; Blackwell, Thomas W.; Smith, Albert V.; Wong, Quenna; Liu, Xiaoming; Conomos, Matthew P.; Bobo, Dean M.; Aguet, François; Albert, Christine; Alonso, Alvaro; Ardlie, Kristin G.; Arking, Dan E.; Aslibekyan, Stella; Auer, Paul L.; Barnard, John; Barr, R.
Graham; Barwick, Lucas; Becker, Lewis C.; Beer, Rebecca L.; Benjamin, Emelia J.; Bielak, Lawrence F.; Blangero, John; Boehnke, Michael; Bowden, Donald W.; Brody, Jennifer A.; Burchard, Esteban G.; Cade, Brian E.; Casella, James F.; Chalazan, Brandon; Chasman, Daniel I.; Chen, Yii-Der Ida; Cho, Michael H.; Choi, Seung Hoan; Chung, Mina K.; Clish, Clary B.; Correa, Adolfo; Curran, Joanne E.; Custer, Brian; Darbar, Dawood; Daya, Michelle; de Andrade, Mariza; DeMeo, Dawn L.; Dutcher, Susan K.; Ellinor, Patrick T.; Emery, Leslie S.; Eng, Celeste; Fatkin, Diane; Fingerlin, Tasha; Forer, Lukas; Fornage, Myriam; Franceschini, Nora; Fuchsberger, Christian; Fullerton, Stephanie M.; Germer, Soren; Gladwin, Mark T.; Gottlieb, Daniel J.; Guo, Xiuqing; Hall, Michael E.; He, Jiang; Heard-Costa, Nancy L.; Heckbert, Susan R.; Irvin, Marguerite R.; Johnsen, Jill M.; Johnson, Andrew D.; Kaplan, Robert; Kardia, Sharon L. R.; Kelly, Tanika; Kelly, Shannon; Kenny, Eimear E.; Kiel, Douglas P.; Klemmer, Robert; Konkle, Barbara A.; Kooperberg, Charles; Köttgen, Anna; Lange, Leslie A.; Lasky-Su, Jessica; Levy, Daniel; Lin, Xihong; Lin, Keng-Han; Liu, Chunyu; Loos, Ruth J. F.; Garman, Lori; Gerszten, Robert; Lubitz, Steven A.; Lunetta, Kathryn L.; Mak, Angel C. Y.; Manichaikul, Ani; Manning, Alisa K.; Mathias, Rasika A.; McManus, David D.; McGarvey, Stephen T.; Meigs, James B.; Meyers, Deborah A.; Mikulla, Julie L.; Minear, Mollie A.; Mitchell, Braxton D.; Mohanty, Sanghamitra; Montasser, May E.; Montgomery, Courtney; Morrison, Alanna C.; Murabito, Joanne M.; Natale, Andrea; Natarajan, Pradeep; Nelson, Sarah C.; North, Kari E.; O'Connell, Jeffrey R.; Palmer, Nicholette D.; Pankratz, Nathan; Peloso, Gina M.; Peyser, Patricia A.; Pleiness, Jacob; Post, Wendy S.; Psaty, Bruce M.; Rao, D. 
C.; Redline, Susan; Reiner, Alexander P.; Roden, Dan; Rotter, Jerome I.; Ruczinski, Ingo; Sarnowski, Chloé; Schoenherr, Sebastian; Schwartz, David A.; Seo, Jeong-Sun; Seshadri, Sudha; Sheehan, Vivien A.; Sheu, Wayne H.; Shoemaker, M. Benjamin; Smith, Nicholas L.; Smith, Jennifer A.; Sotoodehnia, Nona; Stilp, Adrienne M.; Tang, Weihong; Taylor, Kent D.; Telen, Marilyn; Thornton, Timothy A.; Tracy, Russell P.; Van Den Berg, David J.; Vasan, Ramachandran S.; Viaud-Martinez, Karine A.; Vrieze, Scott; Weeks, Daniel E.; Weir, Bruce S.; Weiss, Scott T.; Weng, Lu-Chen; Willer, Cristen J.; Zhang, Yingze; Zhao, Xutong; Arnett, Donna K.; Ashley-Koch, Allison E.; Barnes, Kathleen C.; Boerwinkle, Eric; Gabriel, Stacey; Gibbs, Richard; Rice, Kenneth M.; Rich, Stephen S.; Silverman, Edwin K.; Qasba, Pankaj; Gan, Weiniu; NHLBI Trans-Omics for Precision Medicine (TOPMed) Consortium; Papanicolaou, George J.; Nickerson, Deborah A.; Browning, Sharon R.; Zody, Michael C.; Zöllner, Sebastian; Wilson, James G.; Cupples, L. Adrienne; Laurie, Cathy C.; Jaquish, Cashell E.; Hernandez, Ryan D.; O'Connor, Timothy D.; Abecasis, Gonçalo R.; Epidemiology, Richard M. Fairbanks School of Public Health
The Trans-Omics for Precision Medicine (TOPMed) programme seeks to elucidate the genetic architecture and biology of heart, lung, blood and sleep disorders, with the ultimate goal of improving diagnosis, treatment and prevention of these diseases. The initial phases of the programme focused on whole-genome sequencing of individuals with rich phenotypic data and diverse backgrounds. Here we describe the TOPMed goals and design as well as the available resources and early insights obtained from the sequence data. The resources include a variant browser, a genotype imputation server, and genomic and phenotypic data that are available through dbGaP (Database of Genotypes and Phenotypes).
In the first 53,831 TOPMed samples, we detected more than 400 million single-nucleotide and insertion or deletion variants after alignment with the reference genome. Additional previously undescribed variants were detected through assembly of unmapped reads and customized analysis in highly variable loci. Among the more than 400 million detected variants, 97% have frequencies of less than 1% and 46% are singletons that are present in only one individual (53% among unrelated individuals). These rare variants provide insights into mutational processes and recent human evolutionary history. The extensive catalogue of genetic variation in TOPMed studies provides unique opportunities for exploring the contributions of rare and noncoding sequence variants to phenotypic variation. Furthermore, combining TOPMed haplotypes with modern imputation methods improves the power and reach of genome-wide association studies to include variants down to a frequency of approximately 0.01%.

Item Unsupervised representation learning improves genomic discovery and risk prediction for respiratory and circulatory functions and diseases (medRxiv, 2023-08-29) Yun, Taedong; Cosentino, Justin; Behsaz, Babak; McCaw, Zachary R.; Hill, Davin; Luben, Robert; Lai, Dongbing; Bates, John; Yang, Howard; Schwantes-An, Tae-Hwi; Zhou, Yuchen; Khawaja, Anthony P.; Carroll, Andrew; Hobbs, Brian D.; Cho, Michael H.; McLean, Cory Y.; Hormozdiari, Farhad; Medical and Molecular Genetics, School of Medicine
High-dimensional clinical data are becoming more accessible in biobank-scale datasets. However, effectively utilizing high-dimensional clinical data for genetic discovery remains challenging. Here we introduce a general deep learning-based framework, REpresentation learning for Genetic discovery on Low-dimensional Embeddings (REGLE), for discovering associations between genetic variants and high-dimensional clinical data.
REGLE uses convolutional variational autoencoders to compute a non-linear, low-dimensional, disentangled embedding of the data with highly heritable individual components. REGLE can incorporate expert-defined or clinical features and provides a framework to create accurate disease-specific polygenic risk scores (PRS) in datasets which have minimal expert phenotyping. We apply REGLE to both respiratory and circulatory systems: spirograms which measure lung function and photoplethysmograms (PPG) which measure blood volume changes. Genome-wide association studies on REGLE embeddings identify more genome-wide significant loci than existing methods and replicate known loci for both spirograms and PPG, demonstrating the generality of the framework. Furthermore, these embeddings are associated with overall survival. Finally, we construct a set of PRSs that improve predictive performance of asthma, chronic obstructive pulmonary disease, hypertension, and systolic blood pressure in multiple biobanks. Thus, REGLE embeddings can quantify clinically relevant features that are not currently captured in a standardized or automated way.

Item Unsupervised representation learning on high-dimensional clinical data improves genomic discovery and prediction (Springer Nature, 2024) Yun, Taedong; Cosentino, Justin; Behsaz, Babak; McCaw, Zachary R.; Hill, Davin; Luben, Robert; Lai, Dongbing; Bates, John; Yang, Howard; Schwantes-An, Tae-Hwi; Zhou, Yuchen; Khawaja, Anthony P.; Carroll, Andrew; Hobbs, Brian D.; Cho, Michael H.; McLean, Cory Y.; Hormozdiari, Farhad; Medical and Molecular Genetics, School of Medicine
Although high-dimensional clinical data (HDCD) are increasingly available in biobank-scale datasets, their use for genetic discovery remains challenging. Here we introduce an unsupervised deep learning model, Representation Learning for Genetic Discovery on Low-Dimensional Embeddings (REGLE), for discovering associations between genetic variants and HDCD.
REGLE leverages variational autoencoders to compute nonlinear disentangled embeddings of HDCD, which become the inputs to genome-wide association studies (GWAS). REGLE can uncover features not captured by existing expert-defined features and enables the creation of accurate disease-specific polygenic risk scores (PRSs) in datasets with very few labeled data. We apply REGLE to perform GWAS on respiratory and circulatory HDCD: spirograms measuring lung function and photoplethysmograms measuring blood volume changes. REGLE replicates known loci while identifying others not previously detected. REGLE embeddings are predictive of overall survival, and PRSs constructed from REGLE loci improve disease prediction across multiple biobanks. Overall, REGLE embeddings contain clinically relevant information beyond that captured by existing expert-defined features, leading to improved genetic discovery and disease prediction.