- Browse by Subject
Browsing by Subject "Whole genome sequencing"
Now showing 1 - 8 of 8
Results Per Page
Sort Options
Item A framework for detecting noncoding rare-variant associations of large-scale whole-genome sequencing studies(Springer Nature, 2022) Li, Zilin; Li, Xihao; Zhou, Hufeng; Gaynor, Sheila M.; Selvaraj, Margaret Sunitha; Arapoglou, Theodore; Quick, Corbin; Liu, Yaowu; Chen, Han; Sun, Ryan; Dey, Rounak; Arnett, Donna K.; Auer, Paul L.; Bielak, Lawrence F.; Bis, Joshua C.; Blackwell, Thomas W.; Blangero, John; Boerwinkle, Eric; Bowden, Donald W.; Brody, Jennifer A.; Cade, Brian E.; Conomos, Matthew P.; Correa, Adolfo; Cupples, L. Adrienne; Curran, Joanne E.; de Vries, Paul S.; Duggirala, Ravindranath; Franceschini, Nora; Freedman, Barry I.; Göring, Harald H. H.; Guo, Xiuqing; Kalyani, Rita R.; Kooperberg, Charles; Kral, Brian G.; Lange, Leslie A.; Lin, Bridget M.; Manichaikul, Ani; Manning, Alisa K.; Martin, Lisa W.; Mathias, Rasika A.; Meigs, James B.; Mitchell, Braxton D.; Montasser, May E.; Morrison, Alanna C.; Naseri, Take; O'Connell, Jeffrey R.; Palmer, Nicholette D.; Peyser, Patricia A.; Psaty, Bruce M.; Raffield, Laura M.; Redline, Susan; Reiner, Alexander P.; Reupena, Muagututi'a Sefuiva; Rice, Kenneth M.; Rich, Stephen S.; Smith, Jennifer A.; Taylor, Kent D.; Taub, Margaret A.; Vasan, Ramachandran S.; Weeks, Daniel E.; Wilson, James G.; Yanek, Lisa R.; Zhao, Wei; NHLBI Trans-Omics for Precision Medicine (TOPMed) Consortium; TOPMed Lipids Working Group; Rotter, Jerome I.; Willer, Cristen J.; Natarajan, Pradeep; Peloso, Gina M.; Lin, Xihong; Biostatistics and Health Data Science, School of MedicineLarge-scale whole-genome sequencing studies have enabled analysis of noncoding rare-variant (RV) associations with complex human diseases and traits. Variant-set analysis is a powerful approach to study RV association. However, existing methods have limited ability in analyzing the noncoding genome. We propose a computationally efficient and robust noncoding RV association detection framework, STAARpipeline, to automatically annotate a whole-genome sequencing study and perform flexible noncoding RV association analysis, including gene-centric analysis and fixed window-based and dynamic window-based non-gene-centric analysis by incorporating variant functional annotations. In gene-centric analysis, STAARpipeline uses STAAR to group noncoding variants based on functional categories of genes and incorporate multiple functional annotations. In non-gene-centric analysis, STAARpipeline uses SCANG-STAAR to incorporate dynamic window sizes and multiple functional annotations. We apply STAARpipeline to identify noncoding RV sets associated with four lipid traits in 21,015 discovery samples from the Trans-Omics for Precision Medicine (TOPMed) program and replicate several of them in an additional 9,123 TOPMed samples. We also analyze five non-lipid TOPMed traits.Item Association analysis of rare variants near the APOE region with CSF and neuroimaging biomarkers of Alzheimer's disease(Springer Nature, 2017-05-24) Nho, Kwangsik; Kim, Sungeun; Horgusluoglu, Emrin; Risacher, Shannon L.; Shen, Li; Kim, Dokyoon; Lee, Seunggeun; Foroud, Tatiana; Shaw, Leslie M.; Trojanowski, John Q.; Aisen, Paul S.; Petersen, Ronald C.; Jack, Clifford R., Jr.; Weiner, Michael W.; Green, Robert C.; Toga, Arthur W.; Saykin, Andrew J.; Radiology and Imaging Sciences, School of MedicineBACKGROUND: The APOE ε4 allele is the most significant common genetic risk factor for late-onset Alzheimer's disease (LOAD). The region surrounding APOE on chromosome 19 has also shown consistent association with LOAD. However, no common variants in the region remain significant after adjusting for APOE genotype. We report a rare variant association analysis of genes in the vicinity of APOE with cerebrospinal fluid (CSF) and neuroimaging biomarkers of LOAD. METHODS: Whole genome sequencing (WGS) was performed on 817 blood DNA samples from the Alzheimer's Disease Neuroimaging Initiative (ADNI). Sequence data from 757 non-Hispanic Caucasian participants was used in the present analysis. We extracted all rare variants (MAF (minor allele frequency) < 0.05) within a 312 kb window in APOE's vicinity encompassing 12 genes. We assessed CSF and neuroimaging (MRI and PET) biomarkers as LOAD-related quantitative endophenotypes. Gene-based analyses of rare variants were performed using the optimal Sequence Kernel Association Test (SKAT-O). RESULTS: A total of 3,334 rare variants (MAF < 0.05) were found within the APOE region. Among them, 72 rare non-synonymous variants were observed. Eight genes spanning the APOE region were significantly associated with CSF Aβ1-42 (p < 1.0 × 10-3). After controlling for APOE genotype and adjusting for multiple comparisons, 4 genes (CBLC, BCAM, APOE, and RELB) remained significant. Whole-brain surface-based analysis identified highly significant clusters associated with rare variants of CBLC in the temporal lobe region including the entorhinal cortex, as well as frontal lobe regions. Whole-brain voxel-wise analysis of amyloid PET identified significant clusters in the bilateral frontal and parietal lobes showing associations of rare variants of RELB with cortical amyloid burden. CONCLUSIONS: Rare variants within genes spanning the APOE region are significantly associated with LOAD-related CSF Aβ1-42 and neuroimaging biomarkers after adjusting for APOE genotype. These findings warrant further investigation and illustrate the role of next generation sequencing and quantitative endophenotypes in assessing rare variants which may help explain missing heritability in AD and other complex diseases.Item Comparison of whole genome sequencing and targeted sequencing for mitochondrial DNA(Elsevier, 2021) Chen, Ruoying; Aldred, Micheala A.; Xu, Weiling; Zein, Joe; Bazeley, Peter; Comhai, Suzy A. A.; Meyers, Deborah A.; Bleecker, Eugene R.; Liu, Chunyu; Erzurum, Serpil C.; Hu, Bo; NHLBI Severe Asthma Research Program (SARP); Medicine, School of MedicineMitochondrial dysfunction has emerged to be associated with a broad spectrum of diseases, and there is an increasing demand for accurate detection of mitochondrial DNA (mtDNA) variants. Whole genome sequencing (WGS) has been the dominant sequencing approach to identify genetic variants in recent decades, but most studies focus on variants on the nuclear genome. Whole genome sequencing is also costly and time consuming. Sequencing specifically targeted for mtDNA is commonly used in the diagnostic settings and has lower costs. However, there is a lack of pairwise comparisons between these two sequencing approaches for calling mtDNA variants on a population basis. In this study, we compared WGS and mtDNA-targeted sequencing (targeted-seq) in analyzing mitochondrial DNA from 1499 participants recruited into the Severe Asthma Research Program (SARP). Our study reveals that targeted-sequencing and WGS have comparable capacity to determine genotypes and to call haplogroups and homoplasmies on mtDNA. However, there exists a large variability in calling heteroplasmies, especially for low-frequency heteroplasmies, which indicates that investigators should be cautious about heteroplasmies acquired from different sequencing methods. Further research is highly desired to improve variant detection methods for mitochondrial DNA.Item Integration of bioinformatics and imaging informatics for identifying rare PSEN1 variants in Alzheimer's disease(BioMed Central, 2016-08-12) Nho, Kwangsik; Horgusluoglu, Emrin; Kim, Sungeun; Risacher, Shannon L.; Kim, Dokyoon; Foroud, Tatiana; Aisen, Paul S.; Peterson, Ronald C.; Jack Jr., Clifford R.; Shaw, Leslie M.; Trojanowski, John Q.; Weiner, Michael W.; Green, Robert C.; Toga, Arthur W.; Saykin, Andrew J.; Department of Radiology and Imaging Sciences, IU School of MedicineBACKGROUND: Pathogenic mutations in PSEN1 are known to cause familial early-onset Alzheimer's disease (EOAD) but common variants in PSEN1 have not been found to strongly influence late-onset AD (LOAD). The association of rare variants in PSEN1 with LOAD-related endophenotypes has received little attention. In this study, we performed a rare variant association analysis of PSEN1 with quantitative biomarkers of LOAD using whole genome sequencing (WGS) by integrating bioinformatics and imaging informatics. METHODS: A WGS data set (N = 815) from the Alzheimer's Disease Neuroimaging Initiative (ADNI) cohort was used in this analysis. 757 non-Hispanic Caucasian participants underwent WGS from a blood sample and high resolution T1-weighted structural MRI at baseline. An automated MRI analysis technique (FreeSurfer) was used to measure cortical thickness and volume of neuroanatomical structures. We assessed imaging and cerebrospinal fluid (CSF) biomarkers as LOAD-related quantitative endophenotypes. Single variant analyses were performed using PLINK and gene-based analyses of rare variants were performed using the optimal Sequence Kernel Association Test (SKAT-O). RESULTS: A total of 839 rare variants (MAF < 1/√(2 N) = 0.0257) were found within a region of ±10 kb from PSEN1. Among them, six exonic (three non-synonymous) variants were observed. A single variant association analysis showed that the PSEN1 p. E318G variant increases the risk of LOAD only in participants carrying APOE ε4 allele where individuals carrying the minor allele of this PSEN1 risk variant have lower CSF Aβ1-42 and higher CSF tau. A gene-based analysis resulted in a significant association of rare but not common (MAF ≥ 0.0257) PSEN1 variants with bilateral entorhinal cortical thickness. CONCLUSIONS: This is the first study to show that PSEN1 rare variants collectively show a significant association with the brain atrophy in regions preferentially affected by LOAD, providing further support for a role of PSEN1 in LOAD. The PSEN1 p. E318G variant increases the risk of LOAD only in APOE ε4 carriers. Integrating bioinformatics with imaging informatics for identification of rare variants could help explain the missing heritability in LOAD.Item Myeloma Genome Project Panel is a Comprehensive Targeted Genomics Panel for Molecular Profiling of Patients with Multiple Myeloma(American Association for Cancer Research, 2022) Sudha, Parvathi; Ahsan, Aarif; Ashby, Cody; Kausar, Tasneem; Khera, Akhil; Kazeroun, Mohammad H.; Hsu, Chih-Chao; Wang, Lin; Fitzsimons, Evelyn; Salminen, Outi; Blaney, Patrick; Czader, Magdalena; Williams, Jonathan; Zaid, Mohammad I. Abu; Ansari-Pour, Naser; Yong, Kwee L.; van Rhee, Frits; Pierceall, William E.; Morgan, Gareth J.; Flynt, Erin; Gooding, Sarah; Abonour, Rafat; Ramasamy, Karthik; Thakurta, Anjan; Walker, Brian A.; Medicine, School of MedicinePurpose: We designed a comprehensive multiple myeloma targeted sequencing panel to identify common genomic abnormalities in a single assay and validated it against known standards. Experimental design: The panel comprised 228 genes/exons for mutations, 6 regions for translocations, and 56 regions for copy number abnormalities (CNA). Toward panel validation, targeted sequencing was conducted on 233 patient samples and further validated using clinical FISH (translocations), multiplex ligation probe analysis (MLPA; CNAs), whole-genome sequencing (WGS; CNAs, mutations, translocations), or droplet digital PCR (ddPCR) of known standards (mutations). Results: Canonical immunoglobulin heavy chain translocations were detected in 43.2% of patients by sequencing, and aligned with FISH except for 1 patient. CNAs determined by sequencing and MLPA for 22 regions were comparable in 103 samples and concordance between platforms was R2 = 0.969. Variant allele frequency (VAF) for 74 mutations were compared between sequencing and ddPCR with concordance of R2 = 0.9849. Conclusions: In summary, we have developed a targeted sequencing panel that is as robust or superior to FISH and WGS. This molecular panel is cost-effective, comprehensive, clinically actionable, and can be routinely deployed to assist risk stratification at diagnosis or posttreatment to guide sequencing of therapies.Item Powerful, scalable and resource-efficient meta-analysis of rare variant associations in large whole genome sequencing studies(Springer Nature, 2023) Li, Xihao; Quick, Corbin; Zhou, Hufeng; Gaynor, Sheila M.; Liu, Yaowu; Chen, Han; Selvaraj, Margaret Sunitha; Sun, Ryan; Dey, Rounak; Arnett, Donna K.; Bielak, Lawrence F.; Bis, Joshua C.; Blangero, John; Boerwinkle, Eric; Bowden, Donald W.; Brody, Jennifer A.; Cade, Brian E.; Correa, Adolfo; Cupples, L. Adrienne; Curran, Joanne E.; de Vries, Paul S.; Duggirala, Ravindranath; Freedman, Barry I.; Göring, Harald H. H.; Guo, Xiuqing; Haessler, Jeffrey; Kalyani, Rita R.; Kooperberg, Charles; Kral, Brian G.; Lange, Leslie A.; Manichaikul, Ani; Martin, Lisa W.; McGarvey, Stephen T.; Mitchell, Braxton D.; Montasser, May E.; Morrison, Alanna C.; Naseri, Take; O'Connell, Jeffrey R.; Palmer, Nicholette D.; Peyser, Patricia A.; Psaty, Bruce M.; Raffield, Laura M.; Redline, Susan; Reiner, Alexander P.; Reupena, Muagututi'a Sefuiva; Rice, Kenneth M.; Rich, Stephen S.; Sitlani, Colleen M.; Smith, Jennifer A.; Taylor, Kent D.; Vasan, Ramachandran S.; Willer, Cristen J.; Wilson, James G.; Yanek, Lisa R.; Zhao, Wei; NHLBI Trans-Omics for Precision Medicine (TOPMed) Consortium; TOPMed Lipids Working Group; Rotter, Jerome I.; Natarajan, Pradeep; Peloso, Gina M.; Li, Zilin; Lin, Xihong; Biostatistics and Health Data Science, School of MedicineMeta-analysis of whole genome sequencing/whole exome sequencing (WGS/WES) studies provides an attractive solution to the problem of collecting large sample sizes for discovering rare variants associated with complex phenotypes. Existing rare variant meta-analysis approaches are not scalable to biobank-scale WGS data. Here we present MetaSTAAR, a powerful and resource-efficient rare variant meta-analysis framework for large-scale WGS/WES studies. MetaSTAAR accounts for relatedness and population structure, can analyze both quantitative and dichotomous traits and boosts the power of rare variant tests by incorporating multiple variant functional annotations. Through meta-analysis of four lipid traits in 30,138 ancestrally diverse samples from 14 studies of the Trans Omics for Precision Medicine (TOPMed) Program, we show that MetaSTAAR performs rare variant meta-analysis at scale and produces results comparable to using pooled data. Additionally, we identified several conditionally significant rare variant associations with lipid traits. We further demonstrate that MetaSTAAR is scalable to biobank-scale cohorts through meta-analysis of TOPMed WGS data and UK Biobank WES data of ~200,000 samples.Item Rapid Whole-Genomic Sequencing and a Targeted Neonatal Gene Panel in Infants With a Suspected Genetic Disorder(American Medical Association, 2023) Maron, Jill L.; Kingsmore, Stephen; Gelb, Bruce D.; Vockley, Jerry; Wigby, Kristen; Bragg, Jennifer; Stroustrup, Annemarie; Poindexter, Brenda; Suhrie, Kristen; Kim, Jae H.; Diacovo, Thomas; Powell, Cynthia M.; Trembath, Andrea; Guidugli, Lucia; Ellsworth, Katarzyna A.; Reed, Dallas; Kurfiss, Anne; Breeze, Janis L.; Trinquart, Ludovic; Davis, Jonathan M.; Pediatrics, School of MedicineImportance: Genomic testing in infancy guides medical decisions and can improve health outcomes. However, it is unclear whether genomic sequencing or a targeted neonatal gene-sequencing test provides comparable molecular diagnostic yields and times to return of results. Objective: To compare outcomes of genomic sequencing with those of a targeted neonatal gene-sequencing test. Design, setting, and participants: The Genomic Medicine for Ill Neonates and Infants (GEMINI) study was a prospective, comparative, multicenter study of 400 hospitalized infants younger than 1 year of age (proband) and their parents, when available, suspected of having a genetic disorder. The study was conducted at 6 US hospitals from June 2019 to November 2021. Exposure: Enrolled participants underwent simultaneous testing with genomic sequencing and a targeted neonatal gene-sequencing test. Each laboratory performed an independent interpretation of variants guided by knowledge of the patient's phenotype and returned results to the clinical care team. Change in clinical management, therapies offered, and redirection of care was provided to families based on genetic findings from either platform. Main outcomes and measures: Primary end points were molecular diagnostic yield (participants with ≥1 pathogenic variant or variant of unknown significance), time to return of results, and clinical utility (changes in patient care). Results: A molecular diagnostic variant was identified in 51% of participants (n = 204; 297 variants identified with 134 being novel). Molecular diagnostic yield of genomic sequencing was 49% (95% CI, 44%-54%) vs 27% (95% CI, 23%-32%) with the targeted gene-sequencing test. Genomic sequencing did not report 19 variants found by the targeted neonatal gene-sequencing test; the targeted gene-sequencing test did not report 164 variants identified by genomic sequencing as diagnostic. Variants unidentified by the targeted genomic-sequencing test included structural variants longer than 1 kilobase (25.1%) and genes excluded from the test (24.6%) (McNemar odds ratio, 8.6 [95% CI, 5.4-14.7]). Variant interpretation by laboratories differed by 43%. Median time to return of results was 6.1 days for genomic sequencing and 4.2 days for the targeted genomic-sequencing test; for urgent cases (n = 107) the time was 3.3 days for genomic sequencing and 4.0 days for the targeted gene-sequencing test. Changes in clinical care affected 19% of participants, and 76% of clinicians viewed genomic testing as useful or very useful in clinical decision-making, irrespective of a diagnosis. Conclusions and relevance: The molecular diagnostic yield for genomic sequencing was higher than a targeted neonatal gene-sequencing test, but the time to return of routine results was slower. Interlaboratory variant interpretation contributes to differences in molecular diagnostic yield and may have important consequences for clinical management.Item Rare variants in the splicing regulatory elements of EXOC3L4 are associated with brain glucose metabolism in Alzheimer's disease(Biomed Central, 2018-09-14) Miller, Jason E.; Shivakumar, Manu K.; Lee, Younghee; Han, Seonggyun; Horgousluoglu, Emrin; Risacher, Shannon L.; Saykin, Andrew J.; Nho, Kwangsik; Kim, Dokyoon; Radiology and Imaging Sciences, School of MedicineBACKGROUND: Alzheimer's disease (AD) is one of the most common neurodegenerative diseases that causes problems related to brain function. To some extent it is understood on a molecular level how AD arises, however there are a lack of biomarkers that can be used for early diagnosis. Two popular methods to identify AD-related biomarkers use genetics and neuroimaging. Genes and neuroimaging phenotypes have provided some insights as to the potential for AD biomarkers. While the field of imaging-genomics has identified genetic features associated with structural and functional neuroimaging phenotypes, it remains unclear how variants that affect splicing could be important for understanding the genetic etiology of AD. METHODS: In this study, rare variants (minor allele frequency < 0.01) in splicing regulatory element (SRE) loci from whole genome sequencing (WGS) in the Alzheimer's Disease Neuroimaging Initiative (ADNI) cohort, were used to identify genes that are associated with global brain cortical glucose metabolism in AD measured by FDG PET-scans. Gene-based associated analyses of rare variants were performed using the program BioBin and the optimal Sequence Kernel Association Test (SKAT-O). RESULTS: The gene, EXOC3L4, was identified as significantly associated with global cortical glucose metabolism (FDR (false discovery rate) corrected p < 0.05) using SRE coding variants only. Three loci that may affect splicing within EXOC3L4 contribute to the association. CONCLUSION: Based on sequence homology, EXOC3L4 is likely a part of the exocyst complex. Our results suggest the possibility that variants which affect proper splicing of EXOC3L4 via SREs may impact vesicle transport, giving rise to AD related phenotypes. Overall, by utilizing WGS and functional neuroimaging we have identified a gene significantly associated with an AD related endophenotype, potentially through a mechanism that involves splicing.