- Browse by Subject
Browsing by Subject "RNA splicing"
Now showing 1 - 8 of 8
Results Per Page
Sort Options
Item circMeta: a unified computational framework for genomic feature annotation and differential expression analysis of circular RNAs(Oxford University Press, 2020-01-15) Chen, Li; Wang, Feng; Bruggeman, Emily C.; Li, Chao; Yao, Bing; Medicine, School of MedicineMotivation: Circular RNAs (circRNAs), a class of non-coding RNAs generated from non-canonical back-splicing events, have emerged to play key roles in many biological processes. Though numerous tools have been developed to detect circRNAs from rRNA-depleted RNA-seq data based on back-splicing junction-spanning reads, computational tools to identify critical genomic features regulating circRNA biogenesis are still lacking. In addition, rigorous statistical methods to perform differential expression (DE) analysis of circRNAs remain under-developed. Results: We present circMeta, a unified computational framework for circRNA analyses. circMeta has three primary functional modules: (i) a pipeline for comprehensive genomic feature annotation related to circRNA biogenesis, including length of introns flanking circularized exons, repetitive elements such as Alu elements and SINEs, competition score for forming circulation and RNA editing in back-splicing flanking introns; (ii) a two-stage DE approach of circRNAs based on circular junction reads to quantitatively compare circRNA levels and (iii) a Bayesian hierarchical model for DE analysis of circRNAs based on the ratio of circular reads to linear reads in back-splicing sites to study spatial and temporal regulation of circRNA production. Both proposed DE methods without and with considering host genes outperform existing methods by obtaining better control of false discovery rate and comparable statistical power. Moreover, the identified DE circRNAs by the proposed two-stage DE approach display potential biological functions in Gene Ontology and circRNA-miRNA-mRNA networks that are not able to be detected using existing mRNA DE methods. Furthermore, top DE circRNAs have been further validated by RT-qPCR using divergent primers spanning back-splicing junctions.Item Discovery and evolutionary dynamics of RBPs and circular RNAs in mammalian transcriptomes(2015-03-30) Badve, Abhijit; Janga, Sarath Chandra; Hahn, Matthew William; Liu, YunlongRNA-binding proteins (RBPs) are vital post-transcriptional regulatory molecules in transcriptome of mammalian species. It necessitates studying their expression dynamics to extract how post-transcriptional networks work in various mammalian tissues. RNA binding proteins (RBPs) play important roles in controlling the post-transcriptional fate of RNA molecules, yet their evolutionary dynamics remains largely unknown. As expression profiles of genes encoding for RBPs can yield insights about their evolutionary trajectories on the post-transcriptional regulatory networks across species, we performed a comparative analyses of RBP expression profiles across 8 tissues (brain, cerebellum, heart, lung, liver, lung, skeletal muscle, testis) in 11 mammals (human, chimpanzee, gorilla, orangutan, macaque, rat, mouse, platypus, opossum, cow) and chicken & frog (evolutionary outgroups). Noticeably, orthologous gene expression profiles suggest a significantly higher expression level for RBPs than their non-RBP gene counterparts, which include other protein-coding and non-coding genes, across all the mammalian tissues studied here. This trend is significant irrespective of the tissue and species being compared, though RBP gene expression distribution patterns were found to be generally diverse in nature. Our analysis also shows that RBPs are expressed at a significantly lower level in human and mouse tissues compared to their expression levels in equivalent tissues in other mammals: chimpanzee, orangutan, rat, etc., which are all likely exposed to diverse natural habitats and ecological settings compared to more stable ecological environment humans and mice might have been exposed, thus reducing the need for complex and extensive post-transcriptional control. Further analysis of the similarity of orthologous RBP expression profiles between all pairs of tissue-mammal combinations clearly showed the grouping of RBP expression profiles across tissues in a given mammal, in contrast to the clustering of expression profiles for non-RBPs, which frequently grouped equivalent tissues across diverse mammalian species together, suggesting a significant evolution of RBPs expression after speciation events. Calculation of species specificity indices (SSIs) for RBPs across various tissues, to identify those that exhibited restricted expression to few mammals, revealed that about 30% of the RBPs are species-specific in at least one tissue studied here, with lung, liver, kidney & testis exhibiting a significantly higher proportion of species specifically expressed RBPs. We conducted a differential expression analysis of RBPs in human, mouse and chicken tissues to study the evolution of expression levels in recently evolved species (i.e., humans and mice) than evolutionarily-distant species (i.e., chickens). We identified more than 50% of the orthologous RBPs to be differentially expressed in at least one tissue, compared between human and mouse, but not so between human and an outgroup chicken, in which RBP expression levels are relatively conserved. Among the studied tissues (brain, liver and kidney) showed a higher fraction of differentially expressed RBPs, which may suggest hyper- regulatory activities by RBPs in these tissues with species evolution. Overall, this study forms a foundation for understanding the evolution of expression levels of RBPs in mammals, facilitating a snapshot of the wiring patterns of post-transcriptional regulatory networks in mammalian genomes. In our second study, we focused on elucidating novel features of post-transcriptional regulatory molecules called as circRNA from LongPolyA RNA-sequence data. The debate over presence of nonlinear exon splicing such as exon-shuffling or formation of circularized forms has finally come to an end as numerous repertoires have shown of their occurrence and presence through transcriptomic analyses. It is evident from previous studies that along with consensus-site splicing non-consensus site splicing is robustly occurring in the cell. Also, in spite of applying different high-throughput approaches (both computational and experimental) to determine their abundance, the signal is consistent and strongly conforming the plausible circularization mechanisms. Earlier studies hypothesized and hence focused on the ribo-minus non-polyA RNA-sequence data to identify circular RNA structures in cell and compared their abundance levels with their linear counterparts. Thus far, the studies show their conserved nature across tissues and species also that they are not translated and preferentially are without poly (A) tail, with one to five exons long. Much of this initial work has been performed using non-polyA sequencing thus probably underestimates the abundance of circular RNAs originating from long poly (A) RNA isoforms. Our hypothesis is if the circular RNA events are not the artifact of random events, but has a structured and defined mechanism for their formation, then there would not be biases on preferential selection / leaving of polyA tails, while forming the circularized isoforms. We have applied an existing computational pipeline from earlier studies by Memczack et. al., on ENCODE cell-lines long poly (A) RNA-sequence data. With the same pipeline, we achieve a significant number of circular RNA isoforms in the data, some of which are overlapping with known circular RNA isoforms from the literature. We identified an approach and worked upon to identify the precise structure of circular RNA, which is not plausible from the existing computational approaches. We aim to study their expression profiles in normal and cancer cell-lines, and see if there exists any pattern and functional significance based on their abundance levels in the cell.Item FOXP3 interacts with hnRNPF to modulate pre-mRNA alternative splicing(American Society for Biochemistry and Molecular Biology, 2018-06-29) Du, Jianguang; Wang, Qun; Ziegler, Steven F.; Zhou, Baohua; Microbiology and Immunology, School of MedicineFOXP3 promotes the development and function of regulatory T cells mainly through regulating the transcription of target genes. RNA alternative splicing has been implicated in a wide range of physiological and pathophysiological processes. We report here that FOXP3 associates with heterogeneous nuclear ribonucleoprotein (hnRNP) F through the exon 2-encoded region of FOXP3 and the second quasi-RNA recognition motif (qRRM) of hnRNPF. FOXP3 represses the ability of hnRNPF to bind to its target pre-mRNA and thus modulates RNA alternative splicing. Furthermore, overexpression of mouse hnRNPF in in vitro-differentiated regulatory T cells (Tregs) reduced their suppressive function. Thus, our studies identify a novel mechanism by which FOXP3 regulates mRNA alternative splicing to modulate the function of regulatory T cells.Item Human carboxylesterase 2 splice variants: expression, activity, and role in the metabolism of irinotecan and capecitabine(2009-02) Schiel, Marissa Ann; Bosron, William F.; Chiorean, E. Gabriela; Flockhart, David A.; Harrington, Maureen A.; Sanghani, Sonal P.Carboxylesterases (CES) are enzymes that metabolize a wide variety of compounds including esters, thioesters, carbamates, and amides. In humans there are three known carboxylesterase genes CES1, CES2, and CES3. Irinotecan (CPT-11) and capecitabine are important chemotherapeutic prodrugs that are used for the treatment of colorectal cancer. Of the three CES isoenzymes, CES2 has the highest catalytic efficiency for irinotecan activation. There is large inter-individual variation in response to treatment with irinotecan. Life-threatening late-onset diarrhea has been reported in approximately 13% of patients receiving irinotecan. Several studies have reported single nucleotide polymorphisms (SNPs) for the CES2 gene. However, there has been no consensus on the effect of different CES2 SNPs and their relationship to CES2 RNA expression or irinotecan hydrolase activity. Three CES2 mRNA transcripts of approximately 2kb,3kb, and 4kb have been identified by multi-tissue northern analysis. The expressed sequence tag (EST) database indicates that CES2 undergoes several splicing events that could generate up to six potential proteins. Four of the proteins CES2, CES2458-473, CES2+64, CES21-93 were studied to characterize their expression and activity. Multi-tissue northern analysis revealed that CES2+64 corresponds to the 4kb and 3kb transcripts while CES21-93 is located only in the 4 kb transcript. CES2458-473 is an inactive splice variant that accounts for approximately 6% of the CES2 transcripts in normal and tumor colon tissue. There is large inter-individual variation in CES2 expression in both tumor and normal colon samples. Characterization of CES2+64 identified the protein as normal CES2 indicating that the signal peptide is recognized in spite of the additional 64 amino acids at the N-terminus. Sub-cellular localization studies revealed that CES2 and CES2+64 localize to the ER, and CES21-93 localizes to the cytoplasm. To date CES2 SNP data has not provided any explanation for the high inter-individual variability in response to irinotecan treatment. Multi-tissue northern blots indicate that CES2 is expressed in a tissue specific manner. We have identified the CES2 variants which correspond to each mRNA transcript. This information will be critical to defining the role of CES2 variants in the different tissues.Item Intron retention-induced neoantigen load correlates with unfavorable prognosis in multiple myeloma(Springer Nature, 2021-10) Dong, Chuanpeng; Cesarano, Annamaria; Bombaci, Giuseppe; Reiter, Jill L.; Yu, Christina Y.; Wang, Yue; Jiang, Zhaoyang; Zaid, Mohammad Abu; Huang, Kun; Lu, Xiongbin; Walker, Brian A.; Perna, Fabiana; Liu, Yunlong; BioHealth Informatics, School of Informatics and ComputingNeoantigen peptides arising from genetic alterations may serve as targets for personalized cancer vaccines and as positive predictors of response to immune checkpoint therapy. Mutations in genes regulating RNA splicing are common in hematological malignancies leading to dysregulated splicing and intron retention (IR). In this study, we investigated IR as a potential source of tumor neoantigens in multiple myeloma (MM) patients and the relationship of IR-induced neoantigens (IR-neoAg) with clinical outcomes. MM-specific IR events were identified in RNA-sequencing data from the Multiple Myeloma Research Foundation CoMMpass study after removing IR events that also occurred in normal plasma cells. We quantified the IR-neoAg load by assessing IR-induced novel peptides that were predicted to bind to major histocompatibility complex (MHC) molecules. We found that high IR-neoAg load was associated with poor overall survival in both newly diagnosed and relapsed MM patients. Further analyses revealed that poor outcome in MM patients with high IR-neoAg load was associated with high expression levels of T-cell co-inhibitory molecules and elevated interferon signaling activity. We also found that MM cells exhibiting high IR levels had lower MHC-II protein abundance and treatment of MM cells with a spliceosome inhibitor resulted in increased MHC-I protein abundance. Our findings suggest that IR-neoAg may represent a novel biomarker of MM patient clinical outcome and further that targeting RNA splicing may serve as a potential therapeutic strategy to prevent MM immune escape and promote response to checkpoint blockade.Item Mutational and splicing landscape in a cohort of 43,000 patients tested for hereditary cancer(Springer Nature, 2022-08-25) Horton, Carolyn; Cass, Ashley; Conner, Blair R.; Hoang, Lily; Zimmermann, Heather; Abualkheir, Nelly; Burks, David; Qian, Dajun; Molparia, Bhuvan; Vuong, Huy; LaDuca, Holly; Grzybowski, Jessica; Durda, Kate; Pilarski, Robert; Profato, Jessica; Clayback, Katherine; Mahoney, Martin; Schroeder, Courtney; Torres-Martinez, Wilfredo; Elliott, Aaron; Chao, Elizabeth C.; Karam, Rachid; Medical and Molecular Genetics, School of MedicineDNA germline genetic testing can identify individuals with cancer susceptibility. However, DNA sequencing alone is limited in its detection and classification of mRNA splicing variants, particularly those located far from coding sequences. Here we address the limitations of splicing variant identification and interpretation by pairing DNA and RNA sequencing and describe the mutational and splicing landscape in a clinical cohort of 43,524 individuals undergoing genetic testing for hereditary cancer predisposition.Item RegSNPs-intron: a computational framework for predicting pathogenic impact of intronic single nucleotide variants(BMC, 2019-11-28) Lin, Hai; Hargreaves, Katherine A.; Li, Rudong; Reiter, Jill L.; Wang, Yue; Mort, Matthew; Cooper, David N.; Zhou, Yaoqi; Zhang, Chi; Eadon, Michael T.; Dolan, M. Eileen; Ipe, Joseph; Skaar, Todd C.; Liu, Yunlong; Medical and Molecular Genetics, School of MedicineSingle nucleotide variants (SNVs) in intronic regions have yet to be systematically investigated for their disease-causing potential. Using known pathogenic and neutral intronic SNVs (iSNVs) as training data, we develop the RegSNPs-intron algorithm based on a random forest classifier that integrates RNA splicing, protein structure, and evolutionary conservation features. RegSNPs-intron showed excellent performance in evaluating the pathogenic impacts of iSNVs. Using a high-throughput functional reporter assay called ASSET-seq (ASsay for Splicing using ExonTrap and sequencing), we evaluate the impact of RegSNPs-intron predictions on splicing outcome. Together, RegSNPs-intron and ASSET-seq enable effective prioritization of iSNVs for disease pathogenesis.Item RNA alternative splicing impacts the risk for alcohol use disorder(Springer Nature, 2023) Li, Rudong; Reiter, Jill L.; Chen, Andy B.; Chen, Steven X.; Foroud, Tatiana; Edenberg, Howard J.; Lai, Dongbing; Liu, Yunlong; Medical and Molecular Genetics, School of MedicineAlcohol use disorder (AUD) is a complex genetic disorder characterized by problems arising from excessive alcohol consumption. Identifying functional genetic variations that contribute to risk for AUD is a major goal. Alternative splicing of RNA mediates the flow of genetic information from DNA to gene expression and expands proteome diversity. We asked whether alternative splicing could be a risk factor for AUD. Herein, we used a Mendelian randomization (MR)-based approach to identify skipped exons (the predominant splicing event in brain) that contribute to AUD risk. Genotypes and RNA-seq data from the CommonMind Consortium were used as the training dataset to develop predictive models linking individual genotypes to exon skipping in the prefrontal cortex. We applied these models to data from the Collaborative Studies on Genetics of Alcoholism to examine the association between the imputed cis-regulated splicing outcome and the AUD-related traits. We identified 27 exon skipping events that were predicted to affect AUD risk; six of these were replicated in the Australian Twin-family Study of Alcohol Use Disorder. Their host genes are DRC1, ELOVL7, LINC00665, NSUN4, SRRM2 and TBC1D5. The genes downstream of these splicing events are enriched in neuroimmune pathways. The MR-inferred impacts of the ELOVL7 skipped exon on AUD risk was further supported in four additional large-scale genome-wide association studies. Additionally, this exon contributed to changes of gray matter volumes in multiple brain regions, including the visual cortex known to be involved in AUD. In conclusion, this study provides strong evidence that RNA alternative splicing impacts the susceptibility to AUD and adds new information on AUD-relevant genes and pathways. Our framework is also applicable to other types of splicing events and to other complex genetic disorders.