- Browse by Author
Browsing by Author "Wang, Yadong"
Now showing 1 - 10 of 10
Results Per Page
Sort Options
Item Characterizing the roles of long non-coding RNA in rat alcohol preference(IEEE, 2016-12) Zhou, Ao; Wang, Yadong; Liu, Yunlong; Feng, Weixing; Edenberg, Howard J.; Medical and Molecular Genetics, School of MedicineAlcohol is one of the major threats to health in United States. With the emerging of next-generation sequencing technology, the association between alcohol preference and the variants and expression of genes has been investigated. However, the roles of long non-coding RNAs (lncRNA) in alcohol preference remains unclear. In this study, we identified 37 novel lncRNAs that differentially expressed across alcohol preferring (P) and non-preferring (NP) rats. The functional study on these lncRNAs demonstrates that they are associated with gene regulation, as well as neural functions. This suggests that these lncRNAs may contribute to the alcohol preference behaviors.Item Identification of transcription factor and microRNA binding sites in responsible to fetal alcohol syndrome(BioMed Central, 2008-03-20) Wang, Guohua; Wang, Xin; Wang, Yadong; Yang, Jack Y.; Li, Lang; Nephew, Kenneth P.; Edenberg, Howard J.; Zhou, Feng C.; Liu, Yunlong; Medicine, School of MedicineThis is a first report, using our MotifModeler informatics program, to simultaneously identify transcription factor (TF) and microRNA (miRNA) binding sites from gene expression microarray data. Based on the assumption that gene expression is controlled by combinatorial effects of transcription factors binding in the 5'-upstream regulatory region and miRNAs binding in the 3'-untranslated region (3'-UTR), we developed a model for (1) predicting the most influential cis-acting elements under a given biological condition, and (2) estimating the effects of those elements on gene expression levels. The regulatory regions, TF and miRNA, which mediate the differential genes expression in fetal alcohol syndrome were unknown; microarray data from alcohol exposure paradigm was used. The model predicted strong inhibitory effects of 5' cis-acting elements and stimulatory effects of 3'-UTR under alcohol treatment. Current predictive model derived a key hypothesis for the first time a novel role of miRNAs in gene expression changes associated with abnormal mouse embryo development after alcohol exposure. This suggests that disturbance of miRNA functions may contribute to the alcohol-induced developmental deficiencies.Item Improving alignment accuracy on homopolymer regions for semiconductor-based sequencing technologies(BioMed Central, 2016-08-22) Feng, Weixing; Zhao, Sen; Xue, Dingkai; Song, Fengfei; Li, Ziwei; Chao, Duojiao; He, Bo; Hao, Yangyang; Wang, Yadong; Liu, Yunlong; Department of Medical and Molecular Genetics, IU School of MedicineBACKGROUND: Ion Torrent and Ion Proton are semiconductor-based sequencing technologies that feature rapid sequencing speed and low upfront and operating costs, thanks to the avoidance of modified nucleotides and optical measurements. Despite of these advantages, however, Ion semiconductor sequencing technologies suffer much reduced sequencing accuracy at the genomic loci with homopolymer repeats of the same nucleotide. Such limitation significantly reduces its efficiency for the biological applications aiming at accurately identifying various genetic variants. RESULTS: In this study, we propose a Bayesian inference-based method that takes the advantage of the signal distributions of the electrical voltages that are measured for all the homopolymers of a fixed length. By cross-referencing the length of homopolymers in the reference genome and the voltage signal distribution derived from the experiment, the proposed integrated model significantly improves the alignment accuracy around the homopolymer regions. CONCLUSIONS: Besides improving alignment accuracy on homopolymer regions for semiconductor-based sequencing technologies with the proposed model, similar strategies can also be used on other high-throughput sequencing technologies that share similar limitations.Item MeDReaders: a database for transcription factors that bind to methylated DNA(Oxford Academic, 2018-01-04) Wang, Guohua; Luo, Ximei; Wang, Jianan; Wan, Jun; Xia, Shuli; Zhu, Heng; Qian, Jiang; Wang, Yadong; Medical and Molecular Genetics, School of MedicineUnderstanding the molecular principles governing interactions between transcription factors (TFs) and DNA targets is one of the main subjects for transcriptional regulation. Recently, emerging evidence demonstrated that some TFs could bind to DNA motifs containing highly methylated CpGs both in vitro and in vivo. Identification of such TFs and elucidation of their physiological roles now become an important stepping-stone toward understanding the mechanisms underlying the methylation-mediated biological processes, which have crucial implications for human disease and disease development. Hence, we constructed a database, named as MeDReaders, to collect information about methylated DNA binding activities. A total of 731 TFs, which could bind to methylated DNA sequences, were manually curated in human and mouse studies reported in the literature. In silico approaches were applied to predict methylated and unmethylated motifs of 292 TFs by integrating whole genome bisulfite sequencing (WGBS) and ChIP-Seq datasets in six human cell lines and one mouse cell line extracted from ENCODE and GEO database. MeDReaders database will provide a comprehensive resource for further studies and aid related experiment designs. The database implemented unified access for users to most TFs involved in such methylation-associated binding actives. The website is available at http://medreader.org/.Item miR2Disease: a manually curated database for microRNA deregulation in human disease(Oxford Academic, 2008-10-15) Jiang, Qinghua; Wang, Yadong; Hao, Yangyang; Juan, Liran; Teng, Mingxiang; Zhang, Xinjun; Li, Meimei; Wang, Guohua; Liu, Yunlong; Medicine, School of Medicine‘miR2Disease’, a manually curated database, aims at providing a comprehensive resource of microRNA deregulation in various human diseases. The current version of miR2Disease documents 1939 curated relationships between 299 human microRNAs and 94 human diseases by reviewing more than 600 published papers. Around one-seventh of the microRNA–disease relationships represent the pathogenic roles of deregulated microRNA in human disease. Each entry in the miR2Disease contains detailed information on a microRNA–disease relationship, including a microRNA ID, the disease name, a brief description of the microRNA–disease relationship, an expression pattern of the microRNA, the detection method for microRNA expression, experimentally verified target gene(s) of the microRNA and a literature reference. miR2Disease provides a user-friendly interface for a convenient retrieval of each entry by microRNA ID, disease name, or target gene. In addition, miR2Disease offers a submission page that allows researchers to submit established microRNA–disease relationships that are not documented. Once approved by the submission review committee, the submitted records will be included in the database. miR2Disease is freely available at http://www.miR2Disease.org.Item A modulated empirical Bayes model for identifying topological and temporal estrogen receptor α regulatory networks in breast cancer.(BioMed Central, 2011-05-09) Shen, Changyu; Huang, Yiwen; Liu, Yunlong; Wang, Guohua; Zhao, Yuming; Wang, Zhiping; Teng, Mingxiang; Wang, Yadong; Flockhart, David A.; Skaar, Todd C.; Yan, Pearlly; Nephew, Kenneth P.; Huang, Tim Hm; Li, LangBACKGROUND: Estrogens regulate diverse physiological processes in various tissues through genomic and non-genomic mechanisms that result in activation or repression of gene expression. Transcription regulation upon estrogen stimulation is a critical biological process underlying the onset and progress of the majority of breast cancer. Dynamic gene expression changes have been shown to characterize the breast cancer cell response to estrogens, the every molecular mechanism of which is still not well understood. RESULTS: We developed a modulated empirical Bayes model, and constructed a novel topological and temporal transcription factor (TF) regulatory network in MCF7 breast cancer cell line upon stimulation by 17β-estradiol stimulation. In the network, significant TF genomic hubs were identified including ER-alpha and AP-1; significant non-genomic hubs include ZFP161, TFDP1, NRF1, TFAP2A, EGR1, E2F1, and PITX2. Although the early and late networks were distinct (<5% overlap of ERα target genes between the 4 and 24 h time points), all nine hubs were significantly represented in both networks. In MCF7 cells with acquired resistance to tamoxifen, the ERα regulatory network was unresponsive to 17β-estradiol stimulation. The significant loss of hormone responsiveness was associated with marked epigenomic changes, including hyper- or hypo-methylation of promoter CpG islands and repressive histone methylations. CONCLUSIONS: We identified a number of estrogen regulated target genes and established estrogen-regulated network that distinguishes the genomic and non-genomic actions of estrogen receptor. Many gene targets of this network were not active anymore in anti-estrogen resistant cell lines, possibly because their DNA methylation and histone acetylation patterns have changed.Item regSNPs-splicing: a tool for prioritizing synonymous single-nucleotide substitution(Springer, 2017) Zhang, Xinjun; Li, Meng; Lin, Hai; Rao, Xi; Feng, Weixing; Yang, Yuedong; Mort, Matthew; Cooper, David N.; Wang, Yue; Wang, Yadong; Wells, Clark; Zhou, Yaoqi; Liu, Yunlong; Department of Medical & Molecular Genetics, IU School of MedicineWhile synonymous single-nucleotide variants (sSNVs) have largely been unstudied, since they do not alter protein sequence, mounting evidence suggests that they may affect RNA conformation, splicing, and the stability of nascent-mRNAs to promote various diseases. Accurately prioritizing deleterious sSNVs from a pool of neutral ones can significantly improve our ability of selecting functional genetic variants identified from various genome-sequencing projects, and, therefore, advance our understanding of disease etiology. In this study, we develop a computational algorithm to prioritize sSNVs based on their impact on mRNA splicing and protein function. In addition to genomic features that potentially affect splicing regulation, our proposed algorithm also includes dozens structural features that characterize the functions of alternatively spliced exons on protein function. Our systematical evaluation on thousands of sSNVs suggests that several structural features, including intrinsic disorder protein scores, solvent accessible surface areas, protein secondary structures, and known and predicted protein family domains, show significant differences between disease-causing and neutral sSNVs. Our result suggests that the protein structure features offer an added dimension of information while distinguishing disease-causing and neutral synonymous variants. The inclusion of structural features increases the predictive accuracy for functional sSNV prioritization.Item RNA Polymerase II Binding Patterns Reveal Genomic Regions Involved in MicroRNA Gene Regulation(Public Library of Science, 2010-11-02) Wang, Guohua; Wang, Yadong; Shen, Changyu; Huangn, Yi-wen; Huang, Kun; Huang, Tim H. M.; Nephew, Kenneth P.; Li, Lang; Liu, Yunlong; Medicine, School of MedicineMicroRNAs are small non-coding RNAs involved in post-transcriptional regulation of gene expression. Due to the poor annotation of primary microRNA (pri-microRNA) transcripts, the precise location of promoter regions driving expression of many microRNA genes is enigmatic. This deficiency hinders our understanding of microRNA-mediated regulatory networks. In this study, we develop a computational approach to identify the promoter region and transcription start site (TSS) of pri-microRNAs actively transcribed using genome-wide RNA Polymerase II (RPol II) binding patterns derived from ChIP-seq data. Based upon the assumption that the distribution of RPol II binding patterns around the TSS of microRNA and protein coding genes are similar, we designed a statistical model to mimic RPol II binding patterns around the TSS of highly expressed, well-annotated promoter regions of protein coding genes. We used this model to systematically scan the regions upstream of all intergenic microRNAs for RPol II binding patterns similar to those of TSS from protein coding genes. We validated our findings by examining the conservation, CpG content, and activating histone marks in the identified promoter regions. We applied our model to assess changes in microRNA transcription in steroid hormone-treated breast cancer cells. The results demonstrate many microRNA genes have lost hormone-dependent regulation in tamoxifen-resistant breast cancer cells. MicroRNA promoter identification based upon RPol II binding patterns provides important temporal and spatial measurements regarding the initiation of transcription, and therefore allows comparison of transcription activities between different conditions, such as normal and disease states.Item Signal Transducers and Activators of Transcription-1 (STAT1) Regulates microRNA Transcription in Interferon γ-Stimulated HeLa Cells(Public Library of Science, 2010-07-26) Wang, Guohua; Wang, Yadong; Teng, Mingxiang; Zhang, Denan; Li, Lang; Liu, Yunlong; Medical and Molecular Genetics, School of MedicineBackground Constructing and modeling the gene regulatory network is one of the central themes of systems biology. With the growing understanding of the mechanism of microRNA biogenesis and its biological function, establishing a microRNA-mediated gene regulatory network is not only desirable but also achievable. Methodology In this study, we propose a bioinformatics strategy to construct the microRNA-mediated regulatory network using genome-wide binding patterns of transcription factor(s) and RNA polymerase II (RPol II), derived using chromatin immunoprecipitation following next generation sequencing (ChIP-seq) technology. Our strategy includes three key steps, identification of transcription start sites and promoter regions of primary microRNA transcripts using RPol II binding patterns, selection of cooperating transcription factors that collaboratively function with the transcription factors targeted by ChIP-seq assay, and construction of the network that contains regulatory cascades of both transcription factors and microRNAs. Principal Findings Using CAMDA (Critical Assessment of Massive Data Analysis) 2009 data set that includes ChIP-seq data on RPol II and STAT1 (signal transducers and activators of transcription 1) in HeLa S3 cells in control condition and with interferon γ stimulation, we first identified promoter regions of 83 microRNAs in HeLa cells. We then identified two potential STAT1 collaborating factors, AP-1 and C/EBP (CCAAT enhancer-binding proteins), and further established eight feedback network elements that may regulate cellular response during interferon γ stimulation. Conclusions This study offers a bioinformatics strategy to provide testable hypotheses on the mechanisms of microRNA-mediated transcriptional regulation, based upon genome-wide protein-DNA interaction data derived from ChIP-seq experiments.Item Understanding Transcription Factor Regulation by Integrating Gene Expression and DNase I Hypersensitive Sites(Hindawi, 2015-09-03) Wang, Guohua; Wang, Fang; Huang, Qian; Li, Yu; Liu, Yunlong; Wang, Yadong; Medical and Molecular Genetics, School of MedicineTranscription factors are proteins that bind to DNA sequences to regulate gene transcription. The transcription factor binding sites are short DNA sequences (5-20 bp long) specifically bound by one or more transcription factors. The identification of transcription factor binding sites and prediction of their function continue to be challenging problems in computational biology. In this study, by integrating the DNase I hypersensitive sites with known position weight matrices in the TRANSFAC database, the transcription factor binding sites in gene regulatory region are identified. Based on the global gene expression patterns in cervical cancer HeLaS3 cell and HelaS3-ifnα4h cell (interferon treatment on HeLaS3 cell for 4 hours), we present a model-based computational approach to predict a set of transcription factors that potentially cause such differential gene expression. Significantly, 6 out 10 predicted functional factors, including IRF, IRF-2, IRF-9, IRF-1 and IRF-3, ICSBP, belong to interferon regulatory factor family and upregulate the gene expression levels responding to the interferon treatment. Another factor, ISGF-3, is also a transcriptional activator induced by interferon alpha. Using the different transcription factor binding sites selected criteria, the prediction result of our model is consistent. Our model demonstrated the potential to computationally identify the functional transcription factors in gene regulation.