- Browse by Subject
Browsing by Subject "G-quadruplex"
Now showing 1 - 2 of 2
Results Per Page
Sort Options
Item A Comprehensive Survey and Deep Learning-Based Prediction on G-quadruplex Formation and Biological Functions(2022-09) Fang, Shuyi; Wan, Jun; Liu, Yunlong; Yan, Jingwen; Zhang, JieThe G-quadruplexes (G4s) are guanine-rich four-stranded DNA/RNA structures, which have been found throughout the human genome. G4s have been reported to affect chromatin structure and are involved in important biological processes at transcriptional and epigenetic levels. However, the underlying molecular mechanisms and locating of G4 still remain elusive due to the complexity of G4s. Taking advantage of the development of high-throughput sequencing technologies and machine learning approaches, we constructed this comprehensive investigation on G4 structures, including discovery of a novel marker for functional human hematopoietic stem cells and gained interest in G4 structure, exploring association between G4 and genomic factors by incorporating multi-omics data, and development of a deep-learningbased G4 prediction tool with G4 motif. First, we discovered ADGRG1 as a novel marker for functional human hematopoietic stem cells and its regulation through transcription activities. Our interest in G4s was stimulated while the transcription-related investigations. Next, we analyzed the genome-wide distribution properties of G4s and uncovered the associations of G4 with other epigenetic and transcriptional mechanisms to coordinate gene transcription. We explored that different-confidence G4 groups correlated differently with epigenetic regulatory elements and revealed that G4 structures could correlate with gene expression in two opposite ways depending on their locations and forming strands. Some transcription factors were identified to be over-represented with G4 emergence. We found distinct consensus sequences enriched in the G4 feet, with a high GC content in the feet of high-confidence G4s and a high TA content in solely predicted G4 feet. As for the last part, we developed a novel deep-learning-based prediction tool for DNA G4s with G4 motifs. Considering the classical G4 motif, we applied bi-directional LSTM model with attention method, which captures sequential information, and showed good performance in whole-genome level prediction of DNA G4s with the certified G4 pattern. Our comprehensive work investigated G4 with its functions and predictions and provided a better understanding of G4s on multi-omics level and computational information capture riding the wave of deep learning.Item Decoding regulatory associations of G-quadruplex with epigenetic and transcriptomic functional components(Frontiers Media, 2022-08-25) Fang, Shuyi; Liu, Sheng; Yang, Danzhou; Yang, Lei; Hu, Chang-Deng; Wan, Jun; BioHealth Informatics, School of Informatics and ComputingG-quadruplex (G4) has been previously observed to be associated with gene expression. In this study, we performed integrative analysis on G4 multi-omics data from in-silicon prediction and ChIP-seq in human genome. Potential G4 sites were classified into three distinguished groups, such as one group of high-confidence G4-forming locations (G4-II) and groups only containing either ChIP-seq detected G4s (G4-I) or predicted G4 motif candidates (G4-III). We explored the associations of different-confidence G4 groups with other epigenetic regulatory elements, including CpG islands, chromatin status, enhancers, super-enhancers, G4 locations compared to the genes, and DNA methylation. Our elastic net regression model revealed that G4 structures could correlate with gene expression in two opposite ways depending on their locations to the genes as well as G4-forming DNA strand. Some transcription factors were identified to be over-represented with G4 emergence. The motif analysis discovered distinct consensus sequences enriched in the G4 feet, the flanking regions of two groups of G4s. We found high GC content in the feet of high-confidence G4s (G4-II) when compared to high TA content in solely predicted G4 feet of G4-III. Overall, we uncovered the comprehensive associations of G4 formations or predictions with other epigenetic and transcriptional elements which potentially coordinate gene transcription.