- Browse by Author
Browsing by Author "Wang, Cankun"
Now showing 1 - 5 of 5
Results Per Page
Sort Options
Item Author Correction: scGNN is a novel graph neural network framework for single-cell RNA-Seq analyses(Springer Nature, 2022-05-04) Wang, Juexin; Ma, Anjun; Chang, Yuzhou; Gong, Jianting; Jiang, Yuexu; Qi, Ren; Wang, Cankun; Fu, Hongjun; Ma, Qin; Xu, Dong; Biomedical Engineering and Informatics, Luddy School of Informatics, Computing, and EngineeringCorrection to: Nature Communications 10.1038/s41467-021-22197-x, published online 25 March 2021. In Figure 2, panels (a) and (b) were inadvertently swapped. The correct version of this figure appears below. This has been corrected in the HTML and PDF version of this article.Item IRIS3: integrated cell-type-specific regulon inference server from single-cell RNA-Seq(Oxford, 2020-05-18) Ma, Anjun; Wang, Cankun; Chang, Yuzhou; Brennan, Faith H; McDermaid, Adam; Liu, Bingqiang; Zhang, Chi; Popovich, Phillip G; Ma, Qin; Medical and Molecular Genetics, School of Medicinegroup of genes controlled as a unit, usually by the same repressor or activator gene, is known as a regulon. The ability to identify active regulons within a specific cell type, i.e., cell-type-specific regulons (CTSR), provides an extraordinary opportunity to pinpoint crucial regulators and target genes responsible for complex diseases. However, the identification of CTSRs from single-cell RNA-Seq (scRNA-Seq) data is computationally challenging. We introduce IRIS3, the first-of-its-kind web server for CTSR inference from scRNA-Seq data for human and mouse. IRIS3 is an easy-to-use server empowered by over 20 functionalities to support comprehensive interpretations and graphical visualizations of identified CTSRs. CTSR data can be used to reliably characterize and distinguish the corresponding cell type from others and can be combined with other computational or experimental analyses for biomedical studies. CTSRs can, therefore, aid in the discovery of major regulatory mechanisms and allow reliable constructions of global transcriptional regulation networks encoded in a specific cell type. The broader impact of IRIS3 includes, but is not limited to, investigation of complex diseases hierarchies and heterogeneity, causal gene regulatory network construction, and drug development.Item Prediction of regulatory motifs from human Chip-sequencing data using a deep learning framework(Oxford University Press, 2019-09-05) Yang, Jinyu; Ma, Anjun; Hoppe, Adam D.; Wang, Cankun; Li, Yang; Zhang, Chi; Wang, Yan; Liu, Bingqiang; Ma, Qin; Medical and Molecular Genetics, School of MedicineThe identification of transcription factor binding sites and cis-regulatory motifs is a frontier whereupon the rules governing protein-DNA binding are being revealed. Here, we developed a new method (DEep Sequence and Shape mOtif or DESSO) for cis-regulatory motif prediction using deep neural networks and the binomial distribution model. DESSO outperformed existing tools, including DeepBind, in predicting motifs in 690 human ENCODE ChIP-sequencing datasets. Furthermore, the deep-learning framework of DESSO expanded motif discovery beyond the state-of-the-art by allowing the identification of known and new protein-protein-DNA tethering interactions in human transcription factors (TFs). Specifically, 61 putative tethering interactions were identified among the 100 TFs expressed in the K562 cell line. In this work, the power of DESSO was further expanded by integrating the detection of DNA shape features. We found that shape information has strong predictive power for TF-DNA binding and provides new putative shape motif information for human TFs. Thus, DESSO improves in the identification and structural analysis of TF binding sites, by integrating the complexities of DNA binding into a deep-learning framework.Item QUBIC2: a novel and robust biclustering algorithm for analyses and interpretation of large-scale RNA-Seq data(Oxford, 2020) Xie, Juan; Ma, Anjun; Zhang, Yu; Liu, Bingqiang; Cao, Sha; Wang, Cankun; Xu, Jennifer; Zhang, Chi; Ma, Qin; Medical and Molecular Genetics, School of MedicineMotivation The biclustering of large-scale gene expression data holds promising potential for detecting condition-specific functional gene modules (i.e. biclusters). However, existing methods do not adequately address a comprehensive detection of all significant bicluster structures and have limited power when applied to expression data generated by RNA-Sequencing (RNA-Seq), especially single-cell RNA-Seq (scRNA-Seq) data, where massive zero and low expression values are observed. Results We present a new biclustering algorithm, QUalitative BIClustering algorithm Version 2 (QUBIC2), which is empowered by: (i) a novel left-truncated mixture of Gaussian model for an accurate assessment of multimodality in zero-enriched expression data, (ii) a fast and efficient dropouts-saving expansion strategy for functional gene modules optimization using information divergency and (iii) a rigorous statistical test for the significance of all the identified biclusters in any organism, including those without substantial functional annotations. QUBIC2 demonstrated considerably improved performance in detecting biclusters compared to other five widely used algorithms on various benchmark datasets from E.coli, Human and simulated data. QUBIC2 also showcased robust and superior performance on gene expression data generated by microarray, bulk RNA-Seq and scRNA-Seq.Item scGNN is a novel graph neural network framework for single-cell RNA-Seq analyses(Springer Nature, 2021-03-25) Wang, Juexin; Ma, Anjun; Chang, Yuzhou; Gong, Jianting; Jiang, Yuexu; Qi, Ren; Wang, Cankun; Fu, Hongjun; Ma, Qin; Xu, Dong; Biomedical Engineering and Informatics, Luddy School of Informatics, Computing, and EngineeringSingle-cell RNA-sequencing (scRNA-Seq) is widely used to reveal the heterogeneity and dynamics of tissues, organisms, and complex diseases, but its analyses still suffer from multiple grand challenges, including the sequencing sparsity and complex differential patterns in gene expression. We introduce the scGNN (single-cell graph neural network) to provide a hypothesis-free deep learning framework for scRNA-Seq analyses. This framework formulates and aggregates cell-cell relationships with graph neural networks and models heterogeneous gene expression patterns using a left-truncated mixture Gaussian model. scGNN integrates three iterative multi-modal autoencoders and outperforms existing tools for gene imputation and cell clustering on four benchmark scRNA-Seq datasets. In an Alzheimer's disease study with 13,214 single nuclei from postmortem brain tissues, scGNN successfully illustrated disease-related neural development and the differential mechanism. scGNN provides an effective representation of gene expression and cell-cell relationships. It is also a powerful framework that can be applied to general scRNA-Seq analyses.