- Browse by Author
Browsing by Author "Ma, Anjun"
Now showing 1 - 9 of 9
Results Per Page
Sort Options
Item Author Correction: scGNN is a novel graph neural network framework for single-cell RNA-Seq analyses(Springer Nature, 2022-05-04) Wang, Juexin; Ma, Anjun; Chang, Yuzhou; Gong, Jianting; Jiang, Yuexu; Qi, Ren; Wang, Cankun; Fu, Hongjun; Ma, Qin; Xu, Dong; Biomedical Engineering and Informatics, Luddy School of Informatics, Computing, and EngineeringCorrection to: Nature Communications 10.1038/s41467-021-22197-x, published online 25 March 2021. In Figure 2, panels (a) and (b) were inadvertently swapped. The correct version of this figure appears below. This has been corrected in the HTML and PDF version of this article.Item Deep learning analysis of single‐cell data in empowering clinical implementation(Wiley, 2022) Ma, Anjun; Wang, Juexin; Xu, Dong; Ma, Qin; Medical and Molecular Genetics, School of MedicineItem Define and visualize pathological architectures of human tissues from spatially resolved transcriptomics using deep learning(Elsevier, 2022-08-24) Chang, Yuzhou; He, Fei; Wang, Juexin; Chen, Shuo; Li, Jingyi; Liu, Jixin; Yu, Yang; Su, Li; Ma, Anjun; Allen, Carter; Lin, Yu; Sun, Shaoli; Liu, Bingqiang; Otero, José Javier; Chung, Dongjun; Fu, Hongjun; Li, Zihai; Xu, Dong; Ma, Qin; Medical and Molecular Genetics, School of MedicineSpatially resolved transcriptomics provides a new way to define spatial contexts and understand the pathogenesis of complex human diseases. Although some computational frameworks can characterize spatial context via various clustering methods, the detailed spatial architectures and functional zonation often cannot be revealed and localized due to the limited capacities of associating spatial information. We present RESEPT, a deep-learning framework for characterizing and visualizing tissue architecture from spatially resolved transcriptomics. Given inputs such as gene expression or RNA velocity, RESEPT learns a three-dimensional embedding with a spatial retained graph neural network from spatial transcriptomics. The embedding is then visualized by mapping into color channels in an RGB image and segmented with a supervised convolutional neural network model. Based on a benchmark of 10x Genomics Visium spatial transcriptomics datasets on the human and mouse cortex, RESEPT infers and visualizes the tissue architecture accurately. It is noteworthy that, for the in-house AD samples, RESEPT can localize cortex layers and cell types based on pre-defined region- or cell-type-enriched genes and furthermore provide critical insights into the identification of amyloid-beta plaques in Alzheimer's disease. Interestingly, in a glioblastoma sample analysis, RESEPT distinguishes tumor-enriched, non-tumor, and regions of neuropil with infiltrating tumor cells in support of clinical and prognostic cancer applications.Item IRIS3: integrated cell-type-specific regulon inference server from single-cell RNA-Seq(Oxford, 2020-05-18) Ma, Anjun; Wang, Cankun; Chang, Yuzhou; Brennan, Faith H; McDermaid, Adam; Liu, Bingqiang; Zhang, Chi; Popovich, Phillip G; Ma, Qin; Medical and Molecular Genetics, School of Medicinegroup of genes controlled as a unit, usually by the same repressor or activator gene, is known as a regulon. The ability to identify active regulons within a specific cell type, i.e., cell-type-specific regulons (CTSR), provides an extraordinary opportunity to pinpoint crucial regulators and target genes responsible for complex diseases. However, the identification of CTSRs from single-cell RNA-Seq (scRNA-Seq) data is computationally challenging. We introduce IRIS3, the first-of-its-kind web server for CTSR inference from scRNA-Seq data for human and mouse. IRIS3 is an easy-to-use server empowered by over 20 functionalities to support comprehensive interpretations and graphical visualizations of identified CTSRs. CTSR data can be used to reliably characterize and distinguish the corresponding cell type from others and can be combined with other computational or experimental analyses for biomedical studies. CTSRs can, therefore, aid in the discovery of major regulatory mechanisms and allow reliable constructions of global transcriptional regulation networks encoded in a specific cell type. The broader impact of IRIS3 includes, but is not limited to, investigation of complex diseases hierarchies and heterogeneity, causal gene regulatory network construction, and drug development.Item Prediction of regulatory motifs from human Chip-sequencing data using a deep learning framework(Oxford University Press, 2019-09-05) Yang, Jinyu; Ma, Anjun; Hoppe, Adam D.; Wang, Cankun; Li, Yang; Zhang, Chi; Wang, Yan; Liu, Bingqiang; Ma, Qin; Medical and Molecular Genetics, School of MedicineThe identification of transcription factor binding sites and cis-regulatory motifs is a frontier whereupon the rules governing protein-DNA binding are being revealed. Here, we developed a new method (DEep Sequence and Shape mOtif or DESSO) for cis-regulatory motif prediction using deep neural networks and the binomial distribution model. DESSO outperformed existing tools, including DeepBind, in predicting motifs in 690 human ENCODE ChIP-sequencing datasets. Furthermore, the deep-learning framework of DESSO expanded motif discovery beyond the state-of-the-art by allowing the identification of known and new protein-protein-DNA tethering interactions in human transcription factors (TFs). Specifically, 61 putative tethering interactions were identified among the 100 TFs expressed in the K562 cell line. In this work, the power of DESSO was further expanded by integrating the detection of DNA shape features. We found that shape information has strong predictive power for TF-DNA binding and provides new putative shape motif information for human TFs. Thus, DESSO improves in the identification and structural analysis of TF binding sites, by integrating the complexities of DNA binding into a deep-learning framework.Item QUBIC2: a novel and robust biclustering algorithm for analyses and interpretation of large-scale RNA-Seq data(Oxford, 2020) Xie, Juan; Ma, Anjun; Zhang, Yu; Liu, Bingqiang; Cao, Sha; Wang, Cankun; Xu, Jennifer; Zhang, Chi; Ma, Qin; Medical and Molecular Genetics, School of MedicineMotivation The biclustering of large-scale gene expression data holds promising potential for detecting condition-specific functional gene modules (i.e. biclusters). However, existing methods do not adequately address a comprehensive detection of all significant bicluster structures and have limited power when applied to expression data generated by RNA-Sequencing (RNA-Seq), especially single-cell RNA-Seq (scRNA-Seq) data, where massive zero and low expression values are observed. Results We present a new biclustering algorithm, QUalitative BIClustering algorithm Version 2 (QUBIC2), which is empowered by: (i) a novel left-truncated mixture of Gaussian model for an accurate assessment of multimodality in zero-enriched expression data, (ii) a fast and efficient dropouts-saving expansion strategy for functional gene modules optimization using information divergency and (iii) a rigorous statistical test for the significance of all the identified biclusters in any organism, including those without substantial functional annotations. QUBIC2 demonstrated considerably improved performance in detecting biclusters compared to other five widely used algorithms on various benchmark datasets from E.coli, Human and simulated data. QUBIC2 also showcased robust and superior performance on gene expression data generated by microarray, bulk RNA-Seq and scRNA-Seq.Item scGNN 2.0: a graph neural network tool for imputation and clustering of single-cell RNA-Seq data(Oxford University Press, 2022) Gu, Haocheng; Cheng, Hao; Ma, Anjun; Li, Yang; Wang, Juexin; Xu, Dong; Ma, Qin; Biostatistics and Health Data Science, Richard M. Fairbanks School of Public HealthMotivation: Gene expression imputation has been an essential step of the single-cell RNA-Seq data analysis workflow. Among several deep-learning methods, the debut of scGNN gained substantial recognition in 2021 for its superior performance and the ability to produce a cell-cell graph. However, the implementation of scGNN was relatively time-consuming and its performance could still be optimized. Results: The implementation of scGNN 2.0 is significantly faster than scGNN thanks to a simplified close-loop architecture. For all eight datasets, cell clustering performance was increased by 85.02% on average in terms of adjusted rand index, and the imputation Median L1 Error was reduced by 67.94% on average. With the built-in visualizations, users can quickly assess the imputation and cell clustering results, compare against benchmarks and interpret the cell-cell interaction. The expanded input and output formats also pave the way for custom workflows that integrate scGNN 2.0 with other scRNA-Seq toolkits on both Python and R platforms. Availability and implementation: scGNN 2.0 is implemented in Python (as of version 3.8) with the source code available at https://github.com/OSU-BMBL/scGNN2.0.Item scGNN is a novel graph neural network framework for single-cell RNA-Seq analyses(Springer Nature, 2021-03-25) Wang, Juexin; Ma, Anjun; Chang, Yuzhou; Gong, Jianting; Jiang, Yuexu; Qi, Ren; Wang, Cankun; Fu, Hongjun; Ma, Qin; Xu, Dong; Biomedical Engineering and Informatics, Luddy School of Informatics, Computing, and EngineeringSingle-cell RNA-sequencing (scRNA-Seq) is widely used to reveal the heterogeneity and dynamics of tissues, organisms, and complex diseases, but its analyses still suffer from multiple grand challenges, including the sequencing sparsity and complex differential patterns in gene expression. We introduce the scGNN (single-cell graph neural network) to provide a hypothesis-free deep learning framework for scRNA-Seq analyses. This framework formulates and aggregates cell-cell relationships with graph neural networks and models heterogeneous gene expression patterns using a left-truncated mixture Gaussian model. scGNN integrates three iterative multi-modal autoencoders and outperforms existing tools for gene imputation and cell clustering on four benchmark scRNA-Seq datasets. In an Alzheimer's disease study with 13,214 single nuclei from postmortem brain tissues, scGNN successfully illustrated disease-related neural development and the differential mechanism. scGNN provides an effective representation of gene expression and cell-cell relationships. It is also a powerful framework that can be applied to general scRNA-Seq analyses.Item TrimNN: Characterizing cellular community motifs for studying multicellular topological organization in complex tissues(Research Square, 2025-01-17) Yu, Yang; Wang, Shuang; Li, Jinpu; Yu, Meichen; McCrocklin, Kyle; Kang, Jing-Qiong; Ma, Anjun; Ma, Qin; Xu, Dong; Wang, Juexin; Radiology and Imaging Sciences, School of MedicineThe spatial arrangement of cells plays a pivotal role in shaping tissue functions in various biological systems and diseased microenvironments. However, it is still under-investigated of the topological coordinating rules among different cell types as tissue spatial patterns. Here, we introduce the Triangulation cellular community motif Neural Network (TrimNN), a bottom-up approach to estimate the prevalence of sizeable conservative cell organization patterns as Cellular Community (CC) motifs in spatial transcriptomics and proteomics. Different from clustering cell type composition from classical top-down analysis, TrimNN differentiates cellular niches as countable topological blocks in recurring interconnections of various types, representing multicellular neighborhoods with interpretability and generalizability. This graph-based deep learning framework adopts inductive bias in CCs and uses a semi-divide and conquer approach in the triangulated space. In spatial omics studies, various sizes of CC motifs identified by TrimNN robustly reveal relations between spatially distributed cell-type patterns and diverse phenotypical biological functions.