- Browse by Author
Browsing by Author "Wang, Juexin"
Now showing 1 - 10 of 27
Results Per Page
Sort Options
Item A novel Synthetic phenotype association study approach reveals the landscape of association for genomic variants and phenotypes(Elsevier, 2022) Škrabišová, Mária; Dietz, Nicholas; Zeng, Shuai; Chan, Yen On; Wang, Juexin; Liu, Yang; Biová, Jana; Joshi, Trupti; Bilyeu, Kristin D.; Medical and Molecular Genetics, School of MedicineIntroduction: Genome-Wide Association Studies (GWAS) identify tagging variants in the genome that are statistically associated with the phenotype because of their linkage disequilibrium (LD) relationship with the causative mutation (CM). When both low-density genotyped accession panels with phenotypes and resequenced data accession panels are available, tagging variants can assist with post-GWAS challenges in CM discovery. Objectives: Our objective was to identify additional GWAS evaluation criteria to assess correspondence between genomic variants and phenotypes, as well as enable deeper analysis of the localized landscape of association. Methods: We used genomic variant positions as Synthetic phenotypes in GWAS that we named "Synthetic phenotype association study" (SPAS). The extreme case of SPAS is what we call an "Inverse GWAS" where we used CM positions of cloned soybean genes. We developed and validated the Accuracy concept as a measure of the correspondence between variant positions and phenotypes. Results: The SPAS approach demonstrated that the genotype status of an associated variant used as a Synthetic phenotype enabled us to explore the relationships between tagging variants and CMs, and further, that utilizing CMs as Synthetic phenotypes in Inverse GWAS illuminated the landscape of association. We implemented the Accuracy calculation for a curated accession panel to an online Accuracy calculation tool (AccuTool) as a resource for gene identification in soybean. We demonstrated our concepts on three examples of soybean cloned genes. As a result of our findings, we devised an enhanced "GWAS to Genes" analysis (Synthetic phenotype to CM strategy, SP2CM). Using SP2CM, we identified a CM for a novel gene. Conclusion: The SP2CM strategy utilizing Synthetic phenotypes and the Accuracy calculation of correspondence provides crucial information to assist researchers in CM discovery. The impact of this work is a more effective evaluation of landscapes of GWAS associations.Item Astrocytic GABA transporter 1 deficit in novel SLC6A1 variants mediated epilepsy: Connected from protein destabilization to seizures in mice and humans(Elsevier, 2022) Mermer, Felicia; Poliquin, Sarah; Zhou, Shuizhen; Wang, Xiaodong; Ding, Yifeng; Yin, Fei; Shen, Wangzhen; Wang, Juexin; Rigsby, Kathryn; Xu, Dong; Mack, Taralynn; Nwosu, Gerald; Flamm, Carson; Stein, Matthew; Kang, Jing-Qiong; Biomedical Engineering and Informatics, Luddy School of Informatics, Computing, and EngineeringObjective: Mutations in γ-aminobutyric acid (GABA) transporter 1 (GAT-1)-encoding SLC6A1 have been associated with myoclonic atonic epilepsy and other phenotypes. We determined the patho-mechanisms of the mutant GAT-1, in order to identify treatment targets. Methods: We conducted whole-exome sequencing of patients with myoclonic atonic epilepsy (MAE) and characterized the seizure phenotypes and EEG patterns. We studied the protein stability and structural changes with homology modeling and machine learning tools. We characterized the function and trafficking of the mutant GAT-1 with 3H radioactive GABA uptake assay and confocal microscopy. We utilized different models including a knockin mouse and human astrocytes derived from induced pluripotent stem cells (iPSCs). We focused on astrocytes because of their direct impact of astrocytic GAT-1 in seizures. Results: We identified four novel SLC6A1 variants associated with MAE and 2 to 4 Hz spike-wave discharges as a common EEG feature. Machine learning tools predicted that the variant proteins are destabilized. The variant protein had reduced expression and reduced GABA uptake due to endoplasmic reticular retention. The consistent observation was made in cortical and thalamic astrocytes from variant-knockin mice and human iPSC-derived astrocytes. The Slc6a+/A288V mouse, representative of MAE, had increased 5-7 Hz spike-wave discharges and absence seizures. Interpretation: SLC6A1 variants in various locations of the protein peptides can cause MAE with similar seizure phenotypes and EEG features. Reduced GABA uptake is due to decreased functional GAT-1, which, in thalamic astrocytes, could result in increased extracellular GABA accumulation and enhanced tonic inhibition, leading to seizures and abnormal EEGs.Item Author Correction: scGNN is a novel graph neural network framework for single-cell RNA-Seq analyses(Springer Nature, 2022-05-04) Wang, Juexin; Ma, Anjun; Chang, Yuzhou; Gong, Jianting; Jiang, Yuexu; Qi, Ren; Wang, Cankun; Fu, Hongjun; Ma, Qin; Xu, Dong; Biomedical Engineering and Informatics, Luddy School of Informatics, Computing, and EngineeringCorrection to: Nature Communications 10.1038/s41467-021-22197-x, published online 25 March 2021. In Figure 2, panels (a) and (b) were inadvertently swapped. The correct version of this figure appears below. This has been corrected in the HTML and PDF version of this article.Item Bile reflux alters the profile of the gastric mucosa microbiota(Frontiers Media, 2022-09-09) Huang, Gang; Wang, Sui; Wang, Juexin; Tian, Lin; Yu, Yanbo; Zuo, Xiuli; Li, Yanqing; Medical and Molecular Genetics, School of MedicineBackground: Bile reflux can cause inflammation, gastric mucosa atrophy, and diseases such as stomach cancer. Alkaline bile flowing back into the stomach affects the intragastric environment and can alter the gastric bacterial community. We sought to identify the characteristics of the stomach mucosal microbiota in patients with bile reflux. Methods: Gastric mucosal samples were collected from 52 and 40 chronic gastritis patients with and without bile reflux, respectively. The bacterial profile was determined using 16S rRNA gene analysis. Results: In the absence of H. pylori infection, the richness (based on the Sobs and Chao1 indices; P <0.05) and diversity (based on Shannon indices; P <0.05) of gastric mucosa microbiota were higher in patients with bile reflux patients than in those without. There was a marked difference in the microbiota structure between patients with and without bile reflux (ANOSIM, R=0.058, P=0.011). While the genera, Comamonas, Halomonas, Bradymonas, Pseudomonas, Marinobacter, Arthrobacter, and Shewanella were enriched in patients with bile reflux, the genera, Haemophilus, Porphyromonas, and Subdoligranulum, were enriched in those without bile reflux. Conclusion: Our results demonstrate that bile reflux significantly alters the composition of the gastric microbiota.Item Celltyper: A Single-Cell Sequencing Marker Gene Tool Suite(2023-05) Paisley, Brianna Meadow; Liu, Yunlong; Yan, Jingwen; Cao, Sha; Wang, Juexin; Carfagna, MarkSingle-cell RNA-sequencing (scRNA-seq) has enabled researchers to study interindividual cellular heterogeneity, to explore disease impact on cellular composition of tissue, and to identify novel cell subtypes. However, a major challenge in scRNA-seq analysis is to identify the cell type of individual cells. Accurate cell type identification is crucial for any scRNA-seq analysis to be valid as incorrect cell type assignment will reduce statistical robustness and may lead to incorrect biological conclusions. Therefore, accurate and comprehensive cell type assignment is necessary for reliable biological insights into scRNA-seq datasets. With over 200 distinct cell types in humans alone, the concept of cell identity is large. Even within the same cell type there exists heterogeneity due to cell cycle phase, cell state, cell subtypes, cell health and the tissue microenvironment. This makes cell type classification a complicated biological problem requiring bioinformatics. One approach to classify cell type identity is using marker genes. Marker genes are genes specific for one or a few cell types. When coupled with bioinformatic methods, marker genes show promise of improving cell type classification. However, current scRNA-seq classification methods and databases use marker genes that are non-specific across sources, samples, and/or species leading to bias and errors. Furthermore, many existing tools require manual intervention by the user to provide training datasets or the expected number and name of cell types, which can introduce selection bias. The selection bias negatively impacts the accuracy of cell type classification methods as the model cannot extrapolate outside of the user inputs even when it is biologically meaningful to do so. In this dissertation I developed CellTypeR, a suite of tools to explore the biology governing cell identity in a “normal” state for humans and mice. The work presented here accomplishes three aims: 1. Develop an ontology standardized database of published marker gene literature; 2. Develop and apply a marker gene classification algorithm; and 3. Create user interface and input data structure for scRNA-seq cell type prediction.Item Computational Methods for Proteoform Identification and Characterization Using Top-Down Mass Spectrometry(2023-12) Chen, Wenrong; Yan, Jingwen; Wang, Juexin; Wan, Jun; Zang, Yong; Luo, Xiao; Liu, XiaowenProteoforms, distinct molecular forms of proteins, arise due to numerous factors such as genetic mutations, differential gene expression, alternative splicing, and a range of biological processes. These proteoforms are often characterized by primary structural variances such as amino acid substitutions, terminal truncations, and post-translational modifications (PTMs). Proteoforms from the same proteins can manifest varied functional behaviors based on the specific alterations. The complexity inherent to proteoforms has elevated the significance of top-down mass spectrometry (MS) due to its proficiency in providing intricate sequence information for these intact proteoforms. During a typical top-down MS experiment, intact proteoforms are separated through platforms like liquid chromatography (LC) or capillary zone electrophoresis (CZE) prior to tandem mass spectrometry (MS/MS) analysis. Despite advancements in instruments and protocols for top-down MS, computational challenges persist, with software tool development still in its early stage. In this dissertation, our research revolves around three primary goals, all aimed at refining proteoform characterization. First, we bridge RNA-Seq with top-down MS for a better proteoform identification. We propose TopPG, an innovative proteogenomic tool which is tailored to generate proteoform sequence databases from genetic and splicing variations explicitly for top-down MS in contrast to traditional approaches. Second, to boost the accuracy of proteoform detection, we utilize machine learning methods to predict proteoform retention and migration times in top-down MS, an area previously overshadowed by bottom-up MS paradigms. critically evaluating models in a realm traditionally dominated by bottom-up MS methodologies. Lastly, recognizing the indispensable role of post-translational modifications (PTMs) on cellular functions, we introduce PTM-TBA. This tool integrates the complementary strengths of both top-down and bottom-up MS, augmented with annotations, building a comprehensive strategy for precise PTM identification and localization.Item Coxiella burnetii Virulent Phase I and Avirulent Phase II Variants Differentially Manipulate Autophagy Pathway in Neutrophils(American Society for Microbiology, 2022) Kumaresan, Venkatesh; Wang, Juexin; Zhang, Wendy; Zhang, Yan; Xu, Dong; Zhang, Guoquan; Medical and Molecular Genetics, School of MedicineCoxiella burnetii is an obligate intracellular Gram-negative bacterium that causes Q fever in humans. The virulent C. burnetii Nine Mile phase I (NMI) strain causes disease in animal models, while the avirulent NM phase II (NMII) strain does not. In this study, we found that NMI infection induces severe splenomegaly and bacterial burden in the spleen in BALB/c mice, while NMII infection does not. A significantly higher number of CD11b+ Ly6G+ neutrophils accumulated in the liver, lung, and spleen of NMI-infected mice than in NMII-infected mice. Thus, neutrophil accumulation correlates with NMI and NMII infection-induced inflammatory responses. In vitro studies also demonstrated that although NMII exhibited a higher infection rate than NMI in mouse bone marrow neutrophils (BMNs), NMI-infected BMNs survived longer than NMII-infected BMNs. These results suggest that the differential interactions of NMI and NMII with neutrophils may be related to their ability to cause disease in animals. To understand the molecular mechanism underlying the differential interactions of NMI and NMII with neutrophils, global transcriptomic gene expressions were compared between NMI- and NMII-infected BMNs by RNA sequencing (RNA-seq) analysis. Interestingly, several genes involved in autophagy-related pathways, particularly membrane trafficking and lipid metabolism, are upregulated in NMII-infected BMNs but downregulated in NMI-infected BMNs. Immunofluorescence and immunoblot analyses indicate that compared to NMI-infected BMNs, vacuoles in NMII-infected-BMNs exhibit increased autophagic flux along with phosphatidylserine translocation in the cell membrane. Similar to neutrophils, NMII activated LC3-mediated autophagy in human macrophages. These findings suggest that the differential manipulation of autophagy of NMI and NMII may relate to their pathogenesis.Item CrossMP: Enabling Cross-Modality Translation between Single-Cell RNA-Seq and Single-Cell ATAC-Seq through Web-Based Portal(MDPI, 2024-07-05) Lyu, Zhen; Dahal, Sabin; Zeng, Shuai; Wang, Juexin; Xu, Dong; Joshi, Trupti; Biomedical Engineering and Informatics, Luddy School of Informatics, Computing, and EngineeringIn recent years, there has been a growing interest in profiling multiomic modalities within individual cells simultaneously. One such example is integrating combined single-cell RNA sequencing (scRNA-seq) data and single-cell transposase-accessible chromatin sequencing (scATAC-seq) data. Integrated analysis of diverse modalities has helped researchers make more accurate predictions and gain a more comprehensive understanding than with single-modality analysis. However, generating such multimodal data is technically challenging and expensive, leading to limited availability of single-cell co-assay data. Here, we propose a model for cross-modal prediction between the transcriptome and chromatin profiles in single cells. Our model is based on a deep neural network architecture that learns the latent representations from the source modality and then predicts the target modality. It demonstrates reliable performance in accurately translating between these modalities across multiple paired human scATAC-seq and scRNA-seq datasets. Additionally, we developed CrossMP, a web-based portal allowing researchers to upload their single-cell modality data through an interactive web interface and predict the other type of modality data, using high-performance computing resources plugged at the backend.Item Deep learning analysis of single‐cell data in empowering clinical implementation(Wiley, 2022) Ma, Anjun; Wang, Juexin; Xu, Dong; Ma, Qin; Medical and Molecular Genetics, School of MedicineItem Define and visualize pathological architectures of human tissues from spatially resolved transcriptomics using deep learning(Elsevier, 2022-08-24) Chang, Yuzhou; He, Fei; Wang, Juexin; Chen, Shuo; Li, Jingyi; Liu, Jixin; Yu, Yang; Su, Li; Ma, Anjun; Allen, Carter; Lin, Yu; Sun, Shaoli; Liu, Bingqiang; Otero, José Javier; Chung, Dongjun; Fu, Hongjun; Li, Zihai; Xu, Dong; Ma, Qin; Medical and Molecular Genetics, School of MedicineSpatially resolved transcriptomics provides a new way to define spatial contexts and understand the pathogenesis of complex human diseases. Although some computational frameworks can characterize spatial context via various clustering methods, the detailed spatial architectures and functional zonation often cannot be revealed and localized due to the limited capacities of associating spatial information. We present RESEPT, a deep-learning framework for characterizing and visualizing tissue architecture from spatially resolved transcriptomics. Given inputs such as gene expression or RNA velocity, RESEPT learns a three-dimensional embedding with a spatial retained graph neural network from spatial transcriptomics. The embedding is then visualized by mapping into color channels in an RGB image and segmented with a supervised convolutional neural network model. Based on a benchmark of 10x Genomics Visium spatial transcriptomics datasets on the human and mouse cortex, RESEPT infers and visualizes the tissue architecture accurately. It is noteworthy that, for the in-house AD samples, RESEPT can localize cortex layers and cell types based on pre-defined region- or cell-type-enriched genes and furthermore provide critical insights into the identification of amyloid-beta plaques in Alzheimer's disease. Interestingly, in a glioblastoma sample analysis, RESEPT distinguishes tumor-enriched, non-tumor, and regions of neuropil with infiltrating tumor cells in support of clinical and prognostic cancer applications.
- «
- 1 (current)
- 2
- 3
- »