- Browse by Subject
Browsing by Subject "Computational biology"
Now showing 1 - 10 of 26
Results Per Page
Sort Options
Item A distinct symptom pattern emerges for COVID-19 long-haul: a nationwide study(Springer Nature, 2022-09-23) Pinto, Melissa D.; Downs, Charles A.; Huang, Yong; El‑Azab, Sarah A.; Ramrakhiani, Nathan S.; Barisano, Anthony; Yu, Lu; Taylor, Kaitlyn; Esperanca, Alvaro; Abrahim, Heather L.; Hughes, Thomas; Giraldo Herrera, Maria; Rahamani, Amir M.; Dutt, Nikil; Chakraborty, Rana; Mendiola, Christian; Lambert, Natalie; Biostatistics, School of Public HealthLong-haul COVID-19, also called post-acute sequelae of SARS-CoV-2 (PASC), is a new illness caused by SARS-CoV-2 infection and characterized by the persistence of symptoms. The purpose of this cross-sectional study was to identify a distinct and significant temporal pattern of PASC symptoms (symptom type and onset) among a nationwide sample of PASC survivors (n = 5652). The sample was randomly sorted into two independent samples for exploratory (EFA) and confirmatory factor analyses (CFA). Five factors emerged from the EFA: (1) cold and flu-like symptoms, (2) change in smell and/or taste, (3) dyspnea and chest pain, (4) cognitive and visual problems, and (5) cardiac symptoms. The CFA had excellent model fit (x2 = 513.721, df = 207, p < 0.01, TLI = 0.952, CFI = 0.964, RMSEA = 0.024). These findings demonstrate a novel symptom pattern for PASC. These findings can enable nurses in the identification of at-risk patients and facilitate early, systematic symptom management strategies for PASC.Item A Fur family protein BosR is a novel RNA-binding protein that controls rpoS RNA stability in the Lyme disease pathogen(Oxford University Press, 2024) Raghunandanan, Sajith; Priya, Raj; Alanazi, Fuad; Lybecker, Meghan C.; Schlax, Paula Jean; Yang, X. Frank; Microbiology and Immunology, School of Medicine2´-O-methylation (Nm) is one of the most abundant modifications found in both mRNAs and noncoding RNAs. It contributes to many biological processes, such as the normal functioning of tRNA, the protection of mRNA against degradation by the decapping and exoribonuclease (DXO) protein, and the biogenesis and specificity of rRNA. Recent advancements in single-molecule sequencing techniques for long read RNA sequencing data offered by Oxford Nanopore technologies have enabled the direct detection of RNA modifications from sequencing data. In this study, we propose a bio-computational framework, Nm-Nano, for predicting the presence of Nm sites in direct RNA sequencing data generated from two human cell lines. The Nm-Nano framework integrates two supervised machine learning (ML) models for predicting Nm sites: Extreme Gradient Boosting (XGBoost) and Random Forest (RF) with K-mer embedding. Evaluation on benchmark datasets from direct RNA sequecing of HeLa and HEK293 cell lines, demonstrates high accuracy (99% with XGBoost and 92% with RF) in identifying Nm sites. Deploying Nm-Nano on HeLa and HEK293 cell lines reveals genes that are frequently modified with Nm. In HeLa cell lines, 125 genes are identified as frequently Nm-modified, showing enrichment in 30 ontologies related to immune response and cellular processes. In HEK293 cell lines, 61 genes are identified as frequently Nm-modified, with enrichment in processes like glycolysis and protein localization. These findings underscore the diverse regulatory roles of Nm modifications in metabolic pathways, protein degradation, and cellular processes. The source code of Nm-Nano can be freely accessed at https://github.com/Janga-Lab/Nm-Nano.Item Adult skin fibroblast state change in murine wound healing(Springer Nature, 2023-01-17) Gharbia, Fatma Z.; Abouhashem, Ahmed S.; Moqidem, Yomna A.; Elbaz, Ahmed A.; Abdellatif, Ahmed; Singh, Kanhaiya; Sen, Chandan K.; Azzazy, Hassan M. E.; Surgery, School of MedicineWound healing is a well-organized dynamic process involving coordinated consecutive phases: homeostasis, inflammation, proliferation and resolution. Fibroblasts play major roles in skin wound healing such as in wound contraction and release of growth factors which are of importance in angiogenesis and tissue remodeling. Abnormal fibroblast phenotypes have been identified in patients with chronic wounds. In this work, we analyzed scRNA-seq datasets of normal and wounded skin from mice at day 4 post-wound to investigate fibroblast heterogeneity during the proliferative phase of wound healing. Compositional analysis revealed a specific subset of fibroblast (cluster 3) that primarily increased in wounded skin (14%) compared to normal skin (3.9%). This subset was characterized by a gene signature marked by the plasma membrane proteins Sfrp2 + Sfrp4 + Sfrp1 + and the transcription factors Ebf1 + Prrx1 + Maged1 + . Differential gene expression and enrichment analysis identified epithelial to mesenchymal transition (EMT) and angiogenesis to be upregulated in the emerging subset of fibroblasts of the wounded skin. Using two other datasets for murine wounded skin confirmed the increase in cluster 3-like fibroblasts at days 2, 7 and 14 post-wounding with a peak at day 7. By performing a similarity check between the differential gene expression profile between wounded and normal skin for this emerging fibroblast subset with drug signature from the ConnectivityMap database, we identified drugs capable of mimicking the observed gene expression change in fibroblasts during wound healing. TTNPB, verteprofin and nicotinic acid were identified as candidate drugs capable of inducing fibroblast gene expression profile necessary for wound healing. On the other hand, methocarbamol, ifosfamide and penbutolol were recognized to antagonize the identified fibroblast differential expression profile during wound healing which might cause delay in wound healing. Taken together, analysis of murine transcriptomic skin wound healing datasets suggested a subset of fibroblasts capable of inducing EMT and further inferred drugs that might be tested as potential candidates to induce wound closure.Item AIscEA: unsupervised integration of single-cell gene expression and chromatin accessibility via their biological consistency(Oxford University Press, 2022) Jafari, Elham; Johnson, Travis; Wang, Yue; Liu, Yunlong; Huang, Kun; Wang, Yijie; Biostatistics and Health Data Science, School of MedicineMotivation: The integrative analysis of single-cell gene expression and chromatin accessibility measurements is essential for revealing gene regulation, but it is one of the key challenges in computational biology. Gene expression and chromatin accessibility are measurements from different modalities, and no common features can be directly used to guide integration. Current state-of-the-art methods lack practical solutions for finding heterogeneous clusters. However, previous methods might not generate reliable results when cluster heterogeneity exists. More importantly, current methods lack an effective way to select hyper-parameters under an unsupervised setting. Therefore, applying computational methods to integrate single-cell gene expression and chromatin accessibility measurements remains difficult. Results: We introduce AIscEA-Alignment-based Integration of single-cell gene Expression and chromatin Accessibility-a computational method that integrates single-cell gene expression and chromatin accessibility measurements using their biological consistency. AIscEA first defines a ranked similarity score to quantify the biological consistency between cell clusters across measurements. AIscEA then uses the ranked similarity score and a novel permutation test to identify cluster alignment across measurements. AIscEA further utilizes graph alignment for the aligned cell clusters to align the cells across measurements. We compared AIscEA with the competing methods on several benchmark datasets and demonstrated that AIscEA is highly robust to the choice of hyper-parameters and can better handle the cluster heterogeneity problem. Furthermore, AIscEA significantly outperforms the state-of-the-art methods when integrating real-world SNARE-seq and scMultiome-seq datasets in terms of integration accuracy. Availability and implementation: AIscEA is available at https://figshare.com/articles/software/AIscEA_zip/21291135 on FigShare as well as {https://github.com/elhaam/AIscEA} onGitHub.Item Best practices to evaluate the impact of biomedical research software-metric collection beyond citations(Oxford University Press, 2024) Afiaz, Awan; Ivanov, Andrey A.; Chamberlin, John; Hanauer, David; Savonen, Candace L.; Goldman, Mary J.; Morgan, Martin; Reich, Michael; Getka, Alexander; Holmes, Aaron; Pati, Sarthak; Knight, Dan; Boutros, Paul C.; Bakas, Spyridon; Caporaso, J. Gregory; Del Fiol, Guilherme; Hochheiser, Harry; Haas, Brian; Schloss, Patrick D.; Eddy, James A.; Albrecht, Jake; Fedorov, Andrey; Waldron, Levi; Hoffman, Ava M.; Bradshaw, Richard L.; Leek, Jeffrey T.; Wright, Carrie; Pathology and Laboratory Medicine, School of MedicineMotivation: Software is vital for the advancement of biology and medicine. Impact evaluations of scientific software have primarily emphasized traditional citation metrics of associated papers, despite these metrics inadequately capturing the dynamic picture of impact and despite challenges with improper citation. Results: To understand how software developers evaluate their tools, we conducted a survey of participants in the Informatics Technology for Cancer Research (ITCR) program funded by the National Cancer Institute (NCI). We found that although developers realize the value of more extensive metric collection, they find a lack of funding and time hindering. We also investigated software among this community for how often infrastructure that supports more nontraditional metrics were implemented and how this impacted rates of papers describing usage of the software. We found that infrastructure such as social media presence, more in-depth documentation, the presence of software health metrics, and clear information on how to contact developers seemed to be associated with increased mention rates. Analysing more diverse metrics can enable developers to better understand user engagement, justify continued funding, identify novel use cases, pinpoint improvement areas, and ultimately amplify their software's impact. Challenges are associated, including distorted or misleading metrics, as well as ethical and security concerns. More attention to nuances involved in capturing impact across the spectrum of biomedical software is needed. For funders and developers, we outline guidance based on experience from our community. By considering how we evaluate software, we can empower developers to create tools that more effectively accelerate biological and medical research progress. Availability and implementation: More information about the analysis, as well as access to data and code is available at https://github.com/fhdsl/ITCR_Metrics_manuscript_website.Item Bioinformatics analysis of the potentially functional circRNA-miRNA-mRNA network in breast cancer(Public Library of Science, 2024-04-18) Erdogan, Cihat; Suer, Ilknur; Kaya, Murat; Ozturk, Sukru; Aydin, Nizamettin; Kurt, Zeyneb; Medical and Molecular Genetics, School of MedicineBreast cancer (BC) is the most common cancer among women with high morbidity and mortality. Therefore, new research is still needed for biomarker detection. GSE101124 and GSE182471 datasets were obtained from the Gene Expression Omnibus (GEO) database to evaluate differentially expressed circular RNAs (circRNAs). The Cancer Genome Atlas (TCGA) and Molecular Taxonomy of Breast Cancer International Consortium (METABRIC) databases were used to identify the significantly dysregulated microRNAs (miRNAs) and genes considering the Prediction Analysis of Microarray classification (PAM50). The circRNA-miRNA-mRNA relationship was investigated using the Cancer-Specific CircRNA, miRDB, miRTarBase, and miRWalk databases. The circRNA-miRNA-mRNA regulatory network was annotated using Gene Ontology (GO) analysis and Kyoto Encyclopedia of Genes and Genomes (KEGG) pathway database. The protein-protein interaction network was constructed by the STRING database and visualized by the Cytoscape tool. Then, raw miRNA data and genes were filtered using some selection criteria according to a specific expression level in PAM50 subgroups. A bottleneck method was utilized to obtain highly interacted hub genes using cytoHubba Cytoscape plugin. The Disease-Free Survival and Overall Survival analysis were performed for these hub genes, which are detected within the miRNA and circRNA axis in our study. We identified three circRNAs, three miRNAs, and eighteen candidate target genes that may play an important role in BC. In addition, it has been determined that these molecules can be useful in the classification of BC, especially in determining the basal-like breast cancer (BLBC) subtype. We conclude that hsa_circ_0000515/miR-486-5p/SDC1 axis may be an important biomarker candidate in distinguishing patients in the BLBC subgroup of BC.Item circMeta: a unified computational framework for genomic feature annotation and differential expression analysis of circular RNAs(Oxford University Press, 2020-01-15) Chen, Li; Wang, Feng; Bruggeman, Emily C.; Li, Chao; Yao, Bing; Medicine, School of MedicineMotivation: Circular RNAs (circRNAs), a class of non-coding RNAs generated from non-canonical back-splicing events, have emerged to play key roles in many biological processes. Though numerous tools have been developed to detect circRNAs from rRNA-depleted RNA-seq data based on back-splicing junction-spanning reads, computational tools to identify critical genomic features regulating circRNA biogenesis are still lacking. In addition, rigorous statistical methods to perform differential expression (DE) analysis of circRNAs remain under-developed. Results: We present circMeta, a unified computational framework for circRNA analyses. circMeta has three primary functional modules: (i) a pipeline for comprehensive genomic feature annotation related to circRNA biogenesis, including length of introns flanking circularized exons, repetitive elements such as Alu elements and SINEs, competition score for forming circulation and RNA editing in back-splicing flanking introns; (ii) a two-stage DE approach of circRNAs based on circular junction reads to quantitatively compare circRNA levels and (iii) a Bayesian hierarchical model for DE analysis of circRNAs based on the ratio of circular reads to linear reads in back-splicing sites to study spatial and temporal regulation of circRNA production. Both proposed DE methods without and with considering host genes outperform existing methods by obtaining better control of false discovery rate and comparable statistical power. Moreover, the identified DE circRNAs by the proposed two-stage DE approach display potential biological functions in Gene Ontology and circRNA-miRNA-mRNA networks that are not able to be detected using existing mRNA DE methods. Furthermore, top DE circRNAs have been further validated by RT-qPCR using divergent primers spanning back-splicing junctions.Item Computational Pathology Fusing Spatial Technologies(Wolters Kluwer, 2023) Border, Samuel; Lucarelli, Nicholas; Eadon, Michael T.; El-Achkar, Tarek M.; Jain, Sanjay; Sarder, Pinaki; Medicine, School of MedicineItem Computational protein design: assessment and applications(2015) Li, Zhixiu; Zhou, YaoqiComputational protein design aims at designing amino acid sequences that can fold into a target structure and perform a desired function. Many computational design methods have been developed and their applications have been successful during past two decades. However, the success rate of protein design remains too low to be of a useful tool by biochemists whom are not an expert of computational biology. In this dissertation, we first developed novel computational assessment techniques to assess several state-of-the-art computational techniques. We found that significant progresses were made in several important measures by two new scoring functions from RosettaDesign and from OSCAR-design, respectively. We also developed the first machine-learning technique called SPIN that predicts a sequence profile compatible to a given structure with a novel nonlocal energy-based feature. The accuracy of predicted sequences is comparable to RosettaDesign in term of sequence identity to wild type sequences. In the last two application chapters, we have designed self-inhibitory peptides of Escherichia coli methionine aminopeptidase (EcMetAP) and de novo designed barstar. Several peptides were confirmed inhibition of EcMetAP at the micromole-range 50% inhibitory concentration. Meanwhile, the assessment of designed barstar sequences indicates the improvement of OSCAR-design over RosettaDesign.Item A Computational Study of the Mechanism for F1-ATPase Inhibition by the Epsilon Subunit(2013) Thomson, Karen J.; Pu, Jingzhi; Ge, Haibo; Sardar, Rajesh; Long, Eric C. (Eric Charles)The multi-protein complex of F0F1 ATP synthase has been of great interest in the fields of microbiology and biochemistry, due to the ubiquitous use of ATP as a biological energy source. Efforts to better understand this complex have been made through structural determination of segments based on NMR and crystallographic data. Some experiments have provided useful data, while others have brought up more questions, especially when structures and functions are compared between bacteria and species with chloroplasts or mitochondria. The epsilon subunit is thought to play a signi cant role in the regulation of ATP synthesis and hydrolysis, yet the exact pathway is unknown due to the experimental difficulty in obtaining data along the transition pathway. Given starting and end point protein crystal structures, the transition pathway of the epsilon subunit was examined through computer simulation.The purpose of this investigation is to determine the likelihood of one such proposed mechanism for the involvement of the epsilon subunit in ATP regulation in bacterial species such as E. coli.
- «
- 1 (current)
- 2
- 3
- »