Development of Data-driven and AI-powered Systems Biology Methods for Understanding Human Disease
Date
Authors
Language
Embargo Lift Date
Department
Committee Chair
Committee Members
Degree
Degree Year
Department
Grantor
Journal Title
Journal ISSN
Volume Title
Found At
Abstract
Systems biology dynamic models, which are based on differential equations, offer a flexible and accurate framework to explain physiological properties emerging from complex biochem- ical or biological systems. These models enable explicit quantification and interpretation, allowing for simulation and perturbation analysis to study biological features and their inter- actions, as well as understanding system progression and convergence under various initial conditions. However, their application in human disease systems is limited due to unknown kinetics parameters under disease conditions and a reductionist paradigm that fails to cap- ture the complexity of diseases. Meanwhile, the advent of omics technologies provides high- resolution molecular measurements from single cells and spatially resolved samples, as well as comprehensive disease-specific molecular signatures from large patient cohorts. This wealth of data holds the promise for characterizing complex biological systems, necessitating ad- vanced systems biology models and computational tools that can harness multi-omics data to reliably depict biological processes. However, this endeavor faces the challenge of nonlinear relationships between omics data and the system’s dynamic properties, such as the global or local low-rank gene expression patterns across cell types and the nonlinear complexities within transcriptional regulatory networks revealed by single-cell RNA sequencing. The overall goal of this report is to develop new computational frameworks, AI-empowered methods, and related mathematical theories to explicitly represent and approximate the dy- namics of complex biological systems by using biological omics data. Our aim is to unravel the intricacies of context-specific dynamic systems using multi-Omics data. Specifically, we solved two different but related computational tasks and enabled the first-of-its-kind methods to (1) identify local low-rank matrices from large omics data, and (2) a robust optimization strategy to approximate metabolic flux. Subsequently, we delve into the realm of data-driven and AI-powered systems biology, harnessing the power of statistical learning and artificial intelligence to approximate differential equations or their representations. This research en- deavor not only contributes to the advancement of subspace modeling but also offers insights into a wide array of complex phenomena across diverse domains, with profound implications for computational biology and beyond.