Computational Methods for Proteoform Identification and Characterization Using Top-Down Mass Spectrometry

dc.contributor.advisorYan, Jingwen
dc.contributor.authorChen, Wenrong
dc.contributor.otherWang, Juexin
dc.contributor.otherWan, Jun
dc.contributor.otherZang, Yong
dc.contributor.otherLuo, Xiao
dc.contributor.otherLiu, Xiaowen
dc.date.accessioned2024-01-10T12:27:42Z
dc.date.available2024-01-10T12:27:42Z
dc.date.issued2023-12
dc.degree.date2023
dc.degree.discipline
dc.degree.grantorIndiana University
dc.degree.levelPh.D.
dc.descriptionIndiana University-Purdue University Indianapolis (IUPUI)
dc.description.abstractProteoforms, distinct molecular forms of proteins, arise due to numerous factors such as genetic mutations, differential gene expression, alternative splicing, and a range of biological processes. These proteoforms are often characterized by primary structural variances such as amino acid substitutions, terminal truncations, and post-translational modifications (PTMs). Proteoforms from the same proteins can manifest varied functional behaviors based on the specific alterations. The complexity inherent to proteoforms has elevated the significance of top-down mass spectrometry (MS) due to its proficiency in providing intricate sequence information for these intact proteoforms. During a typical top-down MS experiment, intact proteoforms are separated through platforms like liquid chromatography (LC) or capillary zone electrophoresis (CZE) prior to tandem mass spectrometry (MS/MS) analysis. Despite advancements in instruments and protocols for top-down MS, computational challenges persist, with software tool development still in its early stage. In this dissertation, our research revolves around three primary goals, all aimed at refining proteoform characterization. First, we bridge RNA-Seq with top-down MS for a better proteoform identification. We propose TopPG, an innovative proteogenomic tool which is tailored to generate proteoform sequence databases from genetic and splicing variations explicitly for top-down MS in contrast to traditional approaches. Second, to boost the accuracy of proteoform detection, we utilize machine learning methods to predict proteoform retention and migration times in top-down MS, an area previously overshadowed by bottom-up MS paradigms. critically evaluating models in a realm traditionally dominated by bottom-up MS methodologies. Lastly, recognizing the indispensable role of post-translational modifications (PTMs) on cellular functions, we introduce PTM-TBA. This tool integrates the complementary strengths of both top-down and bottom-up MS, augmented with annotations, building a comprehensive strategy for precise PTM identification and localization.
dc.identifier.urihttps://hdl.handle.net/1805/37921
dc.language.isoen_US
dc.subjectMass spectrometry
dc.subjectPost-translational modification
dc.subjectProteoform
dc.subjectTop-down proteomics
dc.titleComputational Methods for Proteoform Identification and Characterization Using Top-Down Mass Spectrometry
dc.typeDissertation
Files
Original bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
Chen_iupui_0104D_10730.pdf
Size:
6.07 MB
Format:
Adobe Portable Document Format
License bundle
Now showing 1 - 1 of 1
No Thumbnail Available
Name:
license.txt
Size:
1.99 KB
Format:
Item-specific license agreed upon to submission
Description: