- Browse by Author
Browsing by Author "Basharat, Abdul Rehman"
Now showing 1 - 6 of 6
Results Per Page
Sort Options
Item Capillary zone electrophoresis-tandem mass spectrometry with activated ion electron transfer dissociation for large-scale top-down proteomics(Springer, 2019-12) McCool, Elijah N.; Basharat, Abdul Rehman; Liu, Xiaowen; Coon, Joshua J.; Sun, Liangliang; BioHealth Informatics, School of Informatics and ComputingCapillary zone electrophoresis (CZE)-tandem mass spectrometry (MS/MS) has been recognized as an efficient approach for top-down proteomics recently for its high-capacity separation and highly sensitive detection of proteoforms. However, the commonly used collision-based dissociation methods often cannot provide extensive fragmentation of proteoforms for thorough characterization. Activated ion electron transfer dissociation (AI-ETD), that combines infrared photoactivation concurrent with ETD, has shown better performance for proteoform fragmentation than higher energy-collisional dissociation (HCD) and standard ETD. Here, we present the first application of CZE-AI-ETD on an Orbitrap Fusion Lumos mass spectrometer for large-scale top-down proteomics of Escherichia coli (E. coli) cells. CZE-AI-ETD outperformed CZE-ETD regarding proteoform and protein identifications (IDs). CZE-AI-ETD reached comparable proteoform and protein IDs with CZE-HCD. CZE-AI-ETD tended to generate better expectation values (E values) of proteoforms than CZE-HCD and CZE-ETD, indicating a higher quality of MS/MS spectra from AI-ETD respecting the number of sequence-informative fragment ions generated. CZE-AI-ETD showed great reproducibility regarding the proteoform and protein IDs with relative standard deviations less than 4% and 2% (n = 3). Coupling size exclusion chromatography (SEC) to CZE-AI-ETD identified 3028 proteoforms and 387 proteins from E. coli cells with 1% spectrum level and 5% proteoform-level false discovery rates. The data represents the largest top-down proteomics dataset using the AI-ETD method so far. Single-shot CZE-AI-ETD of one SEC fraction identified 957 proteoforms and 253 proteins. N-terminal truncations, signal peptide cleavage, N-terminal methionine removal, and various post-translational modifications including protein N-terminal acetylation, methylation, S-thiolation, disulfide bonds, and lysine succinylation were detected.Item EnvCNN: A Convolutional Neural Network Model for Evaluating Isotopic Envelopes in Top-Down Mass-Spectral Deconvolution(ACS, 2020-06) Basharat, Abdul Rehman; Ning, Xia; Liu, Xiaowen; BioHealth Informatics, School of Informatics and ComputingTop-down mass spectrometry has become the main method for intact proteoform identification, characterization, and quantitation. Because of the complexity of top-down mass spectrometry data, spectral deconvolution is an indispensable step in spectral data analysis, which groups spectral peaks into isotopic envelopes and extracts monoisotopic masses of precursor or fragment ions. The performance of spectral deconvolution methods relies heavily on their scoring functions, which distinguish correct envelopes from incorrect ones. A good scoring function increases the accuracy of deconvoluted masses reported from mass spectra. In this paper, we present EnvCNN, a convolutional neural network-based model for evaluating isotopic envelopes. We show that the model outperforms other scoring functions in distinguishing correct envelopes from incorrect ones and that it increases the number of identifications and improves the statistical significance of identifications in top-down spectral interpretation.Item Large-Scale Qualitative and Quantitative Top-Down Proteomics Using Capillary Zone Electrophoresis-Electrospray Ionization-Tandem Mass Spectrometry with Nanograms of Proteome Samples(Springer, 2019-04-09) Lubeckyj, Rachele A.; Basharat, Abdul Rehman; Shen, Xiaojing; Liu, Xiaowen; Sun, Liangliang; BioHealth Informatics, School of Informatics and ComputingCapillary zone electrophoresis-electrospray ionization-tandem mass spectrometry (CZE-ESI-MS/MS) has attracted attention recently for top-down proteomics because it can achieve highly efficient separation and very sensitive detection of proteins. However, separation window and sample loading volume of CZE need to be boosted for a better proteome coverage using CZE-MS/MS. Here, we present an improved CZE-MS/MS system that achieved a 180-min separation window and a 2-μL sample loading volume for top-down characterization of protein mixtures. The system obtained highly efficient separation of proteins with nearly one million theoretical plates for myoglobin and enabled baseline separation of three different proteoforms of myoglobin. The CZE-MS/MS system identified 797±21 proteoforms and 258±7 proteins (n=2) from an Escherichia coli (E. coli) proteome sample in a single run with only 250 ng of proteins injected. The system still identified 449±40 proteoforms and 173±6 proteins (n=2) from the E. coli sample when only 25 ng of proteins were injected per run. Single-shot CZE-MS/MS analyses of zebrafish brain cerebellum (Cb) and optic tectum (Teo) regions identified 1 730±196 proteoforms (n=3) and 2 024±255 proteoforms (n=3), respectively, with only 500-ng proteins loaded per run. Label-free quantitative top-down proteomics of zebrafish brain Cb and Teo regions revealed significant differences between Cb and Teo regarding the proteoform abundance. Over 700 proteoforms from 131 proteins had significantly higher abundance in Cb compared to Teo, and these proteins were highly enriched in several biological processes, including muscle contraction, glycolytic process, and mesenchyme migration.Item Quantitative proteomics reveals the dynamic proteome landscape of zebrafish embryos during the maternal-to-zygotic transition(Elsevier, 2024-05-08) Fang, Fei; Chen, Daoyang; Basharat, Abdul Rehman; Poulos, William; Wang, Qianyi; Cibelli, Jose B.; Liu, Xiaowen; Sun, Liangliang; Biomedical Engineering and Informatics, Luddy School of Informatics, Computing, and EngineeringMaternal-to-zygotic transition (MZT) is central to early embryogenesis. However, its underlying molecular mechanisms are still not well described. Here, we revealed the expression dynamics of 5,000 proteins across four stages of zebrafish embryos during MZT, representing one of the most systematic surveys of proteome landscape of the zebrafish embryos during MZT. Nearly 700 proteins were differentially expressed and were divided into six clusters according to their expression patterns. The proteome expression profiles accurately reflect the main events that happen during the MZT, i.e., zygotic genome activation (ZGA), clearance of maternal mRNAs, and initiation of cellular differentiation and organogenesis. MZT is modulated by many proteins at multiple levels in a collaborative fashion, i.e., transcription factors, histones, histone-modifying enzymes, RNA helicases, and P-body proteins. Significant discrepancies were discovered between zebrafish proteome and transcriptome profiles during the MZT. The proteome dynamics database will be a valuable resource for bettering our understanding of MZT.Item TopDIA: A Software Tool for Top-Down Data-Independent Acquisition Proteomics(bioRxiv, 2024-04-09) Basharat, Abdul Rehman; Xiong, Xingzhao; Xu, Tian; Zang, Yong; Sun, Liangliang; Liu, Xiaowen; Biomedical Engineering and Informatics, Luddy School of Informatics, Computing, and EngineeringTop-down mass spectrometry is widely used for proteoform identification, characterization, and quantification owing to its ability to analyze intact proteoforms. In the last decade, top-down proteomics has been dominated by top-down data-dependent acquisition mass spectrometry (TD-DDA-MS), and top-down data-independent acquisition mass spectrometry (TD-DIA-MS) has not been well studied. While TD-DIA-MS produces complex multiplexed tandem mass spectrometry (MS/MS) spectra, which are challenging to confidently identify, it selects more precursor ions for MS/MS analysis and has the potential to increase proteoform identifications compared with TD-DDA-MS. Here we present TopDIA, the first software tool for proteoform identification by TD-DIA-MS. It generates demultiplexed pseudo MS/MS spectra from TD-DIA-MS data and then searches the pseudo MS/MS spectra against a protein sequence database for proteoform identification. We compared the performance of TD-DDA-MS and TD-DIA-MS using Escherichia coli K-12 MG1655 cells and demonstrated that TD-DIA-MS with TopDIA increased proteoform and protein identifications compared with TD-DDA-MS.Item TopFD: A Proteoform Feature Detection Tool for Top–Down Proteomics(American Chemical Society, 2023) Basharat, Abdul Rehman; Zang, Yong; Sun, Liangliang; Liu, Xiaowen; BioHealth Informatics, School of Informatics and ComputingTop-down liquid chromatography-mass spectrometry (LC-MS) analyzes intact proteoforms and generates mass spectra containing peaks of proteoforms with various isotopic compositions, charge states, and retention times. An essential step in top-down MS data analysis is proteoform feature detection, which aims to group these peaks into peak sets (features), each containing all peaks of a proteoform. Accurate protein feature detection enhances the accuracy in MS-based proteoform identification and quantification. Here, we present TopFD, a software tool for top-down MS feature detection that integrates algorithms for proteoform feature detection, feature boundary refinement, and machine learning models for proteoform feature evaluation. We performed extensive benchmarking of TopFD, ProMex, FlashDeconv, and Xtract using seven top-down MS data sets and demonstrated that TopFD outperforms other tools in feature accuracy, reproducibility, and feature abundance reproducibility.