Mass graphs and their applications in top-down proteomics

dc.contributor.author: Kou, Qiang
dc.contributor.author: Wu, Si
dc.contributor.author: Tolić, Nikola
dc.contributor.author: Pasa-Tolić, Ljiljana
dc.contributor.author: Liu, Xiaowen
dc.contributor.department: Department of Biohealth Informatics, School of Informatics and Computing
dc.date.accessioned: 2017-04-13T16:28:11Z
dc.date.available: 2017-04-13T16:28:11Z
dc.date.issued: 2015
dc.description.abstract: Although proteomics has made rapid progress in the past decade, researchers are still at an early stage of exploring the world of complex proteoforms, which are protein products with various primary structure alterations resulting from gene mutations, alternative splicing, post-translational modifications, and other biological processes. Proteoform identification is essential for mapping proteoforms to their biological functions and for discovering novel proteoforms and new protein functions. Top-down mass spectrometry is the method of choice for identifying complex proteoforms because it provides a "bird's-eye view" of intact proteoforms. The combinatorial explosion of proteoforms, which may yield billions of possible proteoforms for a single protein, makes proteoform identification a challenging computational problem. Here we propose a new data structure, called the mass graph, for efficiently representing proteoforms. In addition, we design mass graph alignment algorithms for proteoform identification by top-down mass spectrometry. Experiments on a histone H4 mass spectrometry data set showed that the proposed methods outperformed MS-Align-E in identifying complex proteoforms.
dc.eprint.version: Final published version
dc.identifier.citation: Kou, Q., Wu, S., Tolić, N., Pasa-Tolić, L., & Liu, X. (2015). Mass graphs and their applications in top-down proteomics. bioRxiv, 031997. https://doi.org/10.1101/031997
dc.identifier.uri: https://hdl.handle.net/1805/12264
dc.language.iso: en
dc.relation.isversionof: 10.1101/031997
dc.relation.journal: bioRxiv
dc.rights: Attribution-NonCommercial-NoDerivs 3.0 United States
dc.rights.uri: https://creativecommons.org/licenses/by-nc-nd/3.0/us
dc.source: Other
dc.subject: proteomics
dc.subject: mass graphs
dc.subject: proteoform identification
dc.title: Mass graphs and their applications in top-down proteomics
dc.type: Article
Files
Name: Kou-2016-mass.pdf
Size: 335.46 KB
Format: Adobe Portable Document Format