A mass graph-based approach for the identification of modified proteoforms using top-down tandem mass spectra

dc.contributor.author: Kou, Qiang
dc.contributor.author: Wu, Si
dc.contributor.author: Tolić, Nikola
dc.contributor.author: Paša-Tolić, Ljiljana
dc.contributor.author: Liu, Yunlong
dc.contributor.author: Liu, Xiaowen
dc.contributor.department: BioHealth Informatics, School of Informatics and Computing
dc.date.accessioned: 2018-10-18T15:57:47Z
dc.date.available: 2018-10-18T15:57:47Z
dc.date.issued: 2017-05-01
dc.description.abstract: Motivation: Although proteomics has rapidly developed in the past decade, researchers are still in the early stage of exploring the world of complex proteoforms, which are protein products with various primary structure alterations resulting from gene mutations, alternative splicing, post-translational modifications, and other biological processes. Proteoform identification is essential to mapping proteoforms to their biological functions as well as discovering novel proteoforms and new protein functions. Top-down mass spectrometry is the method of choice for identifying complex proteoforms because it provides a 'bird's eye view' of intact proteoforms. The combinatorial explosion of various alterations on a protein may result in billions of possible proteoforms, making proteoform identification a challenging computational problem. Results: We propose a new data structure, called the mass graph, for efficient representation of proteoforms and design mass graph alignment algorithms. We developed TopMG, a mass graph-based software tool for proteoform identification by top-down mass spectrometry. Experiments on top-down mass spectrometry datasets showed that TopMG outperformed existing methods in identifying complex proteoforms.
dc.eprint.version: Final published version
dc.identifier.citation: Kou, Q., Wu, S., Tolić, N., Paša-Tolić, L., Liu, Y., & Liu, X. (2017). A mass graph-based approach for the identification of modified proteoforms using top-down tandem mass spectra. Bioinformatics, 33(9), 1309–1316. http://doi.org/10.1093/bioinformatics/btw806
dc.identifier.uri: https://hdl.handle.net/1805/17579
dc.language.iso: en_US
dc.publisher: Oxford
dc.relation.isversionof: 10.1093/bioinformatics/btw806
dc.relation.journal: Bioinformatics
dc.rights: Publisher Policy
dc.source: PMC
dc.subject: Algorithms
dc.subject: Alternative Splicing
dc.subject: Molecular Weight
dc.subject: Protein Processing, Post-Translational
dc.subject: Proteome
dc.subject: Tandem Mass Spectrometry
dc.title: A mass graph-based approach for the identification of modified proteoforms using top-down tandem mass spectra
dc.type: Article
ul.alternative.fulltext: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5860502/
Files
Original bundle
Name: btw806.pdf
Size: 611.89 KB
Format: Adobe Portable Document Format
Description: Main article
License bundle
Name: license.txt
Size: 1.99 KB
Format: Item-specific license agreed upon to submission