- Browse by Author
Browsing by Author "Li, Shuai Cheng"
Now showing 1 - 2 of 2
Results Per Page
Sort Options
Item Identification of Glycopeptides with Multiple Hydroxylysine O-Glycosylation Sites by Tandem Mass Spectrometry(ACS, 2015-11) Zhang, Yanlin; Yu, Chuan-Yih; Song, Ehwang; Li, Shuai Cheng; Mechref, Yehia; Tang, Haixu; Liu, Xiaowen; Department of Biohealth Informatics, IU School of Informatics and ComputingGlycosylation is one of the most common post-translational modifications in proteins, existing in ∼50% of mammalian proteins. Several research groups have demonstrated that mass spectrometry is an efficient technique for glycopeptide identification; however, this problem is still challenging because of the enormous diversity of glycan structures and the microheterogeneity of glycans. In addition, a glycopeptide may contain multiple glycosylation sites, making the problem complex. Current software tools often fail to identify glycopeptides with multiple glycosylation sites, and hence we present GlycoMID, a graph-based spectral alignment algorithm that can identify glycopeptides with multiple hydroxylysine O-glycosylation sites by tandem mass spectra. GlycoMID was tested on mass spectrometry data sets of the bovine collagen α-(II) chain protein, and experimental results showed that it identified more glycopeptide-spectrum matches than other existing tools, including many glycopeptides with two glycosylation sites.Item Spectral probabilities of top-down tandem mass spectra(Springer Nature, 2014) Liu, Xiaowen; Segar, Matthew W.; Li, Shuai Cheng; Kim, Sangtae; Biomedical Engineering and Informatics, Luddy School of Informatics, Computing, and EngineeringBackground: In mass spectrometry-based proteomics, the statistical significance of a peptide-spectrum or protein-spectrum match is an important indicator of the correctness of the peptide or protein identification. In bottom-up mass spectrometry, probabilistic models, such as the generating function method, have been successfully applied to compute the statistical significance of peptide-spectrum matches for short peptides containing no post-translational modifications. As top-down mass spectrometry, which often identifies intact proteins with post-translational modifications, becomes available in many laboratories, the estimation of statistical significance of top-down protein identification results has come into great demand. Results: In this paper, we study an extended generating function method for accurately computing the statistical significance of protein-spectrum matches with post-translational modifications. Experiments show that the extended generating function method achieves high accuracy in computing spectral probabilities and false discovery rates. Conclusions: The extended generating function method is a non-trivial extension of the generating function method for bottom-up mass spectrometry. It can be used to choose the correct protein-spectrum match from several candidate protein-spectrum matches for a spectrum, as well as separate correct protein-spectrum matches from incorrect ones identified from a large number of tandem mass spectra.