An efficient algorithm for the blocked pattern matching problem

dc.contributor.authorDeng, Fei
dc.contributor.authorWang, Lusheng
dc.contributor.authorLiu, Xiaowen
dc.contributor.departmentDepartment of BioHealth Informatics, School of Informatics and Computingen_US
dc.date.accessioned2015-11-06T15:12:57Z
dc.date.available2015-11-06T15:12:57Z
dc.date.issued2015-10
dc.description.abstractMotivation: Tandem mass spectrometry (MS) has become the method of choice for protein identification and quantification. In the era of big data biology, tandem mass spectra are often searched against huge protein databases generated from genomes or RNA-Seq data for peptide identification. However, most existing tools for MS-based peptide identification compare a tandem mass spectrum against all peptides in a database whose molecular masses are similar to the precursor mass of the spectrum, making mass spectral data analysis slow for huge databases. Tag-based methods extract peptide sequence tags from a tandem mass spectrum and use them as a filter to reduce the number of candidate peptides, thus speeding up the database search. Recently, gapped tags have been introduced into mass spectral data analysis because they improve the sensitivity of peptide identification compared with sequence tags. However, the blocked pattern matching (BPM) problem, which is an essential step in gapped tag-based peptide identification, has not been fully solved. Results: In this article, we propose a fast and memory-efficient algorithm for the BPM problem. Experiments on both simulated and real datasets showed that the proposed algorithm achieved high speed and high sensitivity for peptide filtration in peptide identification by database search.en_US
dc.eprint.versionAuthor's manuscripten_US
dc.identifier.citationDeng, F., Wang, L., & Liu, X. (2014). An efficient algorithm for the blocked pattern matching problem. Bioinformatics. http://dx.doi.org/10.1093/bioinformatics/btu678en_US
dc.identifier.urihttps://hdl.handle.net/1805/7371
dc.language.isoen_USen_US
dc.publisherOxforden_US
dc.relation.isversionof10.1093/bioinformatics/btu678en_US
dc.relation.journalBioinformaticsen_US
dc.rightsPublisher Policyen_US
dc.sourceAuthoren_US
dc.subjectblocked pattern matchingen_US
dc.subjectpeptide identificationen_US
dc.subjectsequence analysisen_US
dc.titleAn efficient algorithm for the blocked pattern matching problemen_US
dc.typeArticleen_US
Files
Original bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
Deng_2015_an_efficient.pdf
Size:
615.61 KB
Format:
Adobe Portable Document Format
License bundle
Now showing 1 - 1 of 1
No Thumbnail Available
Name:
license.txt
Size:
1.88 KB
Format:
Item-specific license agreed upon to submission
Description: