PEPPI: a peptidomic database of human protein isoforms for proteomics experiments

Zhou, Ao; Zhang, Fan; Chen, Jake Yue

PEPPI: a peptidomic database of human protein isoforms for proteomics experiments

dc.contributor.author	Zhou, Ao
dc.contributor.author	Zhang, Fan
dc.contributor.author	Chen, Jake Yue
dc.contributor.department	BioHealth Informatics, School of Informatics and Computing	en_US
dc.date.accessioned	2020-05-04T16:36:37Z
dc.date.available	2020-05-04T16:36:37Z
dc.date.issued	2010-10-07
dc.description.abstract	Background Protein isoform generation, which may derive from alternative splicing, genetic polymorphism, and posttranslational modification, is an essential source of achieving molecular diversity by eukaryotic cells. Previous studies have shown that protein isoforms play critical roles in disease diagnosis, risk assessment, sub-typing, prognosis, and treatment outcome predictions. Understanding the types, presence, and abundance of different protein isoforms in different cellular and physiological conditions is a major task in functional proteomics, and may pave ways to molecular biomarker discovery of human diseases. In tandem mass spectrometry (MS/MS) based proteomics analysis, peptide peaks with exact matches to protein sequence records in the proteomics database may be identified with mass spectrometry (MS) search software. However, due to limited annotation and poor coverage of protein isoforms in proteomics databases, high throughput protein isoform identifications, particularly those arising from alternative splicing and genetic polymorphism, have not been possible. Results Therefore, we present the PEPtidomics Protein Isoform Database (PEPPI, http://bio.informatics.iupui.edu/peppi), a comprehensive database of computationally-synthesized human peptides that can identify protein isoforms derived from either alternatively spliced mRNA transcripts or SNP variations. We collected genome, pre-mRNA alternative splicing and SNP information from Ensembl. We synthesized in silico isoform transcripts that cover all exons and theoretically possible junctions of exons and introns, as well as all their variations derived from known SNPs. With three case studies, we further demonstrated that the database can help researchers discover and characterize new protein isoform biomarkers from experimental proteomics data. Conclusions We developed a new tool for the proteomics community to characterize protein isoforms from MS-based proteomics experiments. By cataloguing each peptide configurations in the PEPPI database, users can study genetic variations and alternative splicing events at the proteome level. They can also batch-download peptide sequences in FASTA format to search for MS/MS spectra derived from human samples. The database can help generate novel hypotheses on molecular risk factors and molecular mechanisms of complex diseases, leading to identification of potentially highly specific protein isoform biomarkers.	en_US
dc.eprint.version	Final published version	en_US
dc.identifier.citation	Zhou, A., Zhang, F. & Chen, J.Y. PEPPI: a peptidomic database of human protein isoforms for proteomics experiments. BMC Bioinformatics 11, S7 (2010). https://doi.org/10.1186/1471-2105-11-S6-S7	en_US
dc.identifier.uri	https://hdl.handle.net/1805/22691
dc.language.iso	en_US	en_US
dc.publisher	BMC	en_US
dc.relation.isversionof	10.1186/1471-2105-11-S6-S7	en_US
dc.relation.journal	BMC Bioinformatics	en_US
dc.rights	Attribution 4.0 International	*
dc.rights.uri	http://creativecommons.org/licenses/by/4.0/	*
dc.source	Publisher	en_US
dc.subject	Protein Isoforms	en_US
dc.subject	Peptide Region	en_US
dc.subject	Alternative Splice Event	en_US
dc.subject	Human Fetal Liver	en_US
dc.subject	Type Peptide	en_US
dc.title	PEPPI: a peptidomic database of human protein isoforms for proteomics experiments	en_US
dc.type	Article	en_US

Files

Original bundle

Now showing 1 - 1 of 1

Name:: 1471-2105-11-S6-S7.pdf
Size:: 3.17 MB
Format:: Adobe Portable Document Format
Description:: Main article

Download

License bundle

Now showing 1 - 1 of 1

Name:: license.txt
Size:: 1.99 KB
Format:: Item-specific license agreed upon to submission
Description:

Download

Collections

Department of Biomedical Engineering and Informatics Works
Jake Chen