Functional enrichment analyses and construction of functional similarity networks with high confidence function prediction by PFP

dc.contributor.authorHawkins, Troy
dc.contributor.authorChitale, Meghana
dc.contributor.authorKihara, Daisuke
dc.contributor.departmentMedical and Molecular Genetics, School of Medicineen_US
dc.date.accessioned2020-05-18T19:50:08Z
dc.date.available2020-05-18T19:50:08Z
dc.date.issued2010-05-19
dc.description.abstractBackground A new paradigm of biological investigation takes advantage of technologies that produce large high throughput datasets, including genome sequences, interactions of proteins, and gene expression. The ability of biologists to analyze and interpret such data relies on functional annotation of the included proteins, but even in highly characterized organisms many proteins can lack the functional evidence necessary to infer their biological relevance. Results Here we have applied high confidence function predictions from our automated prediction system, PFP, to three genome sequences, Escherichia coli, Saccharomyces cerevisiae, and Plasmodium falciparum (malaria). The number of annotated genes is increased by PFP to over 90% for all of the genomes. Using the large coverage of the function annotation, we introduced the functional similarity networks which represent the functional space of the proteomes. Four different functional similarity networks are constructed for each proteome, one each by considering similarity in a single Gene Ontology (GO) category, i.e. Biological Process, Cellular Component, and Molecular Function, and another one by considering overall similarity with the funSim score. The functional similarity networks are shown to have higher modularity than the protein-protein interaction network. Moreover, the funSim score network is distinct from the single GO-score networks by showing a higher clustering degree exponent value and thus has a higher tendency to be hierarchical. In addition, examining function assignments to the protein-protein interaction network and local regions of genomes has identified numerous cases where subnetworks or local regions have functionally coherent proteins. These results will help interpreting interactions of proteins and gene orders in a genome. Several examples of both analyses are highlighted. Conclusion The analyses demonstrate that applying high confidence predictions from PFP can have a significant impact on a researchers' ability to interpret the immense biological data that are being generated today. The newly introduced functional similarity networks of the three organisms show different network properties as compared with the protein-protein interaction networks.en_US
dc.eprint.versionFinal published versionen_US
dc.identifier.citationHawkins, T., Chitale, M. & Kihara, D. Functional enrichment analyses and construction of functional similarity networks with high confidence function prediction by PFP. BMC Bioinformatics 11, 265 (2010). https://doi.org/10.1186/1471-2105-11-265en_US
dc.identifier.urihttps://hdl.handle.net/1805/22792
dc.language.isoen_USen_US
dc.publisherBMCen_US
dc.relation.isversionof10.1186/1471-2105-11-265en_US
dc.relation.journalBMC Bioinformaticsen_US
dc.rightsAttribution 4.0 International*
dc.rights.urihttp://creativecommons.org/licenses/by/4.0/*
dc.sourcePublisheren_US
dc.subjectGene Ontologyen_US
dc.subjectMalariaen_US
dc.subjectCluster Coefficienten_US
dc.subjectSimilarity Thresholden_US
dc.subjectFunction Assignmenten_US
dc.titleFunctional enrichment analyses and construction of functional similarity networks with high confidence function prediction by PFPen_US
dc.typeArticleen_US
Files
Original bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
1471-2105-11-265.pdf
Size:
4.88 MB
Format:
Adobe Portable Document Format
Description:
Main article
License bundle
Now showing 1 - 1 of 1
No Thumbnail Available
Name:
license.txt
Size:
1.99 KB
Format:
Item-specific license agreed upon to submission
Description: