↓ Skip to main content

PLOS

Protein Molecular Function Prediction by Bayesian Phylogenomics

Overview of attention for article published in PLoS Computational Biology, October 2005
Altmetric Badge

Mentioned by

blogs
1 blog

Citations

dimensions_citation
160 Dimensions

Readers on

mendeley
202 Mendeley
citeulike
18 CiteULike
connotea
5 Connotea
Title
Protein Molecular Function Prediction by Bayesian Phylogenomics
Published in
PLoS Computational Biology, October 2005
DOI 10.1371/journal.pcbi.0010045
Pubmed ID
Authors

Barbara E Engelhardt, Michael I Jordan, Kathryn E Muratore, Steven E Brenner

Abstract

We present a statistical graphical model to infer specific molecular function for unannotated protein sequences using homology. Based on phylogenomic principles, SIFTER (Statistical Inference of Function Through Evolutionary Relationships) accurately predicts molecular function for members of a protein family given a reconciled phylogeny and available function annotations, even when the data are sparse or noisy. Our method produced specific and consistent molecular function predictions across 100 Pfam families in comparison to the Gene Ontology annotation database, BLAST, GOtcha, and Orthostrapper. We performed a more detailed exploration of functional predictions on the adenosine-5'-monophosphate/adenosine deaminase family and the lactate/malate dehydrogenase family, in the former case comparing the predictions against a gold standard set of published functional characterizations. Given function annotations for 3% of the proteins in the deaminase family, SIFTER achieves 96% accuracy in predicting molecular function for experimentally characterized proteins as reported in the literature. The accuracy of SIFTER on this dataset is a significant improvement over other currently available methods such as BLAST (75%), GeneQuiz (64%), GOtcha (89%), and Orthostrapper (11%). We also experimentally characterized the adenosine deaminase from Plasmodium falciparum, confirming SIFTER's prediction. The results illustrate the predictive power of exploiting a statistical model of function evolution in phylogenomic problems. A software implementation of SIFTER is available from the authors.

Mendeley readers

Mendeley readers

The data shown below were compiled from readership statistics for 202 Mendeley readers of this research output. Click here to see the associated Mendeley record.

Geographical breakdown

Country Count As %
United States 20 10%
Brazil 5 2%
United Kingdom 4 2%
Spain 2 <1%
Netherlands 1 <1%
Portugal 1 <1%
Germany 1 <1%
Mexico 1 <1%
France 1 <1%
Other 4 2%
Unknown 162 80%

Demographic breakdown

Readers by professional status Count As %
Researcher 56 28%
Student > Ph. D. Student 47 23%
Professor > Associate Professor 18 9%
Student > Master 16 8%
Student > Postgraduate 14 7%
Other 38 19%
Unknown 13 6%
Readers by discipline Count As %
Agricultural and Biological Sciences 117 58%
Biochemistry, Genetics and Molecular Biology 26 13%
Computer Science 17 8%
Chemistry 6 3%
Engineering 4 2%
Other 15 7%
Unknown 17 8%