↓ Skip to main content

PLOS

The CanOE Strategy: Integrating Genomic and Metabolic Contexts across Multiple Prokaryote Genomes to Find Candidate Genes for Orphan Enzymes

Overview of attention for article published in PLoS Computational Biology, May 2012
Altmetric Badge

Mentioned by

twitter
2 X users

Citations

dimensions_citation
32 Dimensions

Readers on

mendeley
69 Mendeley
citeulike
4 CiteULike
Title
The CanOE Strategy: Integrating Genomic and Metabolic Contexts across Multiple Prokaryote Genomes to Find Candidate Genes for Orphan Enzymes
Published in
PLoS Computational Biology, May 2012
DOI 10.1371/journal.pcbi.1002540
Pubmed ID
Authors

Adam Alexander Thil Smith, Eugeni Belda, Alain Viari, Claudine Medigue, David Vallenet

Abstract

Of all biochemically characterized metabolic reactions formalized by the IUBMB, over one out of four have yet to be associated with a nucleic or protein sequence, i.e. are sequence-orphan enzymatic activities. Few bioinformatics annotation tools are able to propose candidate genes for such activities by exploiting context-dependent rather than sequence-dependent data, and none are readily accessible and propose result integration across multiple genomes. Here, we present CanOE (Candidate genes for Orphan Enzymes), a four-step bioinformatics strategy that proposes ranked candidate genes for sequence-orphan enzymatic activities (or orphan enzymes for short). The first step locates "genomic metabolons", i.e. groups of co-localized genes coding proteins catalyzing reactions linked by shared metabolites, in one genome at a time. These metabolons can be particularly helpful for aiding bioanalysts to visualize relevant metabolic data. In the second step, they are used to generate candidate associations between un-annotated genes and gene-less reactions. The third step integrates these gene-reaction associations over several genomes using gene families, and summarizes the strength of family-reaction associations by several scores. In the final step, these scores are used to rank members of gene families which are proposed for metabolic reactions. These associations are of particular interest when the metabolic reaction is a sequence-orphan enzymatic activity. Our strategy found over 60,000 genomic metabolons in more than 1,000 prokaryote organisms from the MicroScope platform, generating candidate genes for many metabolic reactions, of which more than 70 distinct orphan reactions. A computational validation of the approach is discussed. Finally, we present a case study on the anaerobic allantoin degradation pathway in Escherichia coli K-12.

X Demographics

X Demographics

The data shown below were collected from the profiles of 2 X users who shared this research output. Click here to find out more about how the information was compiled.
Mendeley readers

Mendeley readers

The data shown below were compiled from readership statistics for 69 Mendeley readers of this research output. Click here to see the associated Mendeley record.

Geographical breakdown

Country Count As %
United States 6 9%
Brazil 2 3%
France 1 1%
Mexico 1 1%
United Kingdom 1 1%
Unknown 58 84%

Demographic breakdown

Readers by professional status Count As %
Researcher 27 39%
Student > Ph. D. Student 14 20%
Student > Bachelor 6 9%
Student > Master 6 9%
Student > Postgraduate 3 4%
Other 6 9%
Unknown 7 10%
Readers by discipline Count As %
Agricultural and Biological Sciences 40 58%
Biochemistry, Genetics and Molecular Biology 11 16%
Computer Science 5 7%
Chemistry 3 4%
Nursing and Health Professions 1 1%
Other 2 3%
Unknown 7 10%