↓ Skip to main content

PLOS

Genometa - A Fast and Accurate Classifier for Short Metagenomic Shotgun Reads

Overview of attention for article published in PLOS ONE, August 2012
Altmetric Badge

Mentioned by

blogs
1 blog
twitter
18 X users
peer_reviews
1 peer review site
f1000
1 research highlight platform

Citations

dimensions_citation
35 Dimensions

Readers on

mendeley
155 Mendeley
citeulike
4 CiteULike
Title
Genometa - A Fast and Accurate Classifier for Short Metagenomic Shotgun Reads
Published in
PLOS ONE, August 2012
DOI 10.1371/journal.pone.0041224
Pubmed ID
Authors

Colin F. Davenport, Jens Neugebauer, Nils Beckmann, Benedikt Friedrich, Burim Kameri, Svea Kokott, Malte Paetow, Björn Siekmann, Matthias Wieding-Drewes, Markus Wienhöfer, Stefan Wolf, Burkhard Tümmler, Volker Ahlers, Frauke Sprengel

Abstract

Metagenomic studies use high-throughput sequence data to investigate microbial communities in situ. However, considerable challenges remain in the analysis of these data, particularly with regard to speed and reliable analysis of microbial species as opposed to higher level taxa such as phyla. We here present Genometa, a computationally undemanding graphical user interface program that enables identification of bacterial species and gene content from datasets generated by inexpensive high-throughput short read sequencing technologies. Our approach was first verified on two simulated metagenomic short read datasets, detecting 100% and 94% of the bacterial species included with few false positives or false negatives. Subsequent comparative benchmarking analysis against three popular metagenomic algorithms on an Illumina human gut dataset revealed Genometa to attribute the most reads to bacteria at species level (i.e. including all strains of that species) and demonstrate similar or better accuracy than the other programs. Lastly, speed was demonstrated to be many times that of BLAST due to the use of modern short read aligners. Our method is highly accurate if bacteria in the sample are represented by genomes in the reference sequence but cannot find species absent from the reference. This method is one of the most user-friendly and resource efficient approaches and is thus feasible for rapidly analysing millions of short reads on a personal computer. Availability: The Genometa program, a step by step tutorial and Java source code are freely available from http://genomics1.mh-hannover.de/genometa/ and on http://code.google.com/p/genometa/. This program has been tested on Ubuntu Linux and Windows XP/7.

X Demographics

X Demographics

The data shown below were collected from the profiles of 18 X users who shared this research output. Click here to find out more about how the information was compiled.
Mendeley readers

Mendeley readers

The data shown below were compiled from readership statistics for 155 Mendeley readers of this research output. Click here to see the associated Mendeley record.

Geographical breakdown

Country Count As %
United States 8 5%
Sweden 3 2%
United Kingdom 2 1%
Brazil 2 1%
France 1 <1%
Finland 1 <1%
Germany 1 <1%
Saudi Arabia 1 <1%
Australia 1 <1%
Other 2 1%
Unknown 133 86%

Demographic breakdown

Readers by professional status Count As %
Researcher 40 26%
Student > Ph. D. Student 35 23%
Student > Master 24 15%
Professor > Associate Professor 11 7%
Professor 10 6%
Other 19 12%
Unknown 16 10%
Readers by discipline Count As %
Agricultural and Biological Sciences 82 53%
Computer Science 18 12%
Biochemistry, Genetics and Molecular Biology 16 10%
Medicine and Dentistry 6 4%
Mathematics 5 3%
Other 12 8%
Unknown 16 10%