↓ Skip to main content

PLOS

Fast Statistical Alignment

Overview of attention for article published in PLoS Computational Biology, May 2009
Altmetric Badge

Mentioned by

blogs
1 blog
twitter
1 X user
wikipedia
3 Wikipedia pages

Citations

dimensions_citation
314 Dimensions

Readers on

mendeley
350 Mendeley
citeulike
29 CiteULike
connotea
2 Connotea
Title
Fast Statistical Alignment
Published in
PLoS Computational Biology, May 2009
DOI 10.1371/journal.pcbi.1000392
Pubmed ID
Authors

Robert K. Bradley, Adam Roberts, Michael Smoot, Sudeep Juvekar, Jaeyoung Do, Colin Dewey, Ian Holmes, Lior Pachter

Abstract

We describe a new program for the alignment of multiple biological sequences that is both statistically motivated and fast enough for problem sizes that arise in practice. Our Fast Statistical Alignment program is based on pair hidden Markov models which approximate an insertion/deletion process on a tree and uses a sequence annealing algorithm to combine the posterior probabilities estimated from these models into a multiple alignment. FSA uses its explicit statistical model to produce multiple alignments which are accompanied by estimates of the alignment accuracy and uncertainty for every column and character of the alignment--previously available only with alignment programs which use computationally-expensive Markov Chain Monte Carlo approaches--yet can align thousands of long sequences. Moreover, FSA utilizes an unsupervised query-specific learning procedure for parameter estimation which leads to improved accuracy on benchmark reference alignments in comparison to existing programs. The centroid alignment approach taken by FSA, in combination with its learning procedure, drastically reduces the amount of false-positive alignment on biological data in comparison to that given by other methods. The FSA program and a companion visualization tool for exploring uncertainty in alignments can be used via a web interface at http://orangutan.math.berkeley.edu/fsa/, and the source code is available at http://fsa.sourceforge.net/.

X Demographics

X Demographics

The data shown below were collected from the profile of 1 X user who shared this research output. Click here to find out more about how the information was compiled.
Mendeley readers

Mendeley readers

The data shown below were compiled from readership statistics for 350 Mendeley readers of this research output. Click here to see the associated Mendeley record.

Geographical breakdown

Country Count As %
United States 22 6%
United Kingdom 11 3%
Spain 3 <1%
Switzerland 3 <1%
Brazil 3 <1%
Canada 2 <1%
Nigeria 2 <1%
Netherlands 2 <1%
Italy 2 <1%
Other 14 4%
Unknown 286 82%

Demographic breakdown

Readers by professional status Count As %
Researcher 105 30%
Student > Ph. D. Student 84 24%
Student > Master 37 11%
Professor > Associate Professor 29 8%
Professor 22 6%
Other 53 15%
Unknown 20 6%
Readers by discipline Count As %
Agricultural and Biological Sciences 195 56%
Biochemistry, Genetics and Molecular Biology 50 14%
Computer Science 47 13%
Mathematics 6 2%
Medicine and Dentistry 5 1%
Other 23 7%
Unknown 24 7%