↓ Skip to main content

PLOS

Using Sequence Similarity Networks for Visualization of Relationships Across Diverse Protein Superfamilies

Overview of attention for article published in PLOS ONE, February 2009
Altmetric Badge

Mentioned by

blogs
2 blogs
twitter
2 X users

Readers on

mendeley
504 Mendeley
citeulike
10 CiteULike
Title
Using Sequence Similarity Networks for Visualization of Relationships Across Diverse Protein Superfamilies
Published in
PLOS ONE, February 2009
DOI 10.1371/journal.pone.0004345
Pubmed ID
Authors

Holly J. Atkinson, John H. Morris, Thomas E. Ferrin, Patricia C. Babbitt

Abstract

The dramatic increase in heterogeneous types of biological data--in particular, the abundance of new protein sequences--requires fast and user-friendly methods for organizing this information in a way that enables functional inference. The most widely used strategy to link sequence or structure to function, homology-based function prediction, relies on the fundamental assumption that sequence or structural similarity implies functional similarity. New tools that extend this approach are still urgently needed to associate sequence data with biological information in ways that accommodate the real complexity of the problem, while being accessible to experimental as well as computational biologists. To address this, we have examined the application of sequence similarity networks for visualizing functional trends across protein superfamilies from the context of sequence similarity. Using three large groups of homologous proteins of varying types of structural and functional diversity--GPCRs and kinases from humans, and the crotonase superfamily of enzymes--we show that overlaying networks with orthogonal information is a powerful approach for observing functional themes and revealing outliers. In comparison to other primary methods, networks provide both a good representation of group-wise sequence similarity relationships and a strong visual and quantitative correlation with phylogenetic trees, while enabling analysis and visualization of much larger sets of sequences than trees or multiple sequence alignments can easily accommodate. We also define important limitations and caveats in the application of these networks. As a broadly accessible and effective tool for the exploration of protein superfamilies, sequence similarity networks show great potential for generating testable hypotheses about protein structure-function relationships.

X Demographics

X Demographics

The data shown below were collected from the profiles of 2 X users who shared this research output. Click here to find out more about how the information was compiled.
Mendeley readers

Mendeley readers

The data shown below were compiled from readership statistics for 504 Mendeley readers of this research output. Click here to see the associated Mendeley record.

Geographical breakdown

Country Count As %
United States 14 3%
Brazil 4 <1%
Spain 4 <1%
Sweden 3 <1%
United Kingdom 3 <1%
Japan 2 <1%
Australia 1 <1%
Argentina 1 <1%
Denmark 1 <1%
Other 6 1%
Unknown 465 92%

Demographic breakdown

Readers by professional status Count As %
Student > Ph. D. Student 145 29%
Researcher 90 18%
Student > Master 66 13%
Student > Bachelor 45 9%
Student > Doctoral Student 21 4%
Other 65 13%
Unknown 72 14%
Readers by discipline Count As %
Agricultural and Biological Sciences 151 30%
Biochemistry, Genetics and Molecular Biology 116 23%
Chemistry 61 12%
Computer Science 32 6%
Engineering 9 2%
Other 54 11%
Unknown 81 16%