Report for: Using Sequence Similarity Networks for Visualization of Relationships Across Diverse Protein Superfamilies

Title	Using Sequence Similarity Networks for Visualization of Relationships Across Diverse Protein Superfamilies
Published in	PLOS ONE, February 2009
DOI	10.1371/journal.pone.0004345
Pubmed ID	19190775
Authors	Holly J. Atkinson, John H. Morris, Thomas E. Ferrin, Patricia C. Babbitt
Abstract	The dramatic increase in heterogeneous types of biological data--in particular, the abundance of new protein sequences--requires fast and user-friendly methods for organizing this information in a way that enables functional inference. The most widely used strategy to link sequence or structure to function, homology-based function prediction, relies on the fundamental assumption that sequence or structural similarity implies functional similarity. New tools that extend this approach are still urgently needed to associate sequence data with biological information in ways that accommodate the real complexity of the problem, while being accessible to experimental as well as computational biologists. To address this, we have examined the application of sequence similarity networks for visualizing functional trends across protein superfamilies from the context of sequence similarity. Using three large groups of homologous proteins of varying types of structural and functional diversity--GPCRs and kinases from humans, and the crotonase superfamily of enzymes--we show that overlaying networks with orthogonal information is a powerful approach for observing functional themes and revealing outliers. In comparison to other primary methods, networks provide both a good representation of group-wise sequence similarity relationships and a strong visual and quantitative correlation with phylogenetic trees, while enabling analysis and visualization of much larger sets of sequences than trees or multiple sequence alignments can easily accommodate. We also define important limitations and caveats in the application of these networks. As a broadly accessible and effective tool for the exploration of protein superfamilies, sequence similarity networks show great potential for generating testable hypotheses about protein structure-function relationships.

View on publisher site Alert me about new mentions

X Demographics

The data shown below were collected from the profiles of 2 X users who shared this research output. Click here to find out more about how the information was compiled.

Geographical breakdown

Country	Count	As %
Switzerland	1	50%
United States	1	50%

Demographic breakdown

Type	Count	As %
Scientists	1	50%
Members of the public	1	50%

Mendeley readers

The data shown below were compiled from readership statistics for 504 Mendeley readers of this research output. Click here to see the associated Mendeley record.

Geographical breakdown

Country	Count	As %
United States	14	3%
Brazil	4	<1%
Spain	4	<1%
Sweden	3	<1%
United Kingdom	3	<1%
Japan	2	<1%
Australia	1	<1%
Argentina	1	<1%
Denmark	1	<1%
Other	6	1%
Unknown	465	92%

Demographic breakdown

Readers by professional status	Count	As %
Student > Ph. D. Student	145	29%
Researcher	90	18%
Student > Master	66	13%
Student > Bachelor	45	9%
Student > Doctoral Student	21	4%
Other	65	13%
Unknown	72	14%

Readers by discipline	Count	As %
Agricultural and Biological Sciences	151	30%
Biochemistry, Genetics and Molecular Biology	116	23%
Chemistry	61	12%
Computer Science	32	6%
Engineering	9	2%
Other	54	11%
Unknown	81	16%

PLOS

Article Metrics

Using Sequence Similarity Networks for Visualization of Relationships Across Diverse Protein Superfamilies

Mentioned by

Readers on

X Demographics

Geographical breakdown

Demographic breakdown

Mendeley readers

Geographical breakdown

Demographic breakdown