SLiMFinder: A Probabilistic Method for Identifying Over-Represented, Convergently Evolved, Short Linear Motifs in Proteins

Overview of attention for article published in PLOS ONE, October 2007

Altmetric Badge

Mentioned by

blogs: 1 blog

Citations

dimensions_citation: 135 Dimensions

Readers on

mendeley: 106 Mendeley
citeulike: 5 CiteULike

Summary Blogs Dimensions citations

Title	SLiMFinder: A Probabilistic Method for Identifying Over-Represented, Convergently Evolved, Short Linear Motifs in Proteins
Published in	PLOS ONE, October 2007
DOI	10.1371/journal.pone.0000967
Pubmed ID	17912346
Authors	Richard J. Edwards, Norman E. Davey, Denis C. Shields
Abstract	Short linear motifs (SLiMs) in proteins are functional microdomains of fundamental importance in many biological systems. SLiMs typically consist of a 3 to 10 amino acid stretch of the primary protein sequence, of which as few as two sites may be important for activity, making identification of novel SLiMs extremely difficult. In particular, it can be very difficult to distinguish a randomly recurring "motif" from a truly over-represented one. Incorporating ambiguous amino acid positions and/or variable-length wildcard spacers between defined residues further complicates the matter. In this paper we present two algorithms. SLiMBuild identifies convergently evolved, short motifs in a dataset of proteins. Motifs are built by combining dimers into longer patterns, retaining only those motifs occurring in a sufficient number of unrelated proteins. Motifs with fixed amino acid positions are identified and then combined to incorporate amino acid ambiguity and variable-length wildcard spacers. The algorithm is computationally efficient compared to alternatives, particularly when datasets include homologous proteins, and provides great flexibility in the nature of motifs returned. The SLiMChance algorithm estimates the probability of returned motifs arising by chance, correcting for the size and composition of the dataset, and assigns a significance value to each motif. These algorithms are implemented in a software package, SLiMFinder. SLiMFinder default settings identify known SLiMs with 100% specificity, and have a low false discovery rate on random test data. The efficiency of SLiMBuild and low false discovery rate of SLiMChance make SLiMFinder highly suited to high throughput motif discovery and individual high quality analyses alike. Examples of such analyses on real biological data, and how SLiMFinder results can help direct future discoveries, are provided. SLiMFinder is freely available for download under a GNU license from http://bioinformatics.ucd.ie/shields/software/slimfinder/.

View on publisher site Alert me about new mentions

Mendeley readers

Mendeley readers

The data shown below were compiled from readership statistics for 106 Mendeley readers of this research output. Click here to see the associated Mendeley record.

Geographical breakdown

Country	Count	As %
United States	3	3%
Ireland	3	3%
United Kingdom	2	2%
Australia	1	<1%
Finland	1	<1%
France	1	<1%
Argentina	1	<1%
Norway	1	<1%
Spain	1	<1%
Other	1	<1%
Unknown	91	86%

Demographic breakdown

Readers by professional status	Count	As %
Student > Ph. D. Student	28	26%
Researcher	21	20%
Student > Master	14	13%
Student > Bachelor	13	12%
Professor	6	6%
Other	17	16%
Unknown	7	7%

Readers by discipline	Count	As %
Agricultural and Biological Sciences	48	45%
Biochemistry, Genetics and Molecular Biology	25	24%
Computer Science	11	10%
Chemistry	4	4%
Mathematics	2	2%
Other	8	8%
Unknown	8	8%