Evaluation of Sequence Features from Intrinsically Disordered Regions for the Estimation of Protein Function

Overview of attention for article published in PLOS ONE, February 2014

Mentioned by

1 X user
1 research highlight platform (F1000)

Readers on

35 Mendeley
Title
Evaluation of Sequence Features from Intrinsically Disordered Regions for the Estimation of Protein Function
Published in
PLOS ONE, February 2014
DOI 10.1371/journal.pone.0089890
Authors

Alok Sharma, Abdollah Dehzangi, James Lyons, Seiya Imoto, Satoru Miyano, Kenta Nakai, Ashwini Patil

Abstract

With the exponential increase in the number of sequenced organisms, automated annotation of proteins is becoming increasingly important. Intrinsically disordered regions are known to play a significant role in protein function. Despite their abundance, especially in eukaryotes, they are rarely used to inform function prediction systems. In this study, we extracted seven sequence features from intrinsically disordered regions and developed a scheme that uses them to predict the Gene Ontology (GO) Slim terms associated with proteins. We evaluated the function prediction performance of each feature. Our results indicate that the residue composition-based features have the highest precision, while bigram probabilities, based on sequence profiles of intrinsically disordered regions obtained from PSI-BLAST, have the highest recall. Amino acid bigrams and features based on secondary structure show an intermediate level of precision and recall. Almost all features showed high prediction performance for GO Slim terms related to extracellular matrix, nucleus, and RNA and DNA binding. However, feature performance varied significantly across GO Slim terms, emphasizing the need for a unique classifier optimized for the prediction of each functional term. These findings provide a first comprehensive and quantitative evaluation of sequence features in intrinsically disordered regions and will help in the development of a more informative protein function predictor.
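To make two of the feature families named in the abstract concrete, here is a minimal Python sketch of residue composition and amino acid bigram frequencies computed only over disordered residues. It is an illustration under stated assumptions, not the authors' implementation: the function names, the per-residue mask format ('D' for disordered, any other character for ordered), and the normalization are all hypothetical, and the abstract does not specify how the paper defines or scales these features.

```python
from collections import Counter
from itertools import product

# The 20 standard amino acids; non-standard symbols (e.g. 'X') are ignored.
AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"


def disordered_segments(sequence, disorder_mask):
    """Return the contiguous stretches of `sequence` marked 'D' in the mask.

    Assumption: `disorder_mask` has one character per residue, as might be
    produced by some disorder predictor; the format here is hypothetical.
    """
    segments, current = [], []
    for residue, flag in zip(sequence, disorder_mask):
        if flag == "D":
            current.append(residue)
        elif current:
            segments.append("".join(current))
            current = []
    if current:
        segments.append("".join(current))
    return segments


def residue_composition(segments):
    """Fraction of each amino acid over all disordered residues (20 values)."""
    counts = Counter(r for seg in segments for r in seg if r in AMINO_ACIDS)
    total = sum(counts.values())
    return {aa: (counts[aa] / total if total else 0.0) for aa in AMINO_ACIDS}


def bigram_frequencies(segments):
    """Relative frequency of each adjacent residue pair (400 values).

    Pairs are counted within a segment only, so no bigram spans the gap
    between two separate disordered regions.
    """
    counts = Counter()
    for seg in segments:
        for a, b in zip(seg, seg[1:]):
            if a in AMINO_ACIDS and b in AMINO_ACIDS:
                counts[a + b] += 1
    total = sum(counts.values())
    return {
        a + b: (counts[a + b] / total if total else 0.0)
        for a, b in product(AMINO_ACIDS, repeat=2)
    }


if __name__ == "__main__":
    # Toy input: residues 3-7 ("SSPEE") are marked as disordered.
    segs = disordered_segments("MKSSPEEKAAL", "OODDDDDOOOO")
    print(segs)
    print({k: v for k, v in residue_composition(segs).items() if v > 0})
    print({k: v for k, v in bigram_frequencies(segs).items() if v > 0})
```

Each function returns a fixed-length feature dictionary (20 or 400 entries), so the values could be fed to a separate classifier for each GO Slim term, in line with the abstract's observation that feature performance differs from term to term.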

X Demographics

The data shown below were collected from the profile of 1 X user who shared this research output.
Mendeley readers

The data shown below were compiled from readership statistics for 35 Mendeley readers of this research output.

Geographical breakdown

Country    Count   As %
Israel     1       3%
Unknown    34      97%

Demographic breakdown

Readers by professional status     Count   As %
Student > Master                   6       17%
Student > Ph. D. Student           6       17%
Researcher                         5       14%
Student > Bachelor                 3       9%
Professor > Associate Professor    3       9%
Other                              7       20%
Unknown                            5       14%

Readers by discipline                           Count   As %
Agricultural and Biological Sciences            12      34%
Biochemistry, Genetics and Molecular Biology    7       20%
Computer Science                                6       17%
Chemistry                                       3       9%
Engineering                                     1       3%
Other                                           0       0%
Unknown                                         6       17%