↓ Skip to main content

PLOS

Hierarchical Classification of Protein Folds Using a Novel Ensemble Classifier

Overview of attention for article published in PLOS ONE, February 2013
Altmetric Badge

Mentioned by

facebook
1 Facebook page

Citations

dimensions_citation
140 Dimensions

Readers on

mendeley
50 Mendeley
Title
Hierarchical Classification of Protein Folds Using a Novel Ensemble Classifier
Published in
PLOS ONE, February 2013
DOI 10.1371/journal.pone.0056499
Pubmed ID
Authors

Chen Lin, Ying Zou, Ji Qin, Xiangrong Liu, Yi Jiang, Caihuan Ke, Quan Zou

Abstract

The analysis of biological information from protein sequences is important for the study of cellular functions and interactions, and protein fold recognition plays a key role in the prediction of protein structures. Unfortunately, the prediction of protein fold patterns is challenging due to the existence of compound protein structures. Here, we processed the latest release of the Structural Classification of Proteins (SCOP, version 1.75) database and exploited novel techniques to impressively increase the accuracy of protein fold classification. The techniques proposed in this paper include ensemble classifying and a hierarchical framework, in the first layer of which similar or redundant sequences were deleted in two manners; a set of base classifiers, fused by various selection strategies, divides the input into seven classes; in the second layer of which, an analogous ensemble method is adopted to predict all protein folds. To our knowledge, it is the first time all protein folds can be intelligently detected hierarchically. Compared with prior studies, our experimental results demonstrated the efficiency and effectiveness of our proposed method, which achieved a success rate of 74.21%, which is much higher than results obtained with previous methods (ranging from 45.6% to 70.5%). When applied to the second layer of classification, the prediction accuracy was in the range between 23.13% and 46.05%. This value, which may not be remarkably high, is scientifically admirable and encouraging as compared to the relatively low counts of proteins from most fold recognition programs. The web server Hierarchical Protein Fold Prediction (HPFP) is available at http://datamining.xmu.edu.cn/software/hpfp.

Mendeley readers

Mendeley readers

The data shown below were compiled from readership statistics for 50 Mendeley readers of this research output. Click here to see the associated Mendeley record.

Geographical breakdown

Country Count As %
Israel 1 2%
United States 1 2%
Australia 1 2%
Unknown 47 94%

Demographic breakdown

Readers by professional status Count As %
Student > Ph. D. Student 16 32%
Student > Master 7 14%
Researcher 6 12%
Student > Bachelor 4 8%
Student > Postgraduate 4 8%
Other 6 12%
Unknown 7 14%
Readers by discipline Count As %
Computer Science 24 48%
Agricultural and Biological Sciences 6 12%
Engineering 3 6%
Medicine and Dentistry 2 4%
Mathematics 1 2%
Other 5 10%
Unknown 9 18%