↓ Skip to main content

PLOS

Defining an Essence of Structure Determining Residue Contacts in Proteins

Overview of attention for article published in PLoS Computational Biology, December 2009
Altmetric Badge

Mentioned by

blogs
1 blog

Citations

dimensions_citation
40 Dimensions

Readers on

mendeley
80 Mendeley
citeulike
10 CiteULike
Title
Defining an Essence of Structure Determining Residue Contacts in Proteins
Published in
PLoS Computational Biology, December 2009
DOI 10.1371/journal.pcbi.1000584
Pubmed ID
Authors

R. Sathyapriya, Jose M. Duarte, Henning Stehr, Ioannis Filippis, Michael Lappe

Abstract

The network of native non-covalent residue contacts determines the three-dimensional structure of a protein. However, not all contacts are of equal structural significance, and little knowledge exists about a minimal, yet sufficient, subset required to define the global features of a protein. Characterisation of this "structural essence" has remained elusive so far: no algorithmic strategy has been devised to-date that could outperform a random selection in terms of 3D reconstruction accuracy (measured as the Ca RMSD). It is not only of theoretical interest (i.e., for design of advanced statistical potentials) to identify the number and nature of essential native contacts-such a subset of spatial constraints is very useful in a number of novel experimental methods (like EPR) which rely heavily on constraint-based protein modelling. To derive accurate three-dimensional models from distance constraints, we implemented a reconstruction pipeline using distance geometry. We selected a test-set of 12 protein structures from the four major SCOP fold classes and performed our reconstruction analysis. As a reference set, series of random subsets (ranging from 10% to 90% of native contacts) are generated for each protein, and the reconstruction accuracy is computed for each subset. We have developed a rational strategy, termed "cone-peeling" that combines sequence features and network descriptors to select minimal subsets that outperform the reference sets. We present, for the first time, a rational strategy to derive a structural essence of residue contacts and provide an estimate of the size of this minimal subset. Our algorithm computes sparse subsets capable of determining the tertiary structure at approximately 4.8 A Ca RMSD with as little as 8% of the native contacts (Ca-Ca and Cb-Cb). At the same time, a randomly chosen subset of native contacts needs about twice as many contacts to reach the same level of accuracy. This "structural essence" opens new avenues in the fields of structure prediction, empirical potentials and docking.

Mendeley readers

Mendeley readers

The data shown below were compiled from readership statistics for 80 Mendeley readers of this research output. Click here to see the associated Mendeley record.

Geographical breakdown

Country Count As %
United States 7 9%
United Kingdom 3 4%
Germany 2 3%
Switzerland 2 3%
Korea, Republic of 1 1%
Turkey 1 1%
Brazil 1 1%
Unknown 63 79%

Demographic breakdown

Readers by professional status Count As %
Student > Ph. D. Student 27 34%
Researcher 23 29%
Student > Bachelor 6 8%
Student > Postgraduate 6 8%
Professor 4 5%
Other 11 14%
Unknown 3 4%
Readers by discipline Count As %
Agricultural and Biological Sciences 37 46%
Computer Science 12 15%
Biochemistry, Genetics and Molecular Biology 9 11%
Chemistry 4 5%
Physics and Astronomy 3 4%
Other 11 14%
Unknown 4 5%