↓ Skip to main content

PLOS

A Mathematical Framework for Protein Structure Comparison

Overview of attention for article published in PLoS Computational Biology, February 2011
Altmetric Badge

Mentioned by

twitter
3 X users
f1000
1 research highlight platform

Citations

dimensions_citation
37 Dimensions

Readers on

mendeley
109 Mendeley
citeulike
4 CiteULike
Title
A Mathematical Framework for Protein Structure Comparison
Published in
PLoS Computational Biology, February 2011
DOI 10.1371/journal.pcbi.1001075
Pubmed ID
Authors

Wei Liu, Anuj Srivastava, Jinfeng Zhang

Abstract

Comparison of protein structures is important for revealing the evolutionary relationship among proteins, predicting protein functions and predicting protein structures. Many methods have been developed in the past to align two or multiple protein structures. Despite the importance of this problem, rigorous mathematical or statistical frameworks have seldom been pursued for general protein structure comparison. One notable issue in this field is that with many different distances used to measure the similarity between protein structures, none of them are proper distances when protein structures of different sequences are compared. Statistical approaches based on those non-proper distances or similarity scores as random variables are thus not mathematically rigorous. In this work, we develop a mathematical framework for protein structure comparison by treating protein structures as three-dimensional curves. Using an elastic Riemannian metric on spaces of curves, geodesic distance, a proper distance on spaces of curves, can be computed for any two protein structures. In this framework, protein structures can be treated as random variables on the shape manifold, and means and covariance can be computed for populations of protein structures. Furthermore, these moments can be used to build Gaussian-type probability distributions of protein structures for use in hypothesis testing. The covariance of a population of protein structures can reveal the population-specific variations and be helpful in improving structure classification. With curves representing protein structures, the matching is performed using elastic shape analysis of curves, which can effectively model conformational changes and insertions/deletions. We show that our method performs comparably with commonly used methods in protein structure classification on a large manually annotated data set.

X Demographics

X Demographics

The data shown below were collected from the profiles of 3 X users who shared this research output. Click here to find out more about how the information was compiled.
Mendeley readers

Mendeley readers

The data shown below were compiled from readership statistics for 109 Mendeley readers of this research output. Click here to see the associated Mendeley record.

Geographical breakdown

Country Count As %
United States 8 7%
United Kingdom 3 3%
China 2 2%
Canada 2 2%
Germany 1 <1%
India 1 <1%
Portugal 1 <1%
France 1 <1%
Argentina 1 <1%
Other 3 3%
Unknown 86 79%

Demographic breakdown

Readers by professional status Count As %
Researcher 35 32%
Student > Ph. D. Student 25 23%
Student > Bachelor 10 9%
Professor 8 7%
Student > Doctoral Student 7 6%
Other 22 20%
Unknown 2 2%
Readers by discipline Count As %
Agricultural and Biological Sciences 34 31%
Computer Science 22 20%
Biochemistry, Genetics and Molecular Biology 16 15%
Chemistry 8 7%
Physics and Astronomy 7 6%
Other 16 15%
Unknown 6 6%