↓ Skip to main content

PLOS

Determining Effects of Non-synonymous SNPs on Protein-Protein Interactions using Supervised and Semi-supervised Learning

Overview of attention for article published in PLoS Computational Biology, May 2014
Altmetric Badge

Mentioned by

twitter
26 X users
patent
2 patents
googleplus
1 Google+ user

Readers on

mendeley
155 Mendeley
citeulike
6 CiteULike
Title
Determining Effects of Non-synonymous SNPs on Protein-Protein Interactions using Supervised and Semi-supervised Learning
Published in
PLoS Computational Biology, May 2014
DOI 10.1371/journal.pcbi.1003592
Pubmed ID
Authors

Nan Zhao, Jing Ginger Han, Chi-Ren Shyu, Dmitry Korkin

Abstract

Single nucleotide polymorphisms (SNPs) are among the most common types of genetic variation in complex genetic disorders. A growing number of studies link the functional role of SNPs with the networks and pathways mediated by the disease-associated genes. For example, many non-synonymous missense SNPs (nsSNPs) have been found near or inside the protein-protein interaction (PPI) interfaces. Determining whether such nsSNP will disrupt or preserve a PPI is a challenging task to address, both experimentally and computationally. Here, we present this task as three related classification problems, and develop a new computational method, called the SNP-IN tool (non-synonymous SNP INteraction effect predictor). Our method predicts the effects of nsSNPs on PPIs, given the interaction's structure. It leverages supervised and semi-supervised feature-based classifiers, including our new Random Forest self-learning protocol. The classifiers are trained based on a dataset of comprehensive mutagenesis studies for 151 PPI complexes, with experimentally determined binding affinities of the mutant and wild-type interactions. Three classification problems were considered: (1) a 2-class problem (strengthening/weakening PPI mutations), (2) another 2-class problem (mutations that disrupt/preserve a PPI), and (3) a 3-class classification (detrimental/neutral/beneficial mutation effects). In total, 11 different supervised and semi-supervised classifiers were trained and assessed resulting in a promising performance, with the weighted f-measure ranging from 0.87 for Problem 1 to 0.70 for the most challenging Problem 3. By integrating prediction results of the 2-class classifiers into the 3-class classifier, we further improved its performance for Problem 3. To demonstrate the utility of SNP-IN tool, it was applied to study the nsSNP-induced rewiring of two disease-centered networks. The accurate and balanced performance of SNP-IN tool makes it readily available to study the rewiring of large-scale protein-protein interaction networks, and can be useful for functional annotation of disease-associated SNPs. SNIP-IN tool is freely accessible as a web-server at http://korkinlab.org/snpintool/.

X Demographics

X Demographics

The data shown below were collected from the profiles of 26 X users who shared this research output. Click here to find out more about how the information was compiled.
Mendeley readers

Mendeley readers

The data shown below were compiled from readership statistics for 155 Mendeley readers of this research output. Click here to see the associated Mendeley record.

Geographical breakdown

Country Count As %
United States 6 4%
United Kingdom 2 1%
India 1 <1%
Spain 1 <1%
Netherlands 1 <1%
Unknown 144 93%

Demographic breakdown

Readers by professional status Count As %
Student > Ph. D. Student 33 21%
Student > Master 31 20%
Researcher 21 14%
Student > Bachelor 14 9%
Professor 8 5%
Other 21 14%
Unknown 27 17%
Readers by discipline Count As %
Agricultural and Biological Sciences 44 28%
Biochemistry, Genetics and Molecular Biology 32 21%
Computer Science 22 14%
Engineering 9 6%
Medicine and Dentistry 6 4%
Other 12 8%
Unknown 30 19%