Report for: Prediction and Validation of Gene-Disease Associations Using Methods Inspired by Social Network Analyses

Title	Prediction and Validation of Gene-Disease Associations Using Methods Inspired by Social Network Analyses
Published in	PLOS ONE, May 2013
DOI	10.1371/journal.pone.0058977
Pubmed ID	23650495
Authors	U. Martin Singh-Blom, Nagarajan Natarajan, Ambuj Tewari, John O. Woods, Inderjit S. Dhillon, Edward M. Marcotte
Abstract	Correctly identifying associations of genes with diseases has long been a goal in biology. With the emergence of large-scale gene-phenotype association datasets in biology, we can leverage statistical and machine learning methods to help us achieve this goal. In this paper, we present two methods for predicting gene-disease associations based on functional gene associations and gene-phenotype associations in model organisms. The first method, the Katz measure, is motivated from its success in social network link prediction, and is very closely related to some of the recent methods proposed for gene-disease association inference. The second method, called Catapult (Combining dATa Across species using Positive-Unlabeled Learning Techniques), is a supervised machine learning method that uses a biased support vector machine where the features are derived from walks in a heterogeneous gene-trait network. We study the performance of the proposed methods and related state-of-the-art methods using two different evaluation strategies, on two distinct data sets, namely OMIM phenotypes and drug-target interactions. Finally, by measuring the performance of the methods using two different evaluation strategies, we show that even though both methods perform very well, the Katz measure is better at identifying associations between traits and poorly studied genes, whereas Catapult is better suited to correctly identifying gene-trait associations overall. The authors want to thank Jon Laurent and Kris McGary for some of the data used, and Li and Patra for making their code available. Most of Ambuj Tewari's contribution to this work happened while he was a postdoctoral fellow at the University of Texas at Austin.

View on publisher site Alert me about new mentions

X Demographics

The data shown below were collected from the profiles of 2 X users who shared this research output. Click here to find out more about how the information was compiled.

Geographical breakdown

Country	Count	As %
Israel	1	50%
United States	1	50%

Demographic breakdown

Type	Count	As %
Members of the public	1	50%
Scientists	1	50%

Mendeley readers

The data shown below were compiled from readership statistics for 171 Mendeley readers of this research output. Click here to see the associated Mendeley record.

Geographical breakdown

Country	Count	As %
United States	3	2%
Brazil	2	1%
United Kingdom	2	1%
Canada	1	<1%
Korea, Republic of	1	<1%
Japan	1	<1%
Spain	1	<1%
Unknown	160	94%

Demographic breakdown

Readers by professional status	Count	As %
Student > Ph. D. Student	54	32%
Researcher	25	15%
Professor > Associate Professor	14	8%
Student > Master	14	8%
Student > Bachelor	9	5%
Other	26	15%
Unknown	29	17%

Readers by discipline	Count	As %
Computer Science	48	28%
Agricultural and Biological Sciences	33	19%
Biochemistry, Genetics and Molecular Biology	19	11%
Medicine and Dentistry	7	4%
Social Sciences	5	3%
Other	25	15%
Unknown	34	20%

PLOS

Article Metrics

Prediction and Validation of Gene-Disease Associations Using Methods Inspired by Social Network Analyses

Mentioned by

Citations

Readers on

X Demographics

Geographical breakdown

Demographic breakdown

Mendeley readers

Geographical breakdown

Demographic breakdown