↓ Skip to main content

PLOS

Predicting Disease Risk Using Bootstrap Ranking and Classification Algorithms

Overview of attention for article published in PLoS Computational Biology, August 2013
Altmetric Badge

Mentioned by

twitter
2 X users

Citations

dimensions_citation
22 Dimensions

Readers on

mendeley
99 Mendeley
citeulike
3 CiteULike
Title
Predicting Disease Risk Using Bootstrap Ranking and Classification Algorithms
Published in
PLoS Computational Biology, August 2013
DOI 10.1371/journal.pcbi.1003200
Pubmed ID
Authors

Ohad Manor, Eran Segal

Abstract

Genome-wide association studies (GWAS) are widely used to search for genetic loci that underlie human disease. Another goal is to predict disease risk for different individuals given their genetic sequence. Such predictions could either be used as a "black box" in order to promote changes in life-style and screening for early diagnosis, or as a model that can be studied to better understand the mechanism of the disease. Current methods for risk prediction typically rank single nucleotide polymorphisms (SNPs) by the p-value of their association with the disease, and use the top-associated SNPs as input to a classification algorithm. However, the predictive power of such methods is relatively poor. To improve the predictive power, we devised BootRank, which uses bootstrapping in order to obtain a robust prioritization of SNPs for use in predictive models. We show that BootRank improves the ability to predict disease risk of unseen individuals in the Wellcome Trust Case Control Consortium (WTCCC) data and results in a more robust set of SNPs and a larger number of enriched pathways being associated with the different diseases. Finally, we show that combining BootRank with seven different classification algorithms improves performance compared to previous studies that used the WTCCC data. Notably, diseases for which BootRank results in the largest improvements were recently shown to have more heritability than previously thought, likely due to contributions from variants with low minimum allele frequency (MAF), suggesting that BootRank can be beneficial in cases where SNPs affecting the disease are poorly tagged or have low MAF. Overall, our results show that improving disease risk prediction from genotypic information may be a tangible goal, with potential implications for personalized disease screening and treatment.

X Demographics

X Demographics

The data shown below were collected from the profiles of 2 X users who shared this research output. Click here to find out more about how the information was compiled.
Mendeley readers

Mendeley readers

The data shown below were compiled from readership statistics for 99 Mendeley readers of this research output. Click here to see the associated Mendeley record.

Geographical breakdown

Country Count As %
United States 4 4%
Israel 2 2%
Switzerland 1 1%
Sweden 1 1%
India 1 1%
Germany 1 1%
New Zealand 1 1%
United Kingdom 1 1%
Unknown 87 88%

Demographic breakdown

Readers by professional status Count As %
Student > Ph. D. Student 29 29%
Researcher 24 24%
Student > Master 16 16%
Professor 6 6%
Professor > Associate Professor 6 6%
Other 12 12%
Unknown 6 6%
Readers by discipline Count As %
Agricultural and Biological Sciences 38 38%
Biochemistry, Genetics and Molecular Biology 13 13%
Computer Science 12 12%
Engineering 5 5%
Medicine and Dentistry 5 5%
Other 14 14%
Unknown 12 12%