↓ Skip to main content

PLOS

LBoost: A Boosting Algorithm with Application for Epistasis Discovery

Overview of attention for article published in PLOS ONE, November 2012
Altmetric Badge

Mentioned by

twitter
2 X users

Readers on

mendeley
19 Mendeley
Title
LBoost: A Boosting Algorithm with Application for Epistasis Discovery
Published in
PLOS ONE, November 2012
DOI 10.1371/journal.pone.0047281
Pubmed ID
Authors

Bethany J. Wolf, Elizabeth G. Hill, Elizabeth H. Slate, Carola A. Neumann, Emily Kistner-Griffin

Abstract

Many human diseases are attributable to complex interactions among genetic and environmental factors. Statistical tools capable of modeling such complex interactions are necessary to improve identification of genetic factors that increase a patient's risk of disease. Logic Forest (LF), a bagging ensemble algorithm based on logic regression (LR), is able to discover interactions among binary variables predictive of response such as the biologic interactions that predispose individuals to disease. However, LF's ability to recover interactions degrades for more infrequently occurring interactions. A rare genetic interaction may occur if, for example, the interaction increases disease risk in a patient subpopulation that represents only a small proportion of the overall patient population. We present an alternative ensemble adaptation of LR based on boosting rather than bagging called LBoost. We compare the ability of LBoost and LF to identify variable interactions in simulation studies. Results indicate that LBoost is superior to LF for identifying genetic interactions associated with disease that are infrequent in the population. We apply LBoost to a subset of single nucleotide polymorphisms on the PRDX genes from the Cancer Genetic Markers of Susceptibility Breast Cancer Scan to investigate genetic risk for breast cancer. LBoost is publicly available on CRAN as part of the LogicForest package, http://cran.r-project.org/.

X Demographics

X Demographics

The data shown below were collected from the profiles of 2 X users who shared this research output. Click here to find out more about how the information was compiled.
Mendeley readers

Mendeley readers

The data shown below were compiled from readership statistics for 19 Mendeley readers of this research output. Click here to see the associated Mendeley record.

Geographical breakdown

Country Count As %
Mexico 1 5%
Unknown 18 95%

Demographic breakdown

Readers by professional status Count As %
Researcher 7 37%
Student > Ph. D. Student 3 16%
Student > Bachelor 2 11%
Professor > Associate Professor 2 11%
Student > Master 2 11%
Other 1 5%
Unknown 2 11%
Readers by discipline Count As %
Agricultural and Biological Sciences 6 32%
Mathematics 3 16%
Biochemistry, Genetics and Molecular Biology 2 11%
Medicine and Dentistry 2 11%
Computer Science 1 5%
Other 1 5%
Unknown 4 21%