PLOS

PUMA: A Unified Framework for Penalized Multiple Regression Analysis of GWAS Data

Overview of attention for article published in PLoS Computational Biology, June 2013

Mentioned by

1 news outlet
1 blog
1 X user

Citations

38 Dimensions

Readers on

87 Mendeley
2 CiteULike
Title
PUMA: A Unified Framework for Penalized Multiple Regression Analysis of GWAS Data
Published in
PLoS Computational Biology, June 2013
DOI 10.1371/journal.pcbi.1003101
Pubmed ID
Authors

Gabriel E. Hoffman, Benjamin A. Logsdon, Jason G. Mezey

Abstract

Penalized Multiple Regression (PMR) can be used to discover novel disease associations in GWAS datasets. In practice, proposed PMR methods have not been able to identify well-supported associations in GWAS that are undetectable by standard association tests and thus these methods are not widely applied. Here, we present a combined algorithmic and heuristic framework for PUMA (Penalized Unified Multiple-locus Association) analysis that solves the problems of previously proposed methods including computational speed, poor performance on genome-scale simulated data, and identification of too many associations for real data to be biologically plausible. The framework includes a new minorize-maximization (MM) algorithm for generalized linear models (GLM) combined with heuristic model selection and testing methods for identification of robust associations. The PUMA framework implements the penalized maximum likelihood penalties previously proposed for GWAS analysis (i.e. Lasso, Adaptive Lasso, NEG, MCP), as well as a penalty that has not been previously applied to GWAS (i.e. LOG). Using simulations that closely mirror real GWAS data, we show that our framework has high performance and reliably increases power to detect weak associations, while existing PMR methods can perform worse than single marker testing in overall performance. To demonstrate the empirical value of PUMA, we analyzed GWAS data for type 1 diabetes, Crohn's disease, and rheumatoid arthritis, three autoimmune diseases from the original Wellcome Trust Case Control Consortium.
Our analysis replicates known associations for these diseases and we discover novel etiologically relevant susceptibility loci that are invisible to standard single marker tests, including six novel associations implicating genes involved in pancreatic function, insulin pathways and immune-cell function in type 1 diabetes; three novel associations implicating genes in pro- and anti-inflammatory pathways in Crohn's disease; and one novel association implicating a gene involved in apoptosis pathways in rheumatoid arthritis. We provide software for applying our PUMA analysis framework.
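To make the idea of penalized multiple regression concrete, the following is a minimal sketch of a Lasso-penalized fit in which all markers enter one model and weak coefficients are shrunk to exactly zero. It is not the PUMA software or its MM algorithm for GLMs: it assumes a squared-error loss on a quantitative trait and uses plain cyclic coordinate descent, and every name in it (`soft_threshold`, `lasso_cd`, `simulate`, the causal marker indices, the effect size, the penalty value) is hypothetical and chosen only for illustration.

```python
import random


def soft_threshold(z, lam):
    """Shrink z toward zero by lam; values within [-lam, lam] become exactly 0."""
    if z > lam:
        return z - lam
    if z < -lam:
        return z + lam
    return 0.0


def lasso_cd(X, y, lam, n_iter=100):
    """Minimize (1/2n)*||y - X b||^2 + lam*||b||_1 by cyclic coordinate descent."""
    n, p = len(X), len(X[0])
    beta = [0.0] * p
    resid = list(y)  # residuals y - X b, with b = 0 at the start
    col_sq = [sum(X[i][j] ** 2 for i in range(n)) / n for j in range(p)]
    for _ in range(n_iter):
        for j in range(p):
            if col_sq[j] == 0.0:
                continue  # a marker with no variation carries no information
            # Correlation of marker j with the residual that excludes marker j.
            rho = sum(X[i][j] * resid[i] for i in range(n)) / n + col_sq[j] * beta[j]
            new = soft_threshold(rho, lam) / col_sq[j]
            delta = new - beta[j]
            if delta != 0.0:
                for i in range(n):
                    resid[i] -= X[i][j] * delta
                beta[j] = new
    return beta


def center(values):
    """Subtract the mean so that no intercept term is needed."""
    m = sum(values) / len(values)
    return [v - m for v in values]


def simulate(n=200, p=20, causal=(3, 11), effect=0.8, maf=0.3, seed=0):
    """Toy data (assumed setup): genotypes coded 0/1/2, two causal markers."""
    rng = random.Random(seed)
    G = [[(rng.random() < maf) + (rng.random() < maf) for _ in range(p)]
         for _ in range(n)]
    y = [sum(effect * row[j] for j in causal) + rng.gauss(0.0, 1.0) for row in G]
    cols = [center([row[j] for row in G]) for j in range(p)]
    X = [[cols[j][i] for j in range(p)] for i in range(n)]
    return X, center(y)


X, y = simulate()
beta = lasso_cd(X, y, lam=0.1)
selected = [j for j, b in enumerate(beta) if b != 0.0]
print("markers with nonzero coefficients:", selected)
```

In this sketch the penalty weight `lam` controls how many markers stay in the model: once `lam` reaches the largest absolute marker-trait covariance, every coefficient is zero, and lowering it admits progressively more markers. The other penalties named in the abstract (Adaptive Lasso, NEG, MCP, LOG) would replace the soft-thresholding step with a different shrinkage rule; those are not shown here.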

X Demographics


The data shown below were collected from the profile of 1 X user who shared this research output.
Mendeley readers


The data shown below were compiled from readership statistics for 87 Mendeley readers of this research output.

Geographical breakdown

Country           Count   As %
United States         3     3%
United Kingdom        1     1%
Switzerland           1     1%
Unknown              82    94%

Demographic breakdown

Readers by professional status

Professional status                Count   As %
Researcher                            29    33%
Student > Ph. D. Student              21    24%
Professor > Associate Professor        9    10%
Student > Master                       7     8%
Student > Bachelor                     4     5%
Other                                 12    14%
Unknown                                5     6%

Readers by discipline

Discipline                                      Count   As %
Agricultural and Biological Sciences               36    41%
Computer Science                                   12    14%
Mathematics                                         8     9%
Biochemistry, Genetics and Molecular Biology        7     8%
Medicine and Dentistry                              6     7%
Other                                              11    13%
Unknown                                             7     8%