↓ Skip to main content

PLOS

CGBayesNets: Conditional Gaussian Bayesian Network Learning and Inference with Mixed Discrete and Continuous Data

Overview of attention for article published in PLoS Computational Biology, June 2014
Altmetric Badge

Mentioned by

twitter
3 X users

Citations

dimensions_citation
60 Dimensions

Readers on

mendeley
158 Mendeley
citeulike
3 CiteULike
Title
CGBayesNets: Conditional Gaussian Bayesian Network Learning and Inference with Mixed Discrete and Continuous Data
Published in
PLoS Computational Biology, June 2014
DOI 10.1371/journal.pcbi.1003676
Pubmed ID
Authors

Michael J. McGeachie, Hsun-Hsien Chang, Scott T. Weiss

Abstract

Bayesian Networks (BN) have been a popular predictive modeling formalism in bioinformatics, but their application in modern genomics has been slowed by an inability to cleanly handle domains with mixed discrete and continuous variables. Existing free BN software packages either discretize continuous variables, which can lead to information loss, or do not include inference routines, which makes prediction with the BN impossible. We present CGBayesNets, a BN package focused around prediction of a clinical phenotype from mixed discrete and continuous variables, which fills these gaps. CGBayesNets implements Bayesian likelihood and inference algorithms for the conditional Gaussian Bayesian network (CGBNs) formalism, one appropriate for predicting an outcome of interest from, e.g., multimodal genomic data. We provide four different network learning algorithms, each making a different tradeoff between computational cost and network likelihood. CGBayesNets provides a full suite of functions for model exploration and verification, including cross validation, bootstrapping, and AUC manipulation. We highlight several results obtained previously with CGBayesNets, including predictive models of wood properties from tree genomics, leukemia subtype classification from mixed genomic data, and robust prediction of intensive care unit mortality outcomes from metabolomic profiles. We also provide detailed example analysis on public metabolomic and gene expression datasets. CGBayesNets is implemented in MATLAB and available as MATLAB source code, under an Open Source license and anonymous download at http://www.cgbayesnets.com.

X Demographics

X Demographics

The data shown below were collected from the profiles of 3 X users who shared this research output. Click here to find out more about how the information was compiled.
Mendeley readers

Mendeley readers

The data shown below were compiled from readership statistics for 158 Mendeley readers of this research output. Click here to see the associated Mendeley record.

Geographical breakdown

Country Count As %
United Kingdom 3 2%
Brazil 2 1%
United States 2 1%
Spain 1 <1%
Belarus 1 <1%
Unknown 149 94%

Demographic breakdown

Readers by professional status Count As %
Student > Ph. D. Student 46 29%
Researcher 29 18%
Student > Master 17 11%
Professor > Associate Professor 8 5%
Student > Postgraduate 7 4%
Other 32 20%
Unknown 19 12%
Readers by discipline Count As %
Agricultural and Biological Sciences 34 22%
Computer Science 21 13%
Engineering 21 13%
Medicine and Dentistry 17 11%
Biochemistry, Genetics and Molecular Biology 11 7%
Other 28 18%
Unknown 26 16%