↓ Skip to main content

PLOS

Biomarker Discovery by Sparse Canonical Correlation Analysis of Complex Clinical Phenotypes of Tuberculosis and Malaria

Overview of attention for article published in PLoS Computational Biology, April 2013
Altmetric Badge

Mentioned by

twitter
2 X users

Citations

dimensions_citation
20 Dimensions

Readers on

mendeley
83 Mendeley
citeulike
3 CiteULike
Title
Biomarker Discovery by Sparse Canonical Correlation Analysis of Complex Clinical Phenotypes of Tuberculosis and Malaria
Published in
PLoS Computational Biology, April 2013
DOI 10.1371/journal.pcbi.1003018
Pubmed ID
Authors

Juho Rousu, Daniel D. Agranoff, Olugbemiro Sodeinde, John Shawe-Taylor, Delmiro Fernandez-Reyes

Abstract

Biomarker discovery aims to find small subsets of relevant variables in 'omics data that correlate with the clinical syndromes of interest. Despite the fact that clinical phenotypes are usually characterized by a complex set of clinical parameters, current computational approaches assume univariate targets, e.g. diagnostic classes, against which associations are sought for. We propose an approach based on asymmetrical sparse canonical correlation analysis (SCCA) that finds multivariate correlations between the 'omics measurements and the complex clinical phenotypes. We correlated plasma proteomics data to multivariate overlapping complex clinical phenotypes from tuberculosis and malaria datasets. We discovered relevant 'omic biomarkers that have a high correlation to profiles of clinical measurements and are remarkably sparse, containing 1.5-3% of all 'omic variables. We show that using clinical view projections we obtain remarkable improvements in diagnostic class prediction, up to 11% in tuberculosis and up to 5% in malaria. Our approach finds proteomic-biomarkers that correlate with complex combinations of clinical-biomarkers. Using the clinical-biomarkers improves the accuracy of diagnostic class prediction while not requiring the measurement plasma proteomic profiles of each subject. Our approach makes it feasible to use omics' data to build accurate diagnostic algorithms that can be deployed to community health centres lacking the expensive 'omics measurement capabilities.

X Demographics

X Demographics

The data shown below were collected from the profiles of 2 X users who shared this research output. Click here to find out more about how the information was compiled.
Mendeley readers

Mendeley readers

The data shown below were compiled from readership statistics for 83 Mendeley readers of this research output. Click here to see the associated Mendeley record.

Geographical breakdown

Country Count As %
India 3 4%
United States 2 2%
United Kingdom 1 1%
Japan 1 1%
Belgium 1 1%
Unknown 75 90%

Demographic breakdown

Readers by professional status Count As %
Researcher 18 22%
Student > Ph. D. Student 16 19%
Student > Master 10 12%
Student > Bachelor 8 10%
Professor > Associate Professor 6 7%
Other 15 18%
Unknown 10 12%
Readers by discipline Count As %
Agricultural and Biological Sciences 22 27%
Medicine and Dentistry 15 18%
Computer Science 15 18%
Mathematics 6 7%
Biochemistry, Genetics and Molecular Biology 6 7%
Other 7 8%
Unknown 12 14%