↓ Skip to main content

PLOS

Genomics and Privacy: Implications of the New Reality of Closed Data for the Field

Overview of attention for article published in PLoS Computational Biology, December 2011
Altmetric Badge

Mentioned by

blogs
2 blogs
twitter
37 X users
facebook
1 Facebook page
wikipedia
1 Wikipedia page
googleplus
1 Google+ user
q&a
2 Q&A threads

Citations

dimensions_citation
71 Dimensions

Readers on

mendeley
192 Mendeley
citeulike
11 CiteULike
Title
Genomics and Privacy: Implications of the New Reality of Closed Data for the Field
Published in
PLoS Computational Biology, December 2011
DOI 10.1371/journal.pcbi.1002278
Pubmed ID
Authors

Dov Greenbaum, Andrea Sboner, Xinmeng Jasmine Mu, Mark Gerstein

Abstract

Open source and open data have been driving forces in bioinformatics in the past. However, privacy concerns may soon change the landscape, limiting future access to important data sets, including personal genomics data. Here we survey this situation in some detail, describing, in particular, how the large scale of the data from personal genomic sequencing makes it especially hard to share data, exacerbating the privacy problem. We also go over various aspects of genomic privacy: first, there is basic identifiability of subjects having their genome sequenced. However, even for individuals who have consented to be identified, there is the prospect of very detailed future characterization of their genotype, which, unanticipated at the time of their consent, may be more personal and invasive than the release of their medical records. We go over various computational strategies for dealing with the issue of genomic privacy. One can "slice" and reformat datasets to allow them to be partially shared while securing the most private variants. This is particularly applicable to functional genomics information, which can be largely processed without variant information. For handling the most private data there are a number of legal and technological approaches-for example, modifying the informed consent procedure to acknowledge that privacy cannot be guaranteed, and/or employing a secure cloud computing environment. Cloud computing in particular may allow access to the data in a more controlled fashion than the current practice of downloading and computing on large datasets. Furthermore, it may be particularly advantageous for small labs, given that the burden of many privacy issues falls disproportionately on them in comparison to large corporations and genome centers. Finally, we discuss how education of future genetics researchers will be important, with curriculums emphasizing privacy and data security. However, teaching personal genomics with identifiable subjects in the university setting will, in turn, create additional privacy issues and social conundrums.

X Demographics

X Demographics

The data shown below were collected from the profiles of 37 X users who shared this research output. Click here to find out more about how the information was compiled.
Mendeley readers

Mendeley readers

The data shown below were compiled from readership statistics for 192 Mendeley readers of this research output. Click here to see the associated Mendeley record.

Geographical breakdown

Country Count As %
United States 13 7%
United Kingdom 5 3%
Sweden 3 2%
Spain 2 1%
France 2 1%
Canada 2 1%
Malaysia 1 <1%
Portugal 1 <1%
Hong Kong 1 <1%
Other 5 3%
Unknown 157 82%

Demographic breakdown

Readers by professional status Count As %
Researcher 57 30%
Student > Ph. D. Student 35 18%
Student > Master 23 12%
Student > Bachelor 17 9%
Other 12 6%
Other 31 16%
Unknown 17 9%
Readers by discipline Count As %
Agricultural and Biological Sciences 63 33%
Computer Science 34 18%
Biochemistry, Genetics and Molecular Biology 25 13%
Medicine and Dentistry 14 7%
Social Sciences 12 6%
Other 23 12%
Unknown 21 11%