Title |
Genome Modeling System: A Knowledge Management Platform for Genomics
|
---|---|
Published in |
PLoS Computational Biology, July 2015
|
DOI | 10.1371/journal.pcbi.1004274 |
Pubmed ID | |
Authors |
Malachi Griffith, Obi L. Griffith, Scott M. Smith, Avinash Ramu, Matthew B. Callaway, Anthony M. Brummett, Michael J. Kiwala, Adam C. Coffman, Allison A. Regier, Ben J. Oberkfell, Gabriel E. Sanderson, Thomas P. Mooney, Nathaniel G. Nutter, Edward A. Belter, Feiyu Du, Robert L. Long, Travis E. Abbott, Ian T. Ferguson, David L. Morton, Mark M. Burnett, James V. Weible, Joshua B. Peck, Adam Dukes, Joshua F. McMichael, Justin T. Lolofie, Brian R. Derickson, Jasreet Hundal, Zachary L. Skidmore, Benjamin J. Ainscough, Nathan D. Dees, William S. Schierding, Cyriac Kandoth, Kyung H. Kim, Charles Lu, Christopher C. Harris, Nicole Maher, Christopher A. Maher, Vincent J. Magrini, Benjamin S. Abbott, Ken Chen, Eric Clark, Indraniel Das, Xian Fan, Amy E. Hawkins, Todd G. Hepler, Todd N. Wylie, Shawn M. Leonard, William E. Schroeder, Xiaoqi Shi, Lynn K. Carmichael, Matthew R. Weil, Richard W. Wohlstadter, Gary Stiehr, Michael D. McLellan, Craig S. Pohl, Christopher A. Miller, Daniel C. Koboldt, Jason R. Walker, James M. Eldred, David E. Larson, David J. Dooling, Li Ding, Elaine R. Mardis, Richard K. Wilson |
Abstract |
In this work, we present the Genome Modeling System (GMS), an analysis information management system capable of executing automated genome analysis pipelines at a massive scale. The GMS framework provides detailed tracking of samples and data coupled with reliable and repeatable analysis pipelines. The GMS also serves as a platform for bioinformatics development, allowing a large team to collaborate on data analysis, or an individual researcher to leverage the work of others effectively within its data management system. Rather than separating ad-hoc analysis from rigorous, reproducible pipelines, the GMS promotes systematic integration between the two. As a demonstration of the GMS, we performed an integrated analysis of whole genome, exome and transcriptome sequencing data from a breast cancer cell line (HCC1395) and matched lymphoblastoid line (HCC1395BL). These data are available for users to test the software, complete tutorials and develop novel GMS pipeline configurations. The GMS is available at https://github.com/genome/gms. |
X Demographics
Geographical breakdown
Country | Count | As % |
---|---|---|
United States | 35 | 38% |
France | 5 | 5% |
United Kingdom | 4 | 4% |
India | 4 | 4% |
Germany | 3 | 3% |
Netherlands | 2 | 2% |
Belgium | 2 | 2% |
New Zealand | 2 | 2% |
Spain | 2 | 2% |
Other | 6 | 7% |
Unknown | 26 | 29% |
Demographic breakdown
Type | Count | As % |
---|---|---|
Scientists | 49 | 54% |
Members of the public | 40 | 44% |
Practitioners (doctors, other healthcare professionals) | 1 | 1% |
Science communicators (journalists, bloggers, editors) | 1 | 1% |
Mendeley readers
Geographical breakdown
Country | Count | As % |
---|---|---|
United States | 4 | 3% |
United Kingdom | 3 | 2% |
Taiwan | 2 | 1% |
Switzerland | 1 | <1% |
Brazil | 1 | <1% |
Germany | 1 | <1% |
Spain | 1 | <1% |
Luxembourg | 1 | <1% |
Unknown | 141 | 91% |
Demographic breakdown
Readers by professional status | Count | As % |
---|---|---|
Researcher | 47 | 30% |
Student > Ph. D. Student | 24 | 15% |
Student > Master | 16 | 10% |
Student > Bachelor | 13 | 8% |
Other | 9 | 6% |
Other | 25 | 16% |
Unknown | 21 | 14% |
Readers by discipline | Count | As % |
---|---|---|
Agricultural and Biological Sciences | 44 | 28% |
Biochemistry, Genetics and Molecular Biology | 27 | 17% |
Computer Science | 16 | 10% |
Medicine and Dentistry | 13 | 8% |
Engineering | 5 | 3% |
Other | 23 | 15% |
Unknown | 27 | 17% |