Title |
RNA-Seq Mapping and Detection of Gene Fusions with a Suffix Array Algorithm
|
---|---|
Published in |
PLoS Computational Biology, April 2012
|
DOI | 10.1371/journal.pcbi.1002464 |
Pubmed ID | |
Authors |
Onur Sakarya, Heinz Breu, Milan Radovich, Yongzhi Chen, Yulei N. Wang, Catalin Barbacioru, Sowmi Utiramerur, Penn P. Whitley, Joel P. Brockman, Paolo Vatta, Zheng Zhang, Liviu Popescu, Matthew W. Muller, Vidya Kudlingar, Nriti Garg, Chieh-Yuan Li, Benjamin S. Kong, John P. Bodeau, Robert C. Nutter, Jian Gu, Kelli S. Bramlett, Jeffrey K. Ichikawa, Fiona C. Hyland, Asim S. Siddiqui |
Abstract |
High-throughput RNA sequencing enables quantification of transcripts (both known and novel), exon/exon junctions and fusions of exons from different genes. Discovery of gene fusions-particularly those expressed with low abundance- is a challenge with short- and medium-length sequencing reads. To address this challenge, we implemented an RNA-Seq mapping pipeline within the LifeScope software. We introduced new features including filter and junction mapping, annotation-aided pairing rescue and accurate mapping quality values. We combined this pipeline with a Suffix Array Spliced Read (SASR) aligner to detect chimeric transcripts. Performing paired-end RNA-Seq of the breast cancer cell line MCF-7 using the SOLiD system, we called 40 gene fusions among over 120,000 splicing junctions. We validated 36 of these 40 fusions with TaqMan assays, of which 25 were expressed in MCF-7 but not the Human Brain Reference. An intra-chromosomal gene fusion involving the estrogen receptor alpha gene ESR1, and another involving the RPS6KB1 (Ribosomal protein S6 kinase beta-1) were recurrently expressed in a number of breast tumor cell lines and a clinical tumor sample. |
X Demographics
Geographical breakdown
Country | Count | As % |
---|---|---|
United States | 3 | 23% |
Italy | 1 | 8% |
Canada | 1 | 8% |
Germany | 1 | 8% |
Unknown | 7 | 54% |
Demographic breakdown
Type | Count | As % |
---|---|---|
Members of the public | 8 | 62% |
Scientists | 5 | 38% |
Mendeley readers
Geographical breakdown
Country | Count | As % |
---|---|---|
United States | 7 | 5% |
Germany | 3 | 2% |
United Kingdom | 2 | 2% |
Italy | 2 | 2% |
Netherlands | 1 | <1% |
Norway | 1 | <1% |
Brazil | 1 | <1% |
Sweden | 1 | <1% |
Switzerland | 1 | <1% |
Other | 4 | 3% |
Unknown | 106 | 82% |
Demographic breakdown
Readers by professional status | Count | As % |
---|---|---|
Researcher | 40 | 31% |
Student > Ph. D. Student | 26 | 20% |
Student > Master | 12 | 9% |
Professor > Associate Professor | 11 | 9% |
Professor | 8 | 6% |
Other | 22 | 17% |
Unknown | 10 | 8% |
Readers by discipline | Count | As % |
---|---|---|
Agricultural and Biological Sciences | 67 | 52% |
Biochemistry, Genetics and Molecular Biology | 20 | 16% |
Computer Science | 15 | 12% |
Medicine and Dentistry | 9 | 7% |
Engineering | 2 | 2% |
Other | 4 | 3% |
Unknown | 12 | 9% |