↓ Skip to main content

PLOS

Knowledge-Based Reconstruction of mRNA Transcripts with Short Sequencing Reads for Transcriptome Research

Overview of attention for article published in PLOS ONE, February 2012
Altmetric Badge

Mentioned by

twitter
2 X users

Readers on

mendeley
48 Mendeley
citeulike
1 CiteULike
Title
Knowledge-Based Reconstruction of mRNA Transcripts with Short Sequencing Reads for Transcriptome Research
Published in
PLOS ONE, February 2012
DOI 10.1371/journal.pone.0031440
Pubmed ID
Authors

Junhee Seok, Weihong Xu, Hui Jiang, Ronald W. Davis, Wenzhong Xiao

Abstract

While most transcriptome analyses in high-throughput clinical studies focus on gene level expression, the existence of alternative isoforms of gene transcripts is a major source of the diversity in the biological functionalities of the human genome. It is, therefore, essential to annotate isoforms of gene transcripts for genome-wide transcriptome studies. Recently developed mRNA sequencing technology presents an unprecedented opportunity to discover new forms of transcripts, and at the same time brings bioinformatic challenges due to its short read length and incomplete coverage for the transcripts. In this work, we proposed a computational approach to reconstruct new mRNA transcripts from short sequencing reads with reference information of known transcripts in existing databases. The prior knowledge helped to define exon boundaries and fill in the transcript regions not covered by sequencing data. This approach was demonstrated using a deep sequencing data set of human muscle tissue with transcript annotations in RefSeq as prior knowledge. We identified 2,973 junctions, 7,471 exons, and 7,571 transcripts not previously annotated in RefSeq. 73% of these new transcripts found supports from UCSC Known Genes, Ensembl or EST transcript annotations. In addition, the reconstructed transcripts were much longer than those from de novo approaches that assume no prior knowledge. These previously un-annotated transcripts can be integrated with known transcript annotations to improve both the design of microarrays and the follow-up analyses of isoform expression. The overall results demonstrated that incorporating transcript annotations from genomic databases significantly helps the reconstruction of novel transcripts from short sequencing reads for transcriptome research.

X Demographics

X Demographics

The data shown below were collected from the profiles of 2 X users who shared this research output. Click here to find out more about how the information was compiled.
Mendeley readers

Mendeley readers

The data shown below were compiled from readership statistics for 48 Mendeley readers of this research output. Click here to see the associated Mendeley record.

Geographical breakdown

Country Count As %
United States 4 8%
Brazil 2 4%
Switzerland 1 2%
Italy 1 2%
Denmark 1 2%
Czechia 1 2%
Unknown 38 79%

Demographic breakdown

Readers by professional status Count As %
Researcher 20 42%
Student > Ph. D. Student 11 23%
Professor > Associate Professor 5 10%
Other 3 6%
Student > Master 3 6%
Other 5 10%
Unknown 1 2%
Readers by discipline Count As %
Agricultural and Biological Sciences 33 69%
Biochemistry, Genetics and Molecular Biology 6 13%
Medicine and Dentistry 2 4%
Computer Science 1 2%
Social Sciences 1 2%
Other 3 6%
Unknown 2 4%