Modeling ChIP Sequencing In Silico with Applications

Overview of attention for article published in PLoS Computational Biology, August 2008

Altmetric Badge

Mentioned by

patent: 6 patents

Citations

dimensions_citation: 73 Dimensions

Readers on

mendeley: 222 Mendeley
citeulike: 18 CiteULike
connotea: 1 Connotea

Summary Patents Misc. Dimensions citations

Title	Modeling ChIP Sequencing In Silico with Applications
Published in	PLoS Computational Biology, August 2008
DOI	10.1371/journal.pcbi.1000158
Pubmed ID	18725927
Authors	Zhengdong D. Zhang, Joel Rozowsky, Michael Snyder, Joseph Chang, Mark Gerstein
Abstract	ChIP sequencing (ChIP-seq) is a new method for genomewide mapping of protein binding sites on DNA. It has generated much excitement in functional genomics. To score data and determine adequate sequencing depth, both the genomic background and the binding sites must be properly modeled. To develop a computational foundation to tackle these issues, we first performed a study to characterize the observed statistical nature of this new type of high-throughput data. By linking sequence tags into clusters, we show that there are two components to the distribution of tag counts observed in a number of recent experiments: an initial power-law distribution and a subsequent long right tail. Then we develop in silico ChIP-seq, a computational method to simulate the experimental outcome by placing tags onto the genome according to particular assumed distributions for the actual binding sites and for the background genomic sequence. In contrast to current assumptions, our results show that both the background and the binding sites need to have a markedly nonuniform distribution in order to correctly model the observed ChIP-seq data, with, for instance, the background tag counts modeled by a gamma distribution. On the basis of these results, we extend an existing scoring approach by using a more realistic genomic-background model. This enables us to identify transcription-factor binding sites in ChIP-seq data in a statistically rigorous fashion.

View on publisher site Alert me about new mentions

Mendeley readers

Mendeley readers

The data shown below were compiled from readership statistics for 222 Mendeley readers of this research output. Click here to see the associated Mendeley record.

Geographical breakdown

Country	Count	As %
United States	15	7%
Germany	7	3%
United Kingdom	4	2%
France	2	<1%
Italy	2	<1%
Sweden	2	<1%
China	2	<1%
Netherlands	1	<1%
Singapore	1	<1%
Other	9	4%
Unknown	177	80%

Demographic breakdown

Readers by professional status	Count	As %
Researcher	78	35%
Student > Ph. D. Student	64	29%
Student > Master	19	9%
Professor > Associate Professor	17	8%
Student > Bachelor	12	5%
Other	22	10%
Unknown	10	5%

Readers by discipline	Count	As %
Agricultural and Biological Sciences	135	61%
Biochemistry, Genetics and Molecular Biology	32	14%
Computer Science	17	8%
Mathematics	7	3%
Engineering	6	3%
Other	11	5%
Unknown	14	6%