↓ Skip to main content

PLOS

COPS: Detecting Co-Occurrence and Spatial Arrangement of Transcription Factor Binding Motifs in Genome-Wide Datasets

Overview of attention for article published in PLOS ONE, December 2012
Altmetric Badge

Citations

dimensions_citation
16 Dimensions

Readers on

mendeley
68 Mendeley
citeulike
1 CiteULike
Title
COPS: Detecting Co-Occurrence and Spatial Arrangement of Transcription Factor Binding Motifs in Genome-Wide Datasets
Published in
PLOS ONE, December 2012
DOI 10.1371/journal.pone.0052055
Pubmed ID
Authors

Nati Ha, Maria Polychronidou, Ingrid Lohmann

Abstract

In multi-cellular organisms, spatiotemporal activity of cis-regulatory DNA elements depends on their occupancy by different transcription factors (TFs). In recent years, genome-wide ChIP-on-Chip, ChIP-Seq and DamID assays have been extensively used to unravel the combinatorial interaction of TFs with cis-regulatory modules (CRMs) in the genome. Even though genome-wide binding profiles are increasingly becoming available for different TFs, single TF binding profiles are in most cases not sufficient for dissecting complex regulatory networks. Thus, potent computational tools detecting statistically significant and biologically relevant TF-motif co-occurrences in genome-wide datasets are essential for analyzing context-dependent transcriptional regulation. We have developed COPS (Co-Occurrence Pattern Search), a new bioinformatics tool based on a combination of association rules and Markov chain models, which detects co-occurring TF binding sites (BSs) on genomic regions of interest. COPS scans DNA sequences for frequent motif patterns using a Frequent-Pattern tree based data mining approach, which allows efficient performance of the software with respect to both data structure and implementation speed, in particular when mining large datasets. Since transcriptional gene regulation very often relies on the formation of regulatory protein complexes mediated by closely adjoining TF binding sites on CRMs, COPS additionally detects preferred short distance between co-occurring TF motifs. The performance of our software with respect to biological significance was evaluated using three published datasets containing genomic regions that are independently bound by several TFs involved in a defined biological process. In sum, COPS is a fast, efficient and user-friendly tool mining statistically and biologically significant TFBS co-occurrences and therefore allows the identification of TFs that combinatorially regulate gene expression.

Mendeley readers

Mendeley readers

The data shown below were compiled from readership statistics for 68 Mendeley readers of this research output. Click here to see the associated Mendeley record.

Geographical breakdown

Country Count As %
Germany 2 3%
Belgium 2 3%
France 1 1%
United Kingdom 1 1%
Italy 1 1%
Spain 1 1%
United States 1 1%
Unknown 59 87%

Demographic breakdown

Readers by professional status Count As %
Researcher 23 34%
Student > Ph. D. Student 18 26%
Student > Master 9 13%
Student > Bachelor 3 4%
Student > Doctoral Student 2 3%
Other 8 12%
Unknown 5 7%
Readers by discipline Count As %
Agricultural and Biological Sciences 29 43%
Biochemistry, Genetics and Molecular Biology 20 29%
Computer Science 6 9%
Engineering 3 4%
Psychology 1 1%
Other 4 6%
Unknown 5 7%