在 ChIP-seq 实验中检测转录因子结合位点的方法的实际比较。

A practical comparison of methods for detecting transcription factor binding sites in ChIP-seq experiments.

机构信息

Turku Centre for Biotechnology, FI-20521 Turku, Finland.

出版信息

BMC Genomics. 2009 Dec 18;10:618. doi: 10.1186/1471-2164-10-618.

DOI:10.1186/1471-2164-10-618

PMID:20017957

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC2804666/

Abstract

BACKGROUND

Chromatin immunoprecipitation coupled with massively parallel sequencing (ChIP-seq) is increasingly being applied to study transcriptional regulation on a genome-wide scale. While numerous algorithms have recently been proposed for analysing the large ChIP-seq datasets, their relative merits and potential limitations remain unclear in practical applications.

RESULTS

The present study compares the state-of-the-art algorithms for detecting transcription factor binding sites in four diverse ChIP-seq datasets under a variety of practical research settings. First, we demonstrate how the biological conclusions may change dramatically when the different algorithms are applied. The reproducibility across biological replicates is then investigated as an internal validation of the detections. Finally, the predicted binding sites with each method are compared to high-scoring binding motifs as well as binding regions confirmed in independent qPCR experiments.

CONCLUSIONS

In general, our results indicate that the optimal choice of the computational approach depends heavily on the dataset under analysis. In addition to revealing valuable information to the users of this technology about the characteristics of the binding site detection approaches, the systematic evaluation framework provides also a useful reference to the developers of improved algorithms for ChIP-seq data.

摘要

背景

染色质免疫沉淀结合大规模平行测序（ChIP-seq）越来越多地被应用于全基因组范围内研究转录调控。虽然最近已经提出了许多用于分析大型 ChIP-seq 数据集的算法，但在实际应用中，它们的相对优点和潜在局限性仍不清楚。

结果

本研究比较了四种不同 ChIP-seq 数据集在各种实际研究环境下检测转录因子结合位点的最新算法。首先，我们展示了当应用不同算法时，生物学结论可能会发生巨大变化。然后，通过对生物学重复的重复性进行调查，作为检测的内部验证。最后，将每种方法预测的结合位点与高得分的结合基序以及独立 qPCR 实验中证实的结合区域进行比较。

结论

总的来说，我们的结果表明，计算方法的最佳选择在很大程度上取决于所分析的数据集。除了向该技术的使用者揭示有关结合位点检测方法的特征的有价值的信息外，系统评估框架还为 ChIP-seq 数据的改进算法的开发人员提供了有用的参考。

相似文献

A practical comparison of methods for detecting transcription factor binding sites in ChIP-seq experiments.

BMC Genomics. 2009 Dec 18;10:618. doi: 10.1186/1471-2164-10-618.

FisherMP: fully parallel algorithm for detecting combinatorial motifs from large ChIP-seq datasets.

DNA Res. 2019 Jun 1;26(3):231-242. doi: 10.1093/dnares/dsz004.

Design and analysis of ChIP-seq experiments for DNA-binding proteins.

Nat Biotechnol. 2008 Dec;26(12):1351-9. doi: 10.1038/nbt.1508. Epub 2008 Nov 16.

dPeak: high resolution identification of transcription factor binding sites from PET and SET ChIP-Seq data.

PLoS Comput Biol. 2013;9(10):e1003246. doi: 10.1371/journal.pcbi.1003246. Epub 2013 Oct 17.

A novel statistical method for quantitative comparison of multiple ChIP-seq datasets.

Bioinformatics. 2015 Jun 15;31(12):1889-96. doi: 10.1093/bioinformatics/btv094. Epub 2015 Feb 13.

Using combined evidence from replicates to evaluate ChIP-seq peaks.

Bioinformatics. 2015 Sep 1;31(17):2761-9. doi: 10.1093/bioinformatics/btv293. Epub 2015 May 7.

On the detection and refinement of transcription factor binding sites using ChIP-Seq data.

Nucleic Acids Res. 2010 Apr;38(7):2154-67. doi: 10.1093/nar/gkp1180. Epub 2010 Jan 6.

ChIPulate: A comprehensive ChIP-seq simulation pipeline.

PLoS Comput Biol. 2019 Mar 21;15(3):e1006921. doi: 10.1371/journal.pcbi.1006921. eCollection 2019 Mar.

Computational analysis of ChIP-seq data.

Methods Mol Biol. 2010;674:143-59. doi: 10.1007/978-1-60761-854-6_9.

AIControl: replacing matched control experiments with machine learning improves ChIP-seq peak identification.

Nucleic Acids Res. 2019 Jun 4;47(10):e58. doi: 10.1093/nar/gkz156.

引用本文的文献

Guidelines to Analyze ChIP-Seq Data: Journey Through QC and Analysis Considerations.

Methods Mol Biol. 2025;2889:193-206. doi: 10.1007/978-1-0716-4322-8_14.

Challenges and best practices in omics benchmarking.

Nat Rev Genet. 2024 May;25(5):326-339. doi: 10.1038/s41576-023-00679-6. Epub 2024 Jan 12.

ChIP-AP: an integrated analysis pipeline for unbiased ChIP-seq analysis.

Brief Bioinform. 2022 Jan 17;23(1). doi: 10.1093/bib/bbab537.

WACS: improving ChIP-seq peak calling by optimally weighting controls.

BMC Bioinformatics. 2021 Feb 15;22(1):69. doi: 10.1186/s12859-020-03927-2.

Comparative analysis of commonly used peak calling programs for ChIP-Seq analysis.

Genomics Inform. 2020 Dec;18(4):e42. doi: 10.5808/GI.2020.18.4.e42. Epub 2020 Dec 14.

Comparative analysis of ChIP-exo peak-callers: impact of data quality, read duplication and binding subtypes.

BMC Bioinformatics. 2020 Feb 21;21(1):65. doi: 10.1186/s12859-020-3403-3.

Population size estimation for quality control of ChIP-Seq datasets.

PLoS One. 2019 Aug 29;14(8):e0221760. doi: 10.1371/journal.pone.0221760. eCollection 2019.

AIControl: replacing matched control experiments with machine learning improves ChIP-seq peak identification.

Nucleic Acids Res. 2019 Jun 4;47(10):e58. doi: 10.1093/nar/gkz156.

Differential analysis of chromatin accessibility and histone modifications for predicting mouse developmental enhancers.

Nucleic Acids Res. 2018 Nov 30;46(21):11184-11201. doi: 10.1093/nar/gky753.

Prediction of protein-DNA interactions of transcription factors linking proteomics and transcriptomics data.

EuPA Open Proteom. 2016 Sep 15;13:14-23. doi: 10.1016/j.euprot.2016.09.001. eCollection 2016 Dec.

本文引用的文献

Insights from genomic profiling of transcription factors.

Nat Rev Genet. 2009 Sep;10(9):605-16. doi: 10.1038/nrg2636. Epub 2009 Aug 11.

The transcriptional program controlled by the stem cell leukemia gene Scl/Tal1 during early embryonic hematopoietic development.

Blood. 2009 May 28;113(22):5456-65. doi: 10.1182/blood-2009-01-200048. Epub 2009 Apr 3.

ChIP-Seq of ERalpha and RNA polymerase II defines genes differentially responding to ligands.

EMBO J. 2009 May 20;28(10):1418-28. doi: 10.1038/emboj.2009.88. Epub 2009 Apr 4.

Genomic location analysis by ChIP-Seq.

J Cell Biochem. 2009 May 1;107(1):11-8. doi: 10.1002/jcb.22077.

Genome-wide analysis of transcription factor binding sites based on ChIP-Seq data.

Nat Methods. 2008 Sep;5(9):829-34. doi: 10.1038/nmeth.1246.

Efficient yeast ChIP-Seq using multiplex short-read DNA sequencing.

BMC Genomics. 2009 Jan 21;10:37. doi: 10.1186/1471-2164-10-37.

PeakSeq enables systematic scoring of ChIP-seq experiments relative to controls.

Nat Biotechnol. 2009 Jan;27(1):66-75. doi: 10.1038/nbt.1518. Epub 2009 Jan 4.

Global analysis of the insulator binding protein CTCF in chromatin barrier regions reveals demarcation of active and repressive domains.

Genome Res. 2009 Jan;19(1):24-32. doi: 10.1101/gr.082800.108. Epub 2008 Dec 3.

Design and analysis of ChIP-seq experiments for DNA-binding proteins.

Nat Biotechnol. 2008 Dec;26(12):1351-9. doi: 10.1038/nbt.1508. Epub 2008 Nov 16.

Genome-wide profiling of PPARgamma:RXR and RNA polymerase II occupancy reveals temporal activation of distinct metabolic pathways and changes in RXR dimer composition during adipogenesis.

Genes Dev. 2008 Nov 1;22(21):2953-67. doi: 10.1101/gad.501108.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

在 ChIP-seq 实验中检测转录因子结合位点的方法的实际比较。

A practical comparison of methods for detecting transcription factor binding sites in ChIP-seq experiments.

机构信息

出版信息

BACKGROUND

RESULTS

CONCLUSIONS

背景

结果

结论

相似文献

引用本文的文献

本文引用的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

相似文献

引用本文的文献

本文引用的文献