从 ChIP-Seq 进行全基因组转录因子结合位点分析的集成管道。

An integrated pipeline for the genome-wide analysis of transcription factor binding sites from ChIP-Seq.

机构信息

Computational Biology Unit, Institut de Recherche Clinique de Montreal, Montreal, Canada.

出版信息

PLoS One. 2011 Feb 16;6(2):e16432. doi: 10.1371/journal.pone.0016432.

DOI:10.1371/journal.pone.0016432

PMID:21358819

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC3040171/

Abstract

ChIP-Seq has become the standard method for genome-wide profiling DNA association of transcription factors. To simplify analyzing and interpreting ChIP-Seq data, which typically involves using multiple applications, we describe an integrated, open source, R-based analysis pipeline. The pipeline addresses data input, peak detection, sequence and motif analysis, visualization, and data export, and can readily be extended via other R and Bioconductor packages. Using a standard multicore computer, it can be used with datasets consisting of tens of thousands of enriched regions. We demonstrate its effectiveness on published human ChIP-Seq datasets for FOXA1, ER, CTCF and STAT1, where it detected co-occurring motifs that were consistent with the literature but not detected by other methods. Our pipeline provides the first complete set of Bioconductor tools for sequence and motif analysis of ChIP-Seq and ChIP-chip data.

摘要

ChIP-Seq 已成为用于全基因组分析转录因子与 DNA 关联的标准方法。为了简化分析和解释 ChIP-Seq 数据（通常需要使用多个应用程序），我们描述了一个集成的、开源的、基于 R 的分析管道。该管道解决了数据输入、峰检测、序列和基序分析、可视化和数据导出等问题，并且可以通过其他 R 和 Bioconductor 包轻松扩展。使用标准的多核计算机，它可以用于包含数万个富集区域的数据集。我们在已发表的人类 ChIP-Seq 数据集（FOXA1、ER、CTCF 和 STAT1）上验证了其有效性，该数据集检测到了与文献一致但其他方法未检测到的共同基序。我们的管道提供了用于 ChIP-Seq 和 ChIP-chip 数据的序列和基序分析的第一个完整的 Bioconductor 工具集。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2881/3040171/8be808eaf0e8/pone.0016432.g001.jpg

相似文献

An integrated pipeline for the genome-wide analysis of transcription factor binding sites from ChIP-Seq.

PLoS One. 2011 Feb 16;6(2):e16432. doi: 10.1371/journal.pone.0016432.

Genome-wide identification of in vivo protein-DNA binding sites from ChIP-Seq data.

Nucleic Acids Res. 2008 Sep;36(16):5221-31. doi: 10.1093/nar/gkn488. Epub 2008 Aug 6.

Discovering transcription factor binding sites in highly repetitive regions of genomes with multi-read analysis of ChIP-Seq data.

PLoS Comput Biol. 2011 Jul;7(7):e1002111. doi: 10.1371/journal.pcbi.1002111. Epub 2011 Jul 14.

MACE: model based analysis of ChIP-exo.

Nucleic Acids Res. 2014 Nov 10;42(20):e156. doi: 10.1093/nar/gku846. Epub 2014 Sep 23.

7C: Computational Chromosome Conformation Capture by Correlation of ChIP-seq at CTCF motifs.

BMC Genomics. 2019 Oct 25;20(1):777. doi: 10.1186/s12864-019-6088-0.

HiChIP: a high-throughput pipeline for integrative analysis of ChIP-Seq data.

BMC Bioinformatics. 2014 Aug 15;15(1):280. doi: 10.1186/1471-2105-15-280.

HPeak: an HMM-based algorithm for defining read-enriched regions in ChIP-Seq data.

BMC Bioinformatics. 2010 Jul 2;11:369. doi: 10.1186/1471-2105-11-369.

Pinpointing transcription factor binding sites from ChIP-seq data with SeqSite.

BMC Syst Biol. 2011;5 Suppl 2(Suppl 2):S3. doi: 10.1186/1752-0509-5-S2-S3. Epub 2011 Dec 14.

Role of ChIP-seq in the discovery of transcription factor binding sites, differential gene regulation mechanism, epigenetic marks and beyond.

Cell Cycle. 2014;13(18):2847-52. doi: 10.4161/15384101.2014.949201.

ChIP-exo signal associated with DNA-binding motifs provides insight into the genomic binding of the glucocorticoid receptor and cooperating transcription factors.

Genome Res. 2015 Jun;25(6):825-35. doi: 10.1101/gr.185157.114. Epub 2015 Feb 26.

引用本文的文献

The evaluation of transcription factor binding site prediction tools in human and Arabidopsis genomes.

BMC Bioinformatics. 2024 Dec 2;25(1):371. doi: 10.1186/s12859-024-05995-0.

Less-is-more: selecting transcription factor binding regions informative for motif inference.

Nucleic Acids Res. 2024 Feb 28;52(4):e20. doi: 10.1093/nar/gkad1240.

CRMnet: A deep learning model for predicting gene expression from large regulatory sequence datasets.

Front Big Data. 2023 Mar 14;6:1113402. doi: 10.3389/fdata.2023.1113402. eCollection 2023.

Intra-Domain Residue Coevolution in Transcription Factors Contributes to DNA Binding Specificity.

Microbiol Spectr. 2023 Mar 21;11(2):e0365122. doi: 10.1128/spectrum.03651-22.

Dynamic transcriptome analysis reveals signatures of paradoxical effect of vemurafenib on human dermal fibroblasts.

Cell Commun Signal. 2021 Dec 20;19(1):123. doi: 10.1186/s12964-021-00801-3.

MODER2: first-order Markov modeling and discovery of monomeric and dimeric binding motifs.

Bioinformatics. 2020 May 1;36(9):2690-2696. doi: 10.1093/bioinformatics/btaa045.

The Identification and Interpretation of -Regulatory Noncoding Mutations in Cancer.

High Throughput. 2018 Dec 20;8(1):1. doi: 10.3390/ht8010001.

Systems and Synthetic Biology Approaches to Engineer Fungi for Fine Chemical Production.

Front Bioeng Biotechnol. 2018 Oct 3;6:117. doi: 10.3389/fbioe.2018.00117. eCollection 2018.

Maser: one-stop platform for NGS big data from analysis to visualization.

Database (Oxford). 2018 Jan 1;2018. doi: 10.1093/database/bay027.

Modular discovery of monomeric and dimeric transcription factor binding motifs for large data sets.

Nucleic Acids Res. 2018 May 4;46(8):e44. doi: 10.1093/nar/gky027.

本文引用的文献

Rapid innovation in ChIP-seq peak-calling algorithms is outdistancing benchmarking efforts.

Brief Bioinform. 2011 Nov;12(6):626-33. doi: 10.1093/bib/bbq068. Epub 2010 Nov 8.

Deep and wide digging for binding motifs in ChIP-Seq data.

Bioinformatics. 2010 Oct 15;26(20):2622-3. doi: 10.1093/bioinformatics/btq488. Epub 2010 Aug 24.

Genomic information infrastructure after the deluge.

Genome Biol. 2010;11(7):402. doi: 10.1186/gb-2010-11-7-402. Epub 2010 Jul 26.

PICS: probabilistic inference for ChIP-seq.

Biometrics. 2011 Mar;67(1):151-63. doi: 10.1111/j.1541-0420.2010.01441.x.

ChIPpeakAnno: a Bioconductor package to annotate ChIP-seq and ChIP-chip data.

BMC Bioinformatics. 2010 May 11;11:237. doi: 10.1186/1471-2105-11-237.

De novo motif identification improves the accuracy of predicting transcription factor binding sites in ChIP-Seq data analysis.

Nucleic Acids Res. 2010 Jun;38(11):e126. doi: 10.1093/nar/gkq217. Epub 2010 Apr 7.

rMAT--an R/Bioconductor package for analyzing ChIP-chip experiments.

Bioinformatics. 2010 Mar 1;26(5):678-9. doi: 10.1093/bioinformatics/btq023. Epub 2010 Jan 19.

Fast and accurate long-read alignment with Burrows-Wheeler transform.

Bioinformatics. 2010 Mar 1;26(5):589-95. doi: 10.1093/bioinformatics/btp698. Epub 2010 Jan 15.

On the detection and refinement of transcription factor binding sites using ChIP-Seq data.

Nucleic Acids Res. 2010 Apr;38(7):2154-67. doi: 10.1093/nar/gkp1180. Epub 2010 Jan 6.

A practical comparison of methods for detecting transcription factor binding sites in ChIP-seq experiments.

BMC Genomics. 2009 Dec 18;10:618. doi: 10.1186/1471-2164-10-618.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

从 ChIP-Seq 进行全基因组转录因子结合位点分析的集成管道。

An integrated pipeline for the genome-wide analysis of transcription factor binding sites from ChIP-Seq.

机构信息

出版信息

相似文献

引用本文的文献

本文引用的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

相似文献

引用本文的文献

本文引用的文献