RSAT peak-motifs: motif analysis in full-size ChIP-seq datasets.

Affiliation

Department of Computational Molecular Biology, Max Planck Institute for Molecular Genetics, Ihnestrasse 73, Berlin 14195, Germany.

Publication information

Nucleic Acids Res. 2012 Feb;40(4):e31. doi: 10.1093/nar/gkr1104. Epub 2011 Dec 8.

DOI:10.1093/nar/gkr1104

PMID:22156162

Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC3287167/

Abstract

ChIP-seq is increasingly used to characterize transcription factor binding and chromatin marks at a genomic scale. Various tools are now available to extract binding motifs from peak data sets. However, most approaches are only available as command-line programs, or via a website but with size restrictions. We present peak-motifs, a computational pipeline that discovers motifs in peak sequences, compares them with databases, exports putative binding sites for visualization in the UCSC genome browser and generates an extensive report suited for both naive and expert users. It relies on time- and memory-efficient algorithms enabling the treatment of several thousand peaks within minutes. Regarding time efficiency, peak-motifs outperforms all comparable tools by several orders of magnitude. We demonstrate its accuracy by analyzing data sets ranging from 4,000 to 128,000 peaks for 12 embryonic stem cell-specific transcription factors. In all cases, the program finds the expected motifs and returns additional motifs potentially bound by cofactors. We further apply peak-motifs to discover tissue-specific motifs in peak collections for the p300 transcriptional co-activator. To our knowledge, peak-motifs is the only tool that performs a complete motif analysis and offers a user-friendly web interface without any restriction on sequence size or number of peaks.

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7d27/3287167/8a3c2bf884c9/gkr1104f1.jpg

Similar articles

A complete workflow for the analysis of full-size ChIP-seq (and similar) data sets using peak-motifs.

Nat Protoc. 2012 Jul 26;7(8):1551-68. doi: 10.1038/nprot.2012.088.

MEME-ChIP: motif analysis of large DNA datasets.

Bioinformatics. 2011 Jun 15;27(12):1696-7. doi: 10.1093/bioinformatics/btr189. Epub 2011 Apr 12.

RSAT::Plants: Motif Discovery in ChIP-Seq Peaks of Plant Genomes.

Methods Mol Biol. 2016;1482:297-322. doi: 10.1007/978-1-4939-6396-6_19.

Inferring direct DNA binding from ChIP-seq.

Nucleic Acids Res. 2012 Sep 1;40(17):e128. doi: 10.1093/nar/gks433. Epub 2012 May 18.

SIOMICS: a novel approach for systematic identification of motifs in ChIP-seq data.

Nucleic Acids Res. 2014 Mar;42(5):e35. doi: 10.1093/nar/gkt1288. Epub 2013 Dec 9.

DREME: motif discovery in transcription factor ChIP-seq data.

Bioinformatics. 2011 Jun 15;27(12):1653-9. doi: 10.1093/bioinformatics/btr261. Epub 2011 May 4.

Differential motif enrichment analysis of paired ChIP-seq experiments.

BMC Genomics. 2014 Sep 2;15(1):752. doi: 10.1186/1471-2164-15-752.

RSAT matrix-clustering: dynamic exploration and redundancy reduction of transcription factor binding motif collections.

Nucleic Acids Res. 2017 Jul 27;45(13):e119. doi: 10.1093/nar/gkx314.

COPS: detecting co-occurrence and spatial arrangement of transcription factor binding motifs in genome-wide datasets.

PLoS One. 2012;7(12):e52055. doi: 10.1371/journal.pone.0052055. Epub 2012 Dec 18.

Cited by

The derived transposase 5 (PGBD5) can interact with human -like elements.

bioRxiv. 2025 Aug 2:2025.07.31.667870. doi: 10.1101/2025.07.31.667870.

Molecular circuit between Aspergillus nidulans transcription factors MsnA and VelB to coordinate fungal stress and developmental responses.

PLoS Genet. 2025 Jul 17;21(7):e1011578. doi: 10.1371/journal.pgen.1011578. eCollection 2025 Jul.

Divergence in a eukaryotic transcription factor's co-TF dependence involves multiple intrinsically disordered regions.

Nat Commun. 2025 Jun 18;16(1):5340. doi: 10.1038/s41467-025-59244-w.

Regulation of meristem and hormone function revealed through analysis of directly-regulated SHOOT MERISTEMLESS target genes.

Sci Rep. 2025 Jan 2;15(1):240. doi: 10.1038/s41598-024-83985-1.

When HSFs bring the heat-mapping the transcriptional circuitries of HSF-type regulators in .

mSphere. 2025 Jan 28;10(1):e0064423. doi: 10.1128/msphere.00644-23. Epub 2024 Dec 20.

ZBTB24 is a conserved multifaceted transcription factor at genes and centromeres that governs the DNA methylation state and expression of satellite repeats.

Hum Mol Genet. 2025 Jan 29;34(2):161-177. doi: 10.1093/hmg/ddae163.

DNA methylation shapes the Polycomb landscape during the exit from naive pluripotency.

Nat Struct Mol Biol. 2025 Feb;32(2):346-357. doi: 10.1038/s41594-024-01405-4. Epub 2024 Oct 24.

The RNA-binding protein PCBP1 modulates transcription by recruiting the G-quadruplex-specific helicase DHX9.

J Biol Chem. 2024 Nov;300(11):107830. doi: 10.1016/j.jbc.2024.107830. Epub 2024 Sep 27.

Divergence in a Eukaryotic Transcription Factor's co-TF Dependence Involves Multiple Intrinsically Disordered Regions Affecting Activation and Autoinhibition.

bioRxiv. 2025 Jan 2:2024.04.20.590343. doi: 10.1101/2024.04.20.590343.

Identification of transcription factor co-binding patterns with non-negative matrix factorization.

Nucleic Acids Res. 2024 Oct 14;52(18):e85. doi: 10.1093/nar/gkae743.

References

RSAT 2011: regulatory sequence analysis tools.

Nucleic Acids Res. 2011 Jul;39(Web Server issue):W86-91. doi: 10.1093/nar/gkr377.

DREME: motif discovery in transcription factor ChIP-seq data.

Bioinformatics. 2011 Jun 15;27(12):1653-9. doi: 10.1093/bioinformatics/btr261. Epub 2011 May 4.

MEME-ChIP: motif analysis of large DNA datasets.

Bioinformatics. 2011 Jun 15;27(12):1696-7. doi: 10.1093/bioinformatics/btr189. Epub 2011 Apr 12.

GimmeMotifs: a de novo motif prediction pipeline for ChIP-sequencing experiments.

Bioinformatics. 2011 Jan 15;27(2):270-1. doi: 10.1093/bioinformatics/btq636. Epub 2010 Nov 15.

RegulonDB version 7.0: transcriptional regulation of Escherichia coli K-12 integrated within genetic sensory response units (Gensor Units).

Nucleic Acids Res. 2011 Jan;39(Database issue):D98-105. doi: 10.1093/nar/gkq1110. Epub 2010 Nov 4.

UniPROBE, update 2011: expanded content and search tools in the online database of protein-binding microarray data on protein-DNA interactions.

Nucleic Acids Res. 2011 Jan;39(Database issue):D124-8. doi: 10.1093/nar/gkq992. Epub 2010 Oct 30.

Identification of context-dependent motifs by contrasting ChIP binding data.

Bioinformatics. 2010 Nov 15;26(22):2826-32. doi: 10.1093/bioinformatics/btq546. Epub 2010 Sep 23.

High resolution models of transcription factor-DNA affinities improve in vitro and in vivo binding predictions.

PLoS Comput Biol. 2010 Sep 9;6(9):e1000916. doi: 10.1371/journal.pcbi.1000916.

Deep and wide digging for binding motifs in ChIP-Seq data.

Bioinformatics. 2010 Oct 15;26(20):2622-3. doi: 10.1093/bioinformatics/btq488. Epub 2010 Aug 24.

ChIP-Seq identification of weakly conserved heart enhancers.

Nat Genet. 2010 Sep;42(9):806-10. doi: 10.1038/ng.650. Epub 2010 Aug 22.
