GimmeMotifs：一种用于 ChIP-seq 实验的从头预测基序管道。

GimmeMotifs: a de novo motif prediction pipeline for ChIP-sequencing experiments.

机构信息

Department of Molecular Biology, Faculty of Science, Nijmegen Centre for Molecular Life Sciences, Radboud University Nijmegen, Nijmegen, The Netherlands.

出版信息

Bioinformatics. 2011 Jan 15;27(2):270-1. doi: 10.1093/bioinformatics/btq636. Epub 2010 Nov 15.

DOI:10.1093/bioinformatics/btq636

PMID:21081511

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC3018809/

Abstract

SUMMARY

Accurate prediction of transcription factor binding motifs that are enriched in a collection of sequences remains a computational challenge. Here we report on GimmeMotifs, a pipeline that incorporates an ensemble of computational tools to predict motifs de novo from ChIP-sequencing (ChIP-seq) data. Similar redundant motifs are compared using the weighted information content (WIC) similarity score and clustered using an iterative procedure. A comprehensive output report is generated with several different evaluation metrics to compare and evaluate the results. Benchmarks show that the method performs well on human and mouse ChIP-seq datasets. GimmeMotifs consists of a suite of command-line scripts that can be easily implemented in a ChIP-seq analysis pipeline.

AVAILABILITY

GimmeMotifs is implemented in Python and runs on Linux. The source code is freely available for download at http://www.ncmls.eu/bioinfo/gimmemotifs/.

CONTACT

s.vanheeringen@ncmls.ru.nl

SUPPLEMENTARY INFORMATION

Supplementary data are available at Bioinformatics online.

摘要

准确预测富含序列集合的转录因子结合基序仍然是一个计算挑战。本文报告了 GimmeMotifs，这是一个从 ChIP-seq （ChIP-seq）数据中从头预测基序的组合计算工具的管道。使用加权信息内容（WIC）相似性评分比较相似的冗余基序，并使用迭代过程进行聚类。生成了一个带有多个不同评估指标的综合输出报告，以比较和评估结果。基准测试表明，该方法在人类和小鼠 ChIP-seq 数据集上表现良好。GimmeMotifs 由一套命令行脚本组成，可以轻松地在 ChIP-seq 分析管道中实现。

可用性

GimmeMotifs 是用 Python 编写的，可在 Linux 上运行。源代码可在 http://www.ncmls.eu/bioinfo/gimmemotifs/ 免费下载。

联系方式

s.vanheeringen@ncmls.ru.nl

补充信息

补充数据可在生物信息学在线获得。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e9f6/3018809/a5bd9ef07690/btq636f1.jpg

相似文献

GimmeMotifs: a de novo motif prediction pipeline for ChIP-sequencing experiments.

Bioinformatics. 2011 Jan 15;27(2):270-1. doi: 10.1093/bioinformatics/btq636. Epub 2010 Nov 15.

Using combined evidence from replicates to evaluate ChIP-seq peaks.

Bioinformatics. 2015 Sep 1;31(17):2761-9. doi: 10.1093/bioinformatics/btv293. Epub 2015 May 7.

ProSampler: an ultrafast and accurate motif finder in large ChIP-seq datasets for combinatory motif discovery.

Bioinformatics. 2019 Nov 1;35(22):4632-4639. doi: 10.1093/bioinformatics/btz290.

CacPred: a cascaded convolutional neural network for TF-DNA binding prediction.

BMC Genomics. 2025 Mar 18;26(Suppl 2):264. doi: 10.1186/s12864-025-11399-y.

Differential motif enrichment analysis of paired ChIP-seq experiments.

BMC Genomics. 2014 Sep 2;15(1):752. doi: 10.1186/1471-2164-15-752.

MEME-ChIP: motif analysis of large DNA datasets.

Bioinformatics. 2011 Jun 15;27(12):1696-7. doi: 10.1093/bioinformatics/btr189. Epub 2011 Apr 12.

SIOMICS: a novel approach for systematic identification of motifs in ChIP-seq data.

Nucleic Acids Res. 2014 Mar;42(5):e35. doi: 10.1093/nar/gkt1288. Epub 2013 Dec 9.

Improved linking of motifs to their TFs using domain information.

Bioinformatics. 2020 Mar 1;36(6):1655-1662. doi: 10.1093/bioinformatics/btz855.

coMOTIF: a mixture framework for identifying transcription factor and a coregulator motif in ChIP-seq data.

Bioinformatics. 2011 Oct 1;27(19):2625-32. doi: 10.1093/bioinformatics/btr397. Epub 2011 Jul 19.

Crunch: integrated processing and modeling of ChIP-seq data in terms of regulatory motifs.

Genome Res. 2019 Jul;29(7):1164-1177. doi: 10.1101/gr.239319.118. Epub 2019 May 28.

引用本文的文献

Single-cell polygenic risk scores dissect cellular and molecular heterogeneity of complex human diseases.

Nat Biotechnol. 2025 Jul 25. doi: 10.1038/s41587-025-02725-6.

Early developmental origins of cortical disorders modeled in human neural stem cells.

Nat Commun. 2025 Jul 9;16(1):6347. doi: 10.1038/s41467-025-61316-w.

The lincRNA Pantr1 is a FOXG1 target gene conferring site-specific chromatin binding of FOXG1.

Nucleic Acids Res. 2025 Jun 20;53(12). doi: 10.1093/nar/gkaf539.

Genomic alterations and transcriptional phenotypes in circulating free DNA and matched metastatic tumor.

Genome Med. 2025 Feb 25;17(1):15. doi: 10.1186/s13073-025-01438-4.

Multi-omics analysis reveals distinct gene regulatory mechanisms between primary and organoid-derived human hepatocytes.

Dis Model Mech. 2025 Jan 1;18(1). doi: 10.1242/dmm.050883. Epub 2025 Jan 29.

Integrative multiomics reveals common endotypes across PSEN1, PSEN2, and APP mutations in familial Alzheimer's disease.

Alzheimers Res Ther. 2025 Jan 4;17(1):5. doi: 10.1186/s13195-024-01659-6.

Stem cell expression of CXCR4 regulates tissue composition in the vomeronasal organ.

J Cell Sci. 2025 Jan 1;138(1). doi: 10.1242/jcs.263451. Epub 2025 Jan 9.

DNA methylation controls stemness of astrocytes in health and ischaemia.

Nature. 2024 Oct;634(8033):415-423. doi: 10.1038/s41586-024-07898-9. Epub 2024 Sep 4.

Macrophage-mediated myelin recycling fuels brain cancer malignancy.

Cell. 2024 Sep 19;187(19):5336-5356.e30. doi: 10.1016/j.cell.2024.07.030. Epub 2024 Aug 12.

Comprehensive mapping and modelling of the rice regulome landscape unveils the regulatory architecture underlying complex traits.

Nat Commun. 2024 Aug 3;15(1):6562. doi: 10.1038/s41467-024-50787-y.

本文引用的文献

Genome-wide profiling of p63 DNA-binding sites identifies an element that regulates gene expression during limb development in the 7q21 SHFM1 locus.

PLoS Genet. 2010 Aug 19;6(8):e1001065. doi: 10.1371/journal.pgen.1001065.

W-ChIPMotifs: a web application tool for de novo motif discovery from ChIP-based high-throughput data.

Bioinformatics. 2009 Dec 1;25(23):3191-3. doi: 10.1093/bioinformatics/btp570. Epub 2009 Oct 1.

ChIP-seq: advantages and challenges of a maturing technology.

Nat Rev Genet. 2009 Oct;10(10):669-80. doi: 10.1038/nrg2641. Epub 2009 Sep 8.

SCOPE: a web server for practical de novo motif discovery.

Nucleic Acids Res. 2007 Jul;35(Web Server issue):W259-64. doi: 10.1093/nar/gkm310. Epub 2007 May 7.

Limitations and potentials of current motif discovery algorithms.

Nucleic Acids Res. 2005 Sep 2;33(15):4899-913. doi: 10.1093/nar/gki791. Print 2005.

Assessing computational tools for the discovery of transcription factor binding sites.

Nat Biotechnol. 2005 Jan;23(1):137-44. doi: 10.1038/nbt1053.

JASPAR: an open-access database for eukaryotic transcription factor binding profiles.

Nucleic Acids Res. 2004 Jan 1;32(Database issue):D91-4. doi: 10.1093/nar/gkh012.

Rank order metrics for quantifying the association of sequence features with gene regulation.

Bioinformatics. 2003 Jan 22;19(2):212-8. doi: 10.1093/bioinformatics/19.2.212.

Sequence logos: a new way to display consensus sequences.

Nucleic Acids Res. 1990 Oct 25;18(20):6097-100. doi: 10.1093/nar/18.20.6097.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

GimmeMotifs：一种用于 ChIP-seq 实验的从头预测基序管道。

GimmeMotifs: a de novo motif prediction pipeline for ChIP-sequencing experiments.

机构信息

出版信息

SUMMARY

AVAILABILITY

CONTACT

SUPPLEMENTARY INFORMATION

摘要

可用性

联系方式

补充信息

相似文献

引用本文的文献

本文引用的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

相似文献

引用本文的文献

本文引用的文献