• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

利用 ChIP-Seq 数据检测和改进转录因子结合位点。

On the detection and refinement of transcription factor binding sites using ChIP-Seq data.

机构信息

Center for Statistical Genetics, University of Michigan, Ann Arbor, Michigan 48109, USA.

出版信息

Nucleic Acids Res. 2010 Apr;38(7):2154-67. doi: 10.1093/nar/gkp1180. Epub 2010 Jan 6.

DOI:10.1093/nar/gkp1180
PMID:20056654
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC2853110/
Abstract

Coupling chromatin immunoprecipitation (ChIP) with recently developed massively parallel sequencing technologies has enabled genome-wide detection of protein-DNA interactions with unprecedented sensitivity and specificity. This new technology, ChIP-Seq, presents opportunities for in-depth analysis of transcription regulation. In this study, we explore the value of using ChIP-Seq data to better detect and refine transcription factor binding sites (TFBS). We introduce a novel computational algorithm named Hybrid Motif Sampler (HMS), specifically designed for TFBS motif discovery in ChIP-Seq data. We propose a Bayesian model that incorporates sequencing depth information to aid motif identification. Our model also allows intra-motif dependency to describe more accurately the underlying motif pattern. Our algorithm combines stochastic sampling and deterministic 'greedy' search steps into a novel hybrid iterative scheme. This combination accelerates the computation process. Simulation studies demonstrate favorable performance of HMS compared to other existing methods. When applying HMS to real ChIP-Seq datasets, we find that (i) the accuracy of existing TFBS motif patterns can be significantly improved; and (ii) there is significant intra-motif dependency inside all the TFBS motifs we tested; modeling these dependencies further improves the accuracy of these TFBS motif patterns. These findings may offer new biological insights into the mechanisms of transcription factor regulation.

摘要

将染色质免疫沉淀(ChIP)与最近开发的大规模平行测序技术相结合,使我们能够以前所未有的灵敏度和特异性在全基因组范围内检测蛋白质-DNA 相互作用。这项新技术 ChIP-Seq 为深入分析转录调控提供了机会。在本研究中,我们探讨了使用 ChIP-Seq 数据更好地检测和细化转录因子结合位点(TFBS)的价值。我们引入了一种名为 Hybrid Motif Sampler(HMS)的新型计算算法,该算法专门用于在 ChIP-Seq 数据中发现 TFBS 基序。我们提出了一个贝叶斯模型,该模型结合了测序深度信息,以帮助识别基序。我们的模型还允许基序内的依赖性,以更准确地描述潜在的基序模式。我们的算法将随机抽样和确定性“贪婪”搜索步骤结合到一种新颖的混合迭代方案中。这种组合加速了计算过程。模拟研究表明,HMS 的性能优于其他现有方法。当将 HMS 应用于真实的 ChIP-Seq 数据集时,我们发现:(i)现有 TFBS 基序模式的准确性可以显著提高;(ii)我们测试的所有 TFBS 基序中都存在显著的基序内依赖性;对这些依赖性进行建模可以进一步提高这些 TFBS 基序模式的准确性。这些发现可能为转录因子调控机制提供新的生物学见解。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a2c4/2853110/ddb03953c792/gkp1180f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a2c4/2853110/718b8d79a48a/gkp1180f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a2c4/2853110/44983f0c117d/gkp1180f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a2c4/2853110/ddb03953c792/gkp1180f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a2c4/2853110/718b8d79a48a/gkp1180f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a2c4/2853110/44983f0c117d/gkp1180f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a2c4/2853110/ddb03953c792/gkp1180f3.jpg

相似文献

1
On the detection and refinement of transcription factor binding sites using ChIP-Seq data.利用 ChIP-Seq 数据检测和改进转录因子结合位点。
Nucleic Acids Res. 2010 Apr;38(7):2154-67. doi: 10.1093/nar/gkp1180. Epub 2010 Jan 6.
2
Improving analysis of transcription factor binding sites within ChIP-Seq data based on topological motif enrichment.基于拓扑基序富集改进ChIP-Seq数据中转录因子结合位点的分析。
BMC Genomics. 2014 Jun 13;15(1):472. doi: 10.1186/1471-2164-15-472.
3
Inferring intra-motif dependencies of DNA binding sites from ChIP-seq data.从ChIP-seq数据推断DNA结合位点的基序内依赖性。
BMC Bioinformatics. 2015 Nov 9;16:375. doi: 10.1186/s12859-015-0797-4.
4
A Fast Cluster Motif Finding Algorithm for ChIP-Seq Data Sets.一种用于ChIP-Seq数据集的快速聚类基序发现算法。
Biomed Res Int. 2015;2015:218068. doi: 10.1155/2015/218068. Epub 2015 Jul 5.
5
De novo motif identification improves the accuracy of predicting transcription factor binding sites in ChIP-Seq data analysis.从头鉴定基序可提高 ChIP-Seq 数据分析中预测转录因子结合位点的准确性。
Nucleic Acids Res. 2010 Jun;38(11):e126. doi: 10.1093/nar/gkq217. Epub 2010 Apr 7.
6
Role of ChIP-seq in the discovery of transcription factor binding sites, differential gene regulation mechanism, epigenetic marks and beyond.染色质免疫沉淀测序(ChIP-seq)在转录因子结合位点发现、差异基因调控机制、表观遗传标记及其他方面的作用。
Cell Cycle. 2014;13(18):2847-52. doi: 10.4161/15384101.2014.949201.
7
From binding motifs in ChIP-Seq data to improved models of transcription factor binding sites.从染色质免疫沉淀测序(ChIP-Seq)数据中的结合基序到转录因子结合位点的改进模型
J Bioinform Comput Biol. 2013 Feb;11(1):1340004. doi: 10.1142/S0219720013400040. Epub 2013 Jan 16.
8
An improved ChIP-seq peak detection system for simultaneously identifying post-translational modified transcription factors by combinatorial fusion, using SUMOylation as an example.一种改良的 ChIP-seq 峰检测系统,用于通过组合融合,以 SUMOylation 为例,同时鉴定翻译后修饰的转录因子。
BMC Genomics. 2014;15 Suppl 1(Suppl 1):S1. doi: 10.1186/1471-2164-15-S1-S1. Epub 2014 Jan 24.
9
MACE: model based analysis of ChIP-exo.MACE:基于模型的ChIP-exo分析
Nucleic Acids Res. 2014 Nov 10;42(20):e156. doi: 10.1093/nar/gku846. Epub 2014 Sep 23.
10
Design and analysis of ChIP-seq experiments for DNA-binding proteins.用于DNA结合蛋白的ChIP-seq实验的设计与分析。
Nat Biotechnol. 2008 Dec;26(12):1351-9. doi: 10.1038/nbt.1508. Epub 2008 Nov 16.

引用本文的文献

1
Identification of a BACH1 lung cancer signature: A novel tool for understanding BACH1 biology and identifying new inhibitors.一种BACH1肺癌特征的鉴定:理解BACH1生物学特性和鉴定新抑制剂的新工具。
Redox Biol. 2025 Jul 23;85:103789. doi: 10.1016/j.redox.2025.103789.
2
Benchmarking transcription factor binding site prediction models: a comparative analysis on synthetic and biological data.基准测试转录因子结合位点预测模型:对合成数据和生物数据的比较分析
Brief Bioinform. 2025 Jul 2;26(4). doi: 10.1093/bib/bbaf363.
3
BertSNR: an interpretable deep learning framework for single-nucleotide resolution identification of transcription factor binding sites based on DNA language model.

本文引用的文献

1
Hierarchical hidden Markov model with application to joint analysis of ChIP-chip and ChIP-seq data.用于ChIP-chip和ChIP-seq数据联合分析的分层隐马尔可夫模型
Bioinformatics. 2009 Jul 15;25(14):1715-21. doi: 10.1093/bioinformatics/btp312. Epub 2009 May 14.
2
Genome-wide analysis of transcription factor binding sites based on ChIP-Seq data.基于染色质免疫沉淀测序(ChIP-Seq)数据的转录因子结合位点全基因组分析。
Nat Methods. 2008 Sep;5(9):829-34. doi: 10.1038/nmeth.1246.
3
PeakSeq enables systematic scoring of ChIP-seq experiments relative to controls.
BertSNR:一种基于 DNA 语言模型的单核苷酸分辨率转录因子结合位点识别的可解释深度学习框架。
Bioinformatics. 2024 Aug 2;40(8). doi: 10.1093/bioinformatics/btae461.
4
Validating a re-implementation of an algorithm to integrate transcriptome and ChIP-seq data.验证一种重新实现的算法,以整合转录组和 ChIP-seq 数据。
PeerJ. 2023 Oct 20;11:e16318. doi: 10.7717/peerj.16318. eCollection 2023.
5
Widespread effects of DNA methylation and intra-motif dependencies revealed by novel transcription factor binding models.新型转录因子结合模型揭示的 DNA 甲基化和基序内依赖的广泛影响。
Nucleic Acids Res. 2023 Oct 13;51(18):e95. doi: 10.1093/nar/gkad693.
6
Systematic Evaluation of DNA Sequence Variations on Transcription Factor Binding Affinity.DNA序列变异对转录因子结合亲和力的系统评估
Front Genet. 2021 Sep 9;12:667866. doi: 10.3389/fgene.2021.667866. eCollection 2021.
7
A Clustering Approach for Motif Discovery in ChIP-Seq Dataset.一种用于ChIP-Seq数据集中基序发现的聚类方法。
Entropy (Basel). 2019 Aug 16;21(8):802. doi: 10.3390/e21080802.
8
Promoter Architecture and Promoter Engineering in .中的启动子结构与启动子工程
Metabolites. 2020 Aug 6;10(8):320. doi: 10.3390/metabo10080320.
9
Set cover-based methods for motif selection.基于集合覆盖的 motif 选择方法。
Bioinformatics. 2020 Feb 15;36(4):1044-1051. doi: 10.1093/bioinformatics/btz697.
10
ProSampler: an ultrafast and accurate motif finder in large ChIP-seq datasets for combinatory motif discovery.ProSampler:一种在大型 ChIP-seq 数据集中用于组合基序发现的超快速和准确的基序查找器。
Bioinformatics. 2019 Nov 1;35(22):4632-4639. doi: 10.1093/bioinformatics/btz290.
PeakSeq能够相对于对照对ChIP-seq实验进行系统评分。
Nat Biotechnol. 2009 Jan;27(1):66-75. doi: 10.1038/nbt.1518. Epub 2009 Jan 4.
4
Empirical methods for controlling false positives and estimating confidence in ChIP-Seq peaks.控制ChIP-Seq峰中假阳性并估计置信度的经验方法。
BMC Bioinformatics. 2008 Dec 5;9:523. doi: 10.1186/1471-2105-9-523.
5
Design and analysis of ChIP-seq experiments for DNA-binding proteins.用于DNA结合蛋白的ChIP-seq实验的设计与分析。
Nat Biotechnol. 2008 Dec;26(12):1351-9. doi: 10.1038/nbt.1508. Epub 2008 Nov 16.
6
An integrated software system for analyzing ChIP-chip and ChIP-seq data.一个用于分析染色质免疫沉淀芯片(ChIP-chip)和染色质免疫沉淀测序(ChIP-seq)数据的集成软件系统。
Nat Biotechnol. 2008 Nov;26(11):1293-300. doi: 10.1038/nbt.1505. Epub 2008 Nov 2.
7
Model-based analysis of ChIP-Seq (MACS).基于模型的染色质免疫沉淀测序分析(MACS)
Genome Biol. 2008;9(9):R137. doi: 10.1186/gb-2008-9-9-r137. Epub 2008 Sep 17.
8
Genome-wide identification of in vivo protein-DNA binding sites from ChIP-Seq data.从ChIP-Seq数据中进行全基因组范围内体内蛋白质-DNA结合位点的鉴定。
Nucleic Acids Res. 2008 Sep;36(16):5221-31. doi: 10.1093/nar/gkn488. Epub 2008 Aug 6.
9
FindPeaks 3.1: a tool for identifying areas of enrichment from massively parallel short-read sequencing technology.FindPeaks 3.1:一种用于从大规模平行短读长测序技术中识别富集区域的工具。
Bioinformatics. 2008 Aug 1;24(15):1729-30. doi: 10.1093/bioinformatics/btn305. Epub 2008 Jul 3.
10
Genome-wide maps of chromatin state in pluripotent and lineage-committed cells.多能细胞和谱系定向细胞中染色质状态的全基因组图谱。
Nature. 2007 Aug 2;448(7153):553-60. doi: 10.1038/nature06008. Epub 2007 Jul 1.