Comprehensive human transcription factor binding site map for combinatory binding motifs discovery.

Affiliation

Computational Biology and Bioinformatics Group, Max Planck Institute for Molecular Biomedicine, Münster, Germany.

Publication information

PLoS One. 2012;7(11):e49086. doi: 10.1371/journal.pone.0049086. Epub 2012 Nov 28.

DOI:10.1371/journal.pone.0049086
PMID:23209563
Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC3509107/
Abstract

Knowing the map between transcription factors (TFs) and their binding sites is essential to reverse engineer the regulation process. Only about 10%-20% of the transcription factor binding motifs (TFBMs) have been reported. This lack of data hinders the understanding of gene regulation. To address this gap, we propose a computational method that exploits previously unused TF properties to discover the missing TFBMs and their sites in all human gene promoters. The method starts by predicting a dictionary of regulatory "DNA words." From this dictionary, it distills 4098 novel predictions. To disclose the crosstalk between motifs, an additional algorithm extracts TF combinatorial binding patterns, creating a collection of TF regulatory syntactic rules. Using these rules, we narrowed the predictions down to a list of 504 novel motifs that appear frequently in syntax patterns. We tested the predictions against 509 known motifs, confirming that our system can reliably predict ab initio motifs with an accuracy of 81%, far higher than previous approaches. We found that, on average, 90% of the discovered combinatorial binding patterns target at least 10 genes, suggesting that supplementary regulatory mechanisms are required to control smaller gene sets independently. Additionally, we discovered that the new TFBMs and their combinatorial patterns convey biological meaning, targeting TFs and genes related to developmental functions. Thus, among all the possible targets available in the genome, the TFs tend to regulate other TFs and genes involved in developmental functions. We provide a comprehensive resource for regulation analysis that includes a dictionary of "DNA words," newly predicted motifs, and their corresponding combinatorial patterns. Combinatorial patterns are a useful filter to discover TFBMs that play a major role in orchestrating other factors and thus are likely to lock/unlock cellular functional clusters.
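
The abstract does not spell out the algorithms, so the following is only an illustrative sketch of the general idea of a combinatorial binding pattern: motifs that co-occur in the promoters of several genes. It is not the authors' method. The function names, the exact-substring matching, the toy promoter sequences, and the `min_genes` threshold are all assumptions made for the example.

```python
from collections import defaultdict
from itertools import combinations


def find_sites(promoters, motifs):
    """Map each gene to the set of motifs found in its promoter.

    Assumption: a motif "binds" wherever it occurs as an exact substring.
    Real motif scanning would use position weight matrices instead.
    """
    return {
        gene: {motif for motif in motifs if motif in seq}
        for gene, seq in promoters.items()
    }


def combinatorial_patterns(hits, min_genes=2):
    """Return motif pairs that co-occur in at least `min_genes` promoters."""
    targets = defaultdict(set)
    for gene, present in hits.items():
        # Sort so each unordered pair is always keyed the same way.
        for pair in combinations(sorted(present), 2):
            targets[pair].add(gene)
    return {
        pair: genes for pair, genes in targets.items() if len(genes) >= min_genes
    }


# Hypothetical toy data: three short "promoters" and two candidate DNA words.
promoters = {
    "geneA": "TTGACGTCAAGGATAAGCC",
    "geneB": "CCGATAAGTTTGACGTCA",
    "geneC": "AAAAGATAAGAAAA",
}
motifs = ["TGACGTCA", "GATAAG"]

hits = find_sites(promoters, motifs)
patterns = combinatorial_patterns(hits, min_genes=2)

for pair, genes in patterns.items():
    print(pair, sorted(genes))
```

In this toy setup, raising `min_genes` plays the role of the filter described above: only motif combinations shared by enough target genes are kept as candidate regulatory rules.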

Figure 1: https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a415/3509107/38633e7d0f7f/pone.0049086.g001.jpg
Figure 2: https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a415/3509107/38f217d5616d/pone.0049086.g002.jpg
Figure 3: https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a415/3509107/4d73886af90a/pone.0049086.g003.jpg
Figure 4: https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a415/3509107/01fd1d98dfa6/pone.0049086.g004.jpg
Figure 5: https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a415/3509107/96e5ddf7cb6e/pone.0049086.g005.jpg
Figure 6: https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a415/3509107/6b382f250385/pone.0049086.g006.jpg
Figure 7: https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a415/3509107/d3d53dafc226/pone.0049086.g007.jpg
Figure 8: https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a415/3509107/07d5cb9f4937/pone.0049086.g008.jpg
