Evaluation of methods for modeling transcription factor sequence specificity.

Affiliation

Banting and Best Department of Medical Research and Donnelly Centre, University of Toronto, Toronto, Ontario, Canada.

Publication information

Nat Biotechnol. 2013 Feb;31(2):126-34. doi: 10.1038/nbt.2486. Epub 2013 Jan 27.

DOI:10.1038/nbt.2486
PMID:23354101
Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC3687085/
Abstract

Genomic analyses often involve scanning for potential transcription factor (TF) binding sites using models of the sequence specificity of DNA binding proteins. Many approaches have been developed to model and learn a protein's DNA-binding specificity, but these methods have not been systematically compared. Here we applied 26 such approaches to in vitro protein binding microarray data for 66 mouse TFs belonging to various families. For nine TFs, we also scored the resulting motif models on in vivo data, and found that the best in vitro-derived motifs performed similarly to motifs derived from the in vivo data. Our results indicate that simple models based on mononucleotide position weight matrices trained by the best methods perform similarly to more complex models for most TFs examined, but fall short in specific cases (<10% of the TFs examined here). In addition, the best-performing motifs typically have relatively low information content, consistent with widespread degeneracy in eukaryotic TF sequence preferences.
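To make the abstract's terms concrete, the following is a minimal sketch of how a mononucleotide position weight matrix can be used to scan a sequence, and how a motif's information content can be computed. It is not the paper's evaluation pipeline or any of the 26 methods compared. The function names, the pseudocount, the uniform background, and the four-position toy motif are all assumptions invented for illustration.

```python
import math

BASES = "ACGT"

def pfm_to_pwm(pfm, background=0.25, pseudocount=0.01):
    """Convert per-position base probabilities into log2-odds weights.

    The uniform background and the pseudocount are illustrative assumptions.
    """
    pwm = []
    for column in pfm:
        total = sum(column[b] + pseudocount for b in BASES)
        pwm.append({
            b: math.log2(((column[b] + pseudocount) / total) / background)
            for b in BASES
        })
    return pwm

def information_content(pfm):
    """Total information content in bits, assuming a uniform background."""
    bits = 0.0
    for column in pfm:
        entropy = -sum(p * math.log2(p) for p in column.values() if p > 0)
        bits += 2.0 - entropy  # 2 bits is the maximum for a 4-letter alphabet
    return bits

def scan(sequence, pwm):
    """Score every window of the sequence; return (offset, score) pairs."""
    width = len(pwm)
    hits = []
    for start in range(len(sequence) - width + 1):
        window = sequence[start:start + width]
        # Mononucleotide model: positions contribute independently and add up.
        score = sum(pwm[i][base] for i, base in enumerate(window))
        hits.append((start, score))
    return hits

# Hypothetical motif, not taken from the paper or any database.
toy_pfm = [
    {"A": 0.7, "C": 0.1, "G": 0.1, "T": 0.1},
    {"A": 0.1, "C": 0.1, "G": 0.7, "T": 0.1},
    {"A": 0.1, "C": 0.7, "G": 0.1, "T": 0.1},
    {"A": 0.25, "C": 0.25, "G": 0.25, "T": 0.25},
]
toy_pwm = pfm_to_pwm(toy_pfm)
hits = scan("TTAGCATT", toy_pwm)
best_offset, best_score = max(hits, key=lambda hit: hit[1])
print(best_offset, round(best_score, 2), round(information_content(toy_pfm), 2))
```

The additive score in `scan` is what makes this a "simple" model in the abstract's sense: it cannot represent dependencies between positions, which is the kind of limitation the more complex models address. A degenerate motif such as the toy one, where no position is fully specified, has low information content relative to the 2 bits per position maximum.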

