• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

EGR家族蛋白质识别DNA的概率编码。

Probabilistic code for DNA recognition by proteins of the EGR family.

作者信息

Benos Panayiotis V, Lapedes Alan S, Stormo Gary D

机构信息

Department of Genetics, School of Medicine, Washington University, Campus Box 8232, St. Louis, MO 63110, USA.

出版信息

J Mol Biol. 2002 Nov 1;323(4):701-27. doi: 10.1016/s0022-2836(02)00917-8.

DOI:10.1016/s0022-2836(02)00917-8
PMID:12419259
Abstract

A recognition code for protein-DNA interactions would allow for the prediction of binding sites based on protein sequence, and the identification of binding proteins for specific DNA targets. Crystallographic studies of protein-DNA complexes showed that a simple, deterministic recognition code does not exist. Here, we present a probabilistic recognition code (P-code) that assigns energies to all possible base-pair-amino acid interactions for the early growth response factor (EGR) family of zinc-finger transcription factors. The specific energy values are determined by a maximum likelihood method using examples from in vitro randomisation experiments (namely, SELEX and phage display) reported in the literature. The accuracy of the model is tested in several ways, including the ability to predict in vivo binding sites of EGR proteins and other non-EGR zinc-finger proteins, and the correlation between predicted and measured binding affinities of various EGR proteins to several different DNA sites. We also show that this model improves significantly upon the prediction capabilities of previous qualitative and quantitative models. The probabilistic code we develop uses information about the interacting positions between the protein and DNA, but we show that such information is not necessary, although it reduces the number of parameters to be determined. We also employ the assumption that the total binding energy is the sum of the energies of the individual contacts, but we describe how that assumption can be relaxed at the cost of additional parameters.

摘要

蛋白质 - DNA相互作用的识别码将有助于基于蛋白质序列预测结合位点,并识别特定DNA靶标的结合蛋白。蛋白质 - DNA复合物的晶体学研究表明,不存在简单的、确定性的识别码。在此,我们提出一种概率识别码(P码),它为锌指转录因子的早期生长反应因子(EGR)家族的所有可能碱基对 - 氨基酸相互作用赋予能量。特定能量值通过最大似然法,利用文献中报道的体外随机化实验(即SELEX和噬菌体展示)的实例来确定。该模型的准确性通过多种方式进行测试,包括预测EGR蛋白和其他非EGR锌指蛋白的体内结合位点的能力,以及各种EGR蛋白与几个不同DNA位点的预测结合亲和力与测量结合亲和力之间的相关性。我们还表明,该模型在预测能力上比以前的定性和定量模型有显著提高。我们开发的概率码使用了有关蛋白质和DNA之间相互作用位置的信息,但我们表明,尽管这会减少待确定的参数数量,但此类信息并非必需。我们还采用了总结合能是各个接触点能量之和的假设,但我们描述了如何以增加参数为代价放宽该假设。

相似文献

1
Probabilistic code for DNA recognition by proteins of the EGR family.EGR家族蛋白质识别DNA的概率编码。
J Mol Biol. 2002 Nov 1;323(4):701-27. doi: 10.1016/s0022-2836(02)00917-8.
2
Analysis of zinc fingers optimized via phage display: evaluating the utility of a recognition code.通过噬菌体展示优化的锌指分析:评估识别密码的效用。
J Mol Biol. 1999 Feb 5;285(5):1917-34. doi: 10.1006/jmbi.1998.2421.
3
Context-dependent DNA recognition code for C2H2 zinc-finger transcription factors.C2H2型锌指转录因子的上下文依赖型DNA识别密码
Bioinformatics. 2008 Sep 1;24(17):1850-7. doi: 10.1093/bioinformatics/btn331. Epub 2008 Jun 27.
4
Quantitative analysis of EGR proteins binding to DNA: assessing additivity in both the binding site and the protein.与DNA结合的EGR蛋白的定量分析:评估结合位点和蛋白中的加和性。
BMC Bioinformatics. 2005 Jul 13;6:176. doi: 10.1186/1471-2105-6-176.
5
Rearrangement of side-chains in a Zif268 mutant highlights the complexities of zinc finger-DNA recognition.Zif268突变体中侧链的重排突出了锌指-DNA识别的复杂性。
J Mol Biol. 2001 Oct 19;313(2):309-15. doi: 10.1006/jmbi.2001.4975.
6
Coordinate expression and distinct DNA-binding characteristics of the four EGR-zinc finger proteins in Jurkat T lymphocytes.Jurkat T淋巴细胞中四种早期生长反应(EGR)锌指蛋白的协同表达及独特的DNA结合特性
Immunobiology. 1997 Dec;198(1-3):179-91. doi: 10.1016/S0171-2985(97)80039-3.
7
High affinity binding sites for the Wilms' tumour suppressor protein WT1.肾母细胞瘤抑癌蛋白WT1的高亲和力结合位点。
Nucleic Acids Res. 1995 Jan 25;23(2):277-84. doi: 10.1093/nar/23.2.277.
8
DNA recognition code of transcription factors in the helix-turn-helix, probe helix, hormone receptor, and zinc finger families.螺旋-转角-螺旋、探测螺旋、激素受体和锌指家族中转录因子的DNA识别密码。
Proc Natl Acad Sci U S A. 1994 Dec 20;91(26):12357-61. doi: 10.1073/pnas.91.26.12357.
9
Combining SELEX with quantitative assays to rapidly obtain accurate models of protein-DNA interactions.将指数富集的配体系统进化(SELEX)与定量分析相结合,以快速获得蛋白质 - DNA 相互作用的精确模型。
Nucleic Acids Res. 2005 Sep 25;33(17):e141. doi: 10.1093/nar/gni139.
10
Characterization of an early growth response gene, which encodes a zinc finger transcription factor, potentially involved in cell cycle regulation.一种早期生长反应基因的特征分析,该基因编码一种锌指转录因子,可能参与细胞周期调控。
Mol Endocrinol. 1995 Nov;9(11):1610-20. doi: 10.1210/mend.9.11.8584037.

引用本文的文献

1
Predicting the DNA binding specificity of transcription factor mutants using family-level biophysically interpretable machine learning.利用家族水平的具有生物物理可解释性的机器学习预测转录因子突变体的DNA结合特异性。
Nucleic Acids Res. 2025 Aug 27;53(16). doi: 10.1093/nar/gkaf831.
2
Predicting the DNA binding specificity of transcription factor mutants using family-level biophysically interpretable machine learning.利用家族水平的具有生物物理可解释性的机器学习预测转录因子突变体的DNA结合特异性
bioRxiv. 2025 Apr 2:2024.01.24.577115. doi: 10.1101/2024.01.24.577115.
3
Early growth response 1 transcription factor is essential for the pathogenic properties of human endometriotic epithelial cells.
早期生长反应因子 1 转录因子对于人子宫内膜异位症上皮细胞的致病特性是必需的。
Reproduction. 2022 Jun 27;164(2):41-54. doi: 10.1530/REP-22-0123. Print 2022 Aug 1.
4
Bayesian Markov models improve the prediction of binding motifs beyond first order.贝叶斯马尔可夫模型超越一阶,改进了对结合基序的预测。
NAR Genom Bioinform. 2021 Apr 20;3(2):lqab026. doi: 10.1093/nargab/lqab026. eCollection 2021 Jun.
5
On the prediction of DNA-binding preferences of C2H2-ZF domains using structural models: application on human CTCF.利用结构模型预测C2H2型锌指结构域的DNA结合偏好性:在人类CTCF上的应用
NAR Genom Bioinform. 2020 Jul 1;2(3):lqaa046. doi: 10.1093/nargab/lqaa046. eCollection 2020 Sep.
6
A missense mutation separates distinct functions of the Zic-family transcription factor REF-2.一个错义突变区分了Zic家族转录因子REF-2的不同功能。
MicroPubl Biol. 2020 Mar 16;2020. doi: 10.17912/micropub.biology.000232.
7
Sharing DNA-binding information across structurally similar proteins enables accurate specificity determination.在结构相似的蛋白质之间共享 DNA 结合信息可实现特异性的准确判断。
Nucleic Acids Res. 2020 Jan 24;48(2):e9. doi: 10.1093/nar/gkz1087.
8
The folded k-spectrum kernel: A machine learning approach to detecting transcription factor binding sites with gapped nucleotide dependencies.折叠k谱核:一种利用有间隙核苷酸依赖性检测转录因子结合位点的机器学习方法。
PLoS One. 2017 Oct 5;12(10):e0185570. doi: 10.1371/journal.pone.0185570. eCollection 2017.
9
Identification of Position-Specific Correlations between DNA-Binding Domains and Their Binding Sites. Application to the MerR Family of Transcription Factors.DNA结合结构域与其结合位点之间位置特异性相关性的鉴定。应用于MerR转录因子家族。
PLoS One. 2016 Sep 30;11(9):e0162681. doi: 10.1371/journal.pone.0162681. eCollection 2016.
10
Determination of specificity influencing residues for key transcription factor families.关键转录因子家族特异性影响残基的确定。
Quant Biol. 2015 Sep 1;3(3):115-123. doi: 10.1007/s40484-015-0045-y. Epub 2015 Jun 16.