
Predicting target DNA sequences of DNA-binding proteins based on unbound structures.

Affiliation

Department of Bio-Industrial Mechatronics Engineering, National Taiwan University, Taipei, Taiwan.

Publication information

PLoS One. 2012;7(2):e30446. doi: 10.1371/journal.pone.0030446. Epub 2012 Feb 1.

DOI: 10.1371/journal.pone.0030446
PMID: 22312425
Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC3270014/
Abstract

DNA-binding proteins such as transcription factors use DNA-binding domains (DBDs) to bind to specific sequences in the genome to initiate many important biological functions. Accurate prediction of such target sequences, often represented by position weight matrices (PWMs), is an important step to understand many biological processes. Recent studies have shown that knowledge-based potential functions can be applied on protein-DNA co-crystallized structures to generate PWMs that are considerably consistent with experimental data. However, this success has not been extended to DNA-binding proteins lacking co-crystallized structures. This study aims at investigating the possibility of predicting the DNA sequences bound by DNA-binding proteins from the proteins' unbound structures (structures of the unbound state). Given an unbound query protein and a template complex, the proposed method first employs structure alignment to generate synthetic protein-DNA complexes for the query protein. Once a complex is available, an atomic-level knowledge-based potential function is employed to predict PWMs characterizing the sequences to which the query protein can bind. The evaluation of the proposed method is based on seven DNA-binding proteins, which have structures of both DNA-bound and unbound forms for prediction as well as annotated PWMs for validation. Since this work is the first attempt to predict target sequences of DNA-binding proteins from their unbound structures, three types of structural variations that presumably influence the prediction accuracy were examined and discussed. Based on the analyses conducted in this study, the conformational change of proteins upon binding DNA was shown to be the key factor. 
This study sheds light on the challenge of predicting the target DNA sequences of a protein lacking co-crystallized structures, which encourages more efforts on the structure alignment-based approaches in addition to docking- and homology modeling-based approaches for generating synthetic complexes.
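
The abstract's second step, turning scores from a knowledge-based potential into a PWM, can be illustrated with a minimal sketch. The abstract does not say how the paper converts energies into matrix entries, so the Boltzmann weighting used here, the `kT` parameter, the function names, and the toy energy values are all assumptions made for illustration, not the authors' method.

```python
import math

BASES = "ACGT"


def energies_to_pwm(energies, kT=1.0):
    """Convert per-position base energies into a probability matrix.

    `energies` is a list with one dict per DNA position, mapping each base
    to a (hypothetical) interaction energy; lower means more favorable.
    Boltzmann weighting is an assumed conversion, not the paper's formula.
    """
    pwm = []
    for column in energies:
        # Shift by the column minimum so the exponentials stay well scaled.
        e_min = min(column[b] for b in BASES)
        weights = {b: math.exp(-(column[b] - e_min) / kT) for b in BASES}
        total = sum(weights.values())
        pwm.append({b: weights[b] / total for b in BASES})
    return pwm


def consensus(pwm):
    """Return the most probable base at each position (ties go to the first base in BASES)."""
    return "".join(max(BASES, key=lambda b: col[b]) for col in pwm)


# Toy energies for a 3-bp site (invented values, for illustration only).
toy_energies = [
    {"A": -2.0, "C": 0.0, "G": 0.0, "T": 0.0},   # strong preference for A
    {"A": 0.0, "C": 0.0, "G": -1.5, "T": 0.0},   # preference for G
    {"A": 0.5, "C": 0.5, "G": 0.5, "T": 0.5},    # no preference
]
pwm = energies_to_pwm(toy_energies)
site = consensus(pwm)
```

In a real pipeline the energies would come from scoring each possible base at each position of the synthetic protein-DNA complex. A larger `kT` flattens the matrix and a smaller one sharpens it, so that parameter would need to be calibrated against annotated PWMs.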


https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1345/3270014/a31f8e34ae27/pone.0030446.g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1345/3270014/4ab78c229c65/pone.0030446.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1345/3270014/d718ddb884c2/pone.0030446.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1345/3270014/a4f90ecc31a7/pone.0030446.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1345/3270014/4639ad4b91a6/pone.0030446.g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1345/3270014/c1cf4cacce7b/pone.0030446.g005.jpg
