• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

使用 PADA1(蛋白质辅助 DNA 组装 1)进行 FoldX 精确的结构蛋白-DNA 结合预测。

FoldX accurate structural protein-DNA binding prediction using PADA1 (Protein Assisted DNA Assembly 1).

机构信息

Centre for Genomic Regulation (CRG), The Barcelona Institute for Science and Technology, Dr. Aiguader 88, 08003 Barcelona, Spain.

Universitat Pompeu Fabra (UPF), Barcelona, Spain.

出版信息

Nucleic Acids Res. 2018 May 4;46(8):3852-3863. doi: 10.1093/nar/gky228.

DOI:10.1093/nar/gky228
PMID:29608705
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC5934639/
Abstract

The speed at which new genomes are being sequenced highlights the need for genome-wide methods capable of predicting protein-DNA interactions. Here, we present PADA1, a generic algorithm that accurately models structural complexes and predicts the DNA-binding regions of resolved protein structures. PADA1 relies on a library of protein and double-stranded DNA fragment pairs obtained from a training set of 2103 DNA-protein complexes. It includes a fast statistical force field computed from atom-atom distances, to evaluate and filter the 3D docking models. Using published benchmark validation sets and 212 DNA-protein structures published after 2016 we predicted the DNA-binding regions with an RMSD of <1.8 Å per residue in >95% of the cases. We show that the quality of the docked templates is compatible with FoldX protein design tool suite to identify the crystallized DNA molecule sequence as the most energetically favorable in 80% of the cases. We highlighted the biological potential of PADA1 by reconstituting DNA and protein conformational changes upon protein mutagenesis of a meganuclease and its variants, and by predicting DNA-binding regions and nucleotide sequences in proteins crystallized without DNA. These results opens up new perspectives for the engineering of DNA-protein interfaces.

摘要

新基因组测序的速度凸显了对能够预测蛋白质-DNA 相互作用的全基因组方法的需求。在这里,我们提出了 PADA1,这是一种通用算法,能够准确地模拟结构复合物,并预测已解析蛋白质结构的 DNA 结合区域。PADA1 依赖于从 2103 个 DNA-蛋白质复合物的训练集中获得的蛋白质和双链 DNA 片段对的库。它包括一个从原子间距离计算得出的快速统计力场,用于评估和筛选 3D 对接模型。使用已发表的基准验证集和 2016 年后发表的 212 个 DNA-蛋白质结构,我们预测 DNA 结合区域的 RMSD 小于 1.8 Å/残基,在>95%的情况下。我们表明,对接模板的质量与 FoldX 蛋白质设计工具套件兼容,可识别结晶 DNA 分子序列作为 80%情况下最具能量优势的序列。我们通过重新构建在核酸酶及其变体的蛋白质突变后 DNA 和蛋白质构象变化,以及预测没有 DNA 的蛋白质结晶中的 DNA 结合区域和核苷酸序列,突出了 PADA1 的生物学潜力。这些结果为 DNA-蛋白质界面的工程设计开辟了新的前景。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/28cd/5934639/52dd3cd22e25/gky228fig7.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/28cd/5934639/92d4879e3558/gky228fig1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/28cd/5934639/6524fa088d2f/gky228fig2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/28cd/5934639/2b7f415c3c1d/gky228fig3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/28cd/5934639/e4ea4c50d6f4/gky228fig4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/28cd/5934639/01533b970895/gky228fig5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/28cd/5934639/e4ceefcbe578/gky228fig6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/28cd/5934639/52dd3cd22e25/gky228fig7.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/28cd/5934639/92d4879e3558/gky228fig1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/28cd/5934639/6524fa088d2f/gky228fig2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/28cd/5934639/2b7f415c3c1d/gky228fig3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/28cd/5934639/e4ea4c50d6f4/gky228fig4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/28cd/5934639/01533b970895/gky228fig5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/28cd/5934639/e4ceefcbe578/gky228fig6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/28cd/5934639/52dd3cd22e25/gky228fig7.jpg

相似文献

1
FoldX accurate structural protein-DNA binding prediction using PADA1 (Protein Assisted DNA Assembly 1).使用 PADA1(蛋白质辅助 DNA 组装 1)进行 FoldX 精确的结构蛋白-DNA 结合预测。
Nucleic Acids Res. 2018 May 4;46(8):3852-3863. doi: 10.1093/nar/gky228.
2
Protein-assisted RNA fragment docking (RnaX) for modeling RNA-protein interactions using ModelX.使用 ModelX 通过蛋白辅助 RNA 片段对接(RnaX)进行 RNA-蛋白相互作用建模。
Proc Natl Acad Sci U S A. 2019 Dec 3;116(49):24568-24573. doi: 10.1073/pnas.1910999116. Epub 2019 Nov 15.
3
Survey of Computational Approaches for Prediction of DNA-Binding Residues on Protein Surfaces.蛋白质表面DNA结合残基预测的计算方法综述。
Methods Mol Biol. 2018;1754:223-234. doi: 10.1007/978-1-4939-7717-8_13.
4
Automated structure-based prediction of functional sites in proteins: applications to assessing the validity of inheriting protein function from homology in genome annotation and to protein docking.基于结构的蛋白质功能位点自动预测:应用于评估基因组注释中同源蛋白功能继承的有效性及蛋白质对接。
J Mol Biol. 2001 Aug 10;311(2):395-408. doi: 10.1006/jmbi.2001.4870.
5
Structural models of protein-DNA complexes based on interface prediction and docking.基于界面预测和对接的蛋白质-DNA 复合物结构模型。
Curr Protein Pept Sci. 2011 Sep;12(6):531-9. doi: 10.2174/138920311796957694.
6
A discriminatory function for prediction of protein-DNA interactions based on alpha shape modeling.基于 Alpha 形状建模的蛋白质-DNA 相互作用预测的判别函数。
Bioinformatics. 2010 Oct 15;26(20):2541-8. doi: 10.1093/bioinformatics/btq478. Epub 2010 Aug 23.
7
A comprehensive comparative review of sequence-based predictors of DNA- and RNA-binding residues.基于序列的DNA和RNA结合残基预测因子的全面比较综述。
Brief Bioinform. 2016 Jan;17(1):88-105. doi: 10.1093/bib/bbv023. Epub 2015 May 1.
8
PSSM-based prediction of DNA binding sites in proteins.基于位置特异性得分矩阵的蛋白质中DNA结合位点预测
BMC Bioinformatics. 2005 Feb 19;6:33. doi: 10.1186/1471-2105-6-33.
9
DOCKGROUND system of databases for protein recognition studies: unbound structures for docking.用于蛋白质识别研究的DOCKGROUND数据库系统:用于对接的未结合结构。
Proteins. 2007 Dec 1;69(4):845-51. doi: 10.1002/prot.21714.
10
Predicting target DNA sequences of DNA-binding proteins based on unbound structures.基于未结合结构预测 DNA 结合蛋白的靶 DNA 序列。
PLoS One. 2012;7(2):e30446. doi: 10.1371/journal.pone.0030446. Epub 2012 Feb 1.

引用本文的文献

1
Artificial intelligence and first-principle methods in protein redesign: A marriage of convenience?蛋白质重新设计中的人工智能与第一性原理方法:权宜之计的结合?
Protein Sci. 2025 Aug;34(8):e70210. doi: 10.1002/pro.70210.
2
A computational workflow for analysis of missense mutations in precision oncology.一种用于精准肿瘤学中错义突变分析的计算工作流程。
J Cheminform. 2024 Jul 29;16(1):86. doi: 10.1186/s13321-024-00876-3.
3
PredictONCO: a web tool supporting decision-making in precision oncology by extending the bioinformatics predictions with advanced computing and machine learning.

本文引用的文献

1
The structure and polymerase-recognition mechanism of the crucial adaptor protein AND-1 in the human replisome.人类复制体中关键衔接蛋白AND-1的结构与聚合酶识别机制
J Biol Chem. 2017 Jun 9;292(23):9627-9636. doi: 10.1074/jbc.M116.758524. Epub 2017 Apr 5.
2
HU histone-like DNA-binding protein from Thermus thermophilus: structural and evolutionary analyses.嗜热栖热菌的HU类组蛋白DNA结合蛋白:结构与进化分析
Extremophiles. 2016 Sep;20(5):695-709. doi: 10.1007/s00792-016-0859-1. Epub 2016 Jun 24.
3
NPDock: a web server for protein-nucleic acid docking.
PredictONCO:一个通过扩展生物信息学预测并结合先进计算和机器学习来支持精准肿瘤学决策的网络工具。
Brief Bioinform. 2023 Nov 22;25(1). doi: 10.1093/bib/bbad441.
4
Computational design of novel Cas9 PAM-interacting domains using evolution-based modelling and structural quality assessment.基于进化建模和结构质量评估的新型 Cas9 PAM 相互作用结构域的计算设计。
PLoS Comput Biol. 2023 Nov 17;19(11):e1011621. doi: 10.1371/journal.pcbi.1011621. eCollection 2023 Nov.
5
Correspondence between functional scores from deep mutational scans and predicted effects on protein stability.深突变扫描的功能评分与预测对蛋白质稳定性影响之间的对应关系。
Protein Sci. 2023 Jul;32(7):e4688. doi: 10.1002/pro.4688.
6
Bacterial expression of a designed single-chain IL-10 prevents severe lung inflammation.设计的单链 IL-10 的细菌表达可预防严重的肺部炎症。
Mol Syst Biol. 2023 Jan;19(1):e11037. doi: 10.15252/msb.202211037. Epub 2023 Jan 4.
7
Identification of Heme Oxygenase-1 as a Putative DNA-Binding Protein.鉴定血红素加氧酶-1为一种假定的DNA结合蛋白。
Antioxidants (Basel). 2022 Oct 28;11(11):2135. doi: 10.3390/antiox11112135.
8
LYRUS: a machine learning model for predicting the pathogenicity of missense variants.LYRUS:一种用于预测错义变异致病性的机器学习模型。
Bioinform Adv. 2021 Dec 25;2(1):vbab045. doi: 10.1093/bioadv/vbab045. eCollection 2022.
9
Gene Editing in Pluripotent Stem Cells and Their Derived Organoids.多能干细胞及其衍生类器官中的基因编辑
Stem Cells Int. 2021 Nov 30;2021:8130828. doi: 10.1155/2021/8130828. eCollection 2021.
10
Methods for Molecular Modelling of Protein Complexes.蛋白质复合物的分子建模方法。
Methods Mol Biol. 2021;2305:53-80. doi: 10.1007/978-1-0716-1406-8_3.
NPDock:一个用于蛋白质-核酸对接的网络服务器。
Nucleic Acids Res. 2015 Jul 1;43(W1):W425-30. doi: 10.1093/nar/gkv493. Epub 2015 May 14.
4
do_x3dna: a tool to analyze structural fluctuations of dsDNA or dsRNA from molecular dynamics simulations.do_x3dna:一种用于通过分子动力学模拟分析双链DNA或双链RNA结构波动的工具。
Bioinformatics. 2015 Aug 1;31(15):2583-5. doi: 10.1093/bioinformatics/btv190. Epub 2015 Apr 2.
5
An overview of the prediction of protein DNA-binding sites.蛋白质DNA结合位点预测综述。
Int J Mol Sci. 2015 Mar 6;16(3):5194-215. doi: 10.3390/ijms16035194.
6
BuD, a helix-loop-helix DNA-binding domain for genome modification.BuD,一种用于基因组修饰的螺旋-环-螺旋DNA结合结构域。
Acta Crystallogr D Biol Crystallogr. 2014 Jul;70(Pt 7):2042-52. doi: 10.1107/S1399004714011183. Epub 2014 Jun 29.
7
A new method for evaluating the specificity of indirect readout in protein-DNA recognition.一种评估蛋白质-DNA 识别中间接读出特异性的新方法。
Nucleic Acids Res. 2012 Sep 1;40(17):e129. doi: 10.1093/nar/gks462. Epub 2012 May 22.
8
Attributes of short linear motifs.短线性基序的属性。
Mol Biosyst. 2012 Jan;8(1):268-81. doi: 10.1039/c1mb05231d. Epub 2011 Sep 12.
9
ParaDock: a flexible non-specific DNA--rigid protein docking algorithm.ParaDock:一种灵活的非特异性 DNA-刚性蛋白对接算法。
Nucleic Acids Res. 2011 Nov 1;39(20):e135. doi: 10.1093/nar/gkr620. Epub 2011 Aug 10.
10
MetaDBSite: a meta approach to improve protein DNA-binding sites prediction.MetaDBSite:一种改进蛋白质DNA结合位点预测的元方法。
BMC Syst Biol. 2011 Jun 20;5 Suppl 1(Suppl 1):S7. doi: 10.1186/1752-0509-5-S1-S7.