• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

再探基于蛋白质知识的势场提取条件。

Another look at the conditions for the extraction of protein knowledge-based potentials.

作者信息

Betancourt Marcos R

机构信息

Department of Physics, Indiana University Purdue University Indianapolis, Indianapolis, Indiana 46202, USA.

出版信息

Proteins. 2009 Jul;76(1):72-85. doi: 10.1002/prot.22320.

DOI:10.1002/prot.22320
PMID:19089977
Abstract

Protein knowledge-based potentials are effective free energies obtained from databases of known protein structures. They are used to parameterize coarse-grained protein models in many folding simulation and structure prediction methods. Two common approaches are used in the derivation of knowledge-based potentials. One assumes that the energy parameters optimize the native structure stability. The other assumes that interaction events are related to their energies according to the Boltzmann distribution, and that they are distributed independently of other events, that is, the quasi-chemical approximation. Here, these assumptions are systematically tested by extracting contact energies from artificial databases of lattice proteins with predefined pairwise contact energies. Databases of protein sequences are designed to either satisfy the Boltzmann distribution at high or low temperatures, or to simultaneously optimize the native stability and folding kinetics. It is found that the quasi-chemical approximation, with the ideal reference state, accurately reproduce the true energies for high temperature Boltzmann distributed sequences (weakly interacting residues), but less accurately at low temperatures, where the sequences correspond to energy minima and the residues are strongly interacting. To overcome this problem, an iterative procedure for Boltzmann distributed sequences is introduced, which accounts for interacting residue correlations and eliminates the need for the quasi-chemical approximation. In this case, the energies are accurately reproduced at any ensemble temperature. However, when the database of sequences designed for optimal stability and kinetics is used, the energy correlation is less than optimal using either method, exhibiting random and systematic deviations from linearity. Therefore, the assumption that native structures are maximally stable or that sequences are determined according to the Boltzmann distribution seems to be inadequate for obtaining accurate energies. The limited number of sequences in the database and the inhomogeneous concentration of amino acids from one structure to another do not seem to be major obstacles for improving the quality of the extracted pairwise energies, with the exception of repulsive interactions.

摘要

基于蛋白质知识的势能是从已知蛋白质结构数据库中获得的有效自由能。它们被用于许多折叠模拟和结构预测方法中对粗粒度蛋白质模型进行参数化。在基于知识的势能推导中使用了两种常见方法。一种方法假设能量参数可优化天然结构稳定性。另一种方法假设相互作用事件根据玻尔兹曼分布与其能量相关,并且它们独立于其他事件分布,即准化学近似。在此,通过从具有预定义成对接触能的晶格蛋白质人工数据库中提取接触能,对这些假设进行了系统测试。设计蛋白质序列数据库以满足高温或低温下的玻尔兹曼分布,或者同时优化天然稳定性和折叠动力学。结果发现,具有理想参考态的准化学近似能准确再现高温玻尔兹曼分布序列(弱相互作用残基)的真实能量,但在低温下准确性较低,此时序列对应于能量最小值且残基强烈相互作用。为克服此问题,引入了一种针对玻尔兹曼分布序列的迭代程序,该程序考虑了相互作用残基的相关性并消除了对准化学近似的需求。在这种情况下,在任何系综温度下都能准确再现能量。然而,当使用为优化稳定性和动力学而设计的序列数据库时,无论使用哪种方法,能量相关性都未达到最佳,表现出与线性的随机和系统偏差。因此,天然结构是最大程度稳定或序列根据玻尔兹曼分布确定的假设似乎不足以获得准确的能量。数据库中序列数量有限以及从一个结构到另一个结构氨基酸浓度不均匀,除了排斥相互作用外,似乎并不是提高提取的成对能量质量的主要障碍。

相似文献

1
Another look at the conditions for the extraction of protein knowledge-based potentials.再探基于蛋白质知识的势场提取条件。
Proteins. 2009 Jul;76(1):72-85. doi: 10.1002/prot.22320.
2
How to derive a protein folding potential? A new approach to an old problem.如何推导蛋白质折叠势?解决一个老问题的新方法。
J Mol Biol. 1996 Dec 20;264(5):1164-79. doi: 10.1006/jmbi.1996.0704.
3
Factors governing the foldability of proteins.影响蛋白质可折叠性的因素。
Proteins. 1996 Dec;26(4):411-41. doi: 10.1002/(SICI)1097-0134(199612)26:4<411::AID-PROT4>3.0.CO;2-E.
4
An empirical energy potential with a reference state for protein fold and sequence recognition.一种具有参考状态的经验能量势,用于蛋白质折叠和序列识别。
Proteins. 1999 Aug 15;36(3):357-69.
5
Accurate prediction for atomic-level protein design and its application in diversifying the near-optimal sequence space.原子水平蛋白质设计的准确预测及其在扩展近最优序列空间中的应用。
Proteins. 2009 May 15;75(3):682-705. doi: 10.1002/prot.22280.
6
Long- and short-range interactions in native protein structures are consistent/minimally frustrated in sequence space.天然蛋白质结构中的长程和短程相互作用在序列空间中是一致的/最小受挫的。
Proteins. 2003 Jan 1;50(1):35-43. doi: 10.1002/prot.10242.
7
Residue-residue potentials with a favorable contact pair term and an unfavorable high packing density term, for simulation and threading.具有有利接触对项和不利高堆积密度项的残基-残基势,用于模拟和穿线。
J Mol Biol. 1996 Mar 1;256(3):623-44. doi: 10.1006/jmbi.1996.0114.
8
Pairwise energies for polypeptide coarse-grained models derived from atomic force fields.源自原子力场的多肽粗粒度模型的成对能量。
J Chem Phys. 2009 May 21;130(19):195103. doi: 10.1063/1.3137045.
9
Lessons from the design of a novel atomic potential for protein folding.蛋白质折叠新型原子势设计中的经验教训。
Protein Sci. 2005 Jul;14(7):1741-52. doi: 10.1110/ps.051440705.
10
Kinetics of protein folding. A lattice model study of the requirements for folding to the native state.蛋白质折叠动力学。对折叠成天然状态所需条件的晶格模型研究。
J Mol Biol. 1994 Feb 4;235(5):1614-36. doi: 10.1006/jmbi.1994.1110.

引用本文的文献

1
Thermodynamics of Hydrophobic Amino Acids in Solution: A Combined Experimental-Computational Study.溶液中疏水氨基酸的热力学:一项实验与计算相结合的研究。
J Phys Chem Lett. 2017 Jan 19;8(2):347-351. doi: 10.1021/acs.jpclett.6b02673. Epub 2017 Jan 3.
2
COFFDROP: A Coarse-Grained Nonbonded Force Field for Proteins Derived from All-Atom Explicit-Solvent Molecular Dynamics Simulations of Amino Acids.COFFDROP:一种基于氨基酸全原子显式溶剂分子动力学模拟推导得到的蛋白质粗粒度非键合力场。
J Chem Theory Comput. 2014 Nov 11;10(11):5178-5194. doi: 10.1021/ct5006328. Epub 2014 Oct 7.
3
An Anisotropic Coarse-Grained Model for Proteins Based On Gay-Berne and Electric Multipole Potentials.
一种基于盖伊-伯尔尼势和电多极势的蛋白质各向异性粗粒度模型。
J Chem Theory Comput. 2014 Feb 10;10(2):731-750. doi: 10.1021/ct400974z.
4
Recovering physical potentials from a model protein databank.从模型蛋白质数据库中恢复物理势能。
Proc Natl Acad Sci U S A. 2010 Nov 16;107(46):19867-72. doi: 10.1073/pnas.1006428107. Epub 2010 Nov 1.
5
Multiscale coarse-graining of the protein energy landscape.蛋白质能量景观的多尺度粗粒化。
PLoS Comput Biol. 2010 Jun 24;6(6):e1000827. doi: 10.1371/journal.pcbi.1000827.