Suppr超能文献

源自数据库的电势对于正向和反向蛋白质折叠评分是否有效?

Are database-derived potentials valid for scoring both forward and inverted protein folding?

作者信息

Rooman M J, Wodak S J

机构信息

Unité de Conformation des Macromolécules Biologiques, Université Libre de Bruxelles, Belgium.

出版信息

Protein Eng. 1995 Sep;8(9):849-58. doi: 10.1093/protein/8.9.849.

Abstract

Database-derived potentials, compiled from frequencies of sequence and structure features, are often used for scoring the compatibility of protein sequences and conformations. It is often believed that these scores correspond to differences in free energy with, in addition, a term containing the partition function of the system. Since this function does not depend on the conformation, the potentials are considered to be valid for scoring the compatibility of different conformations with a given sequence ('forward folding'), but not of sequences with a given structure ('inverted folding'). This interpretation is questioned here. It is argued that when many body-effects, which dominate frequencies compiled from the protein database, are corrected for, the potentials approximate a physically meaningful free energy difference from which the partition function term cancels out. It is the difference between the free energy of a given sequence in a specific conformation and that of the same sequence in a denatured-like state. Two examples of denatured-like states are discussed. Depending on the considered state, the free energy difference reduces to the commonly used scoring scheme, or contains additional terms that depend on the sequence. In both cases, all the terms can be derived from sequence-structure frequencies in the database. Such free energy difference, commonly defined as the folding free energy, is a measure of protein stability and can be used for scoring both forward and inverted protein folding. The implications for the use of knowledge-based potentials in protein structure prediction are described. Finally, the difficulty of designing tests that could validate the proposed approach, and the inherent limitations of such tests, are discussed.

摘要

从序列和结构特征频率编译而来的数据库衍生势通常用于评估蛋白质序列和构象的兼容性。人们通常认为这些分数对应于自由能的差异,此外还包含一个包含系统配分函数的项。由于这个函数不依赖于构象,所以这些势被认为对于评估给定序列与不同构象的兼容性(“正向折叠”)是有效的,但对于具有给定结构的序列与不同构象的兼容性(“反向折叠”)则无效。这里对这种解释提出了质疑。有人认为,当校正了主导从蛋白质数据库编译的频率的多体效应时,这些势近似于一个物理上有意义的自由能差,其中配分函数项相互抵消。它是特定构象中给定序列的自由能与变性样状态下相同序列的自由能之间的差异。讨论了两种变性样状态的例子。根据所考虑的状态,自由能差简化为常用的评分方案,或者包含依赖于序列的附加项。在这两种情况下,所有项都可以从数据库中的序列 - 结构频率推导出来。这种通常定义为折叠自由能的自由能差是蛋白质稳定性的一种度量,可用于评估正向和反向蛋白质折叠。描述了在蛋白质结构预测中使用基于知识的势的意义。最后,讨论了设计能够验证所提出方法的测试的困难以及此类测试的固有局限性。

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验