Zhang C
Department of Biomedical Engineering, Boston University, Massachusetts 02215-2407, USA.
Proteins. 1998 May 15;31(3):299-308.
In this study, we exploited an elementary 2-dimensional square lattice model of HP polymers to test the premise of extracting contact energies from protein structures. Given a set of prespecified energies for H-H, H-P, and P-P contacts, all possible sequences of various lengths were exhaustively enumerated to find sequences that have unique lowest-energy conformations. The lowest-energy structures (or native structures) of such (native) sequences were used to extract contact energies using the Miyazawa-Jernigan procedure and here-defined reference state. The relative magnitudes of the original energies were restored reasonably well, but the extracted contact energies were independent of the absolute magnitudes of the initial energies. We turned to a more detailed characterization of the energy landscapes of the native sequences in light of a new theoretical framework on protein folding. Foldability of such sequences imposes two limits on the absolute value of the prespecified energies: a lower bound entailed by the minimum requirement for thermodynamic stability and an upper bound associated with the entrapment of the chain to local minima. We found that these two limits confine the prespecified energy values to a rather narrow range which, surprisingly, also contains the extracted energies in all the cases examined. These results indicate that the quasi-chemical approximation can be used to connect quantitatively the occurrence of various residue-residue contacts in an ensemble of native structures with the energies of the contacts. More importantly, they suggest that the extracted contact energies do contain information on structural stability and can be used to estimate actual structural energetics. This study also encourages the use of structure-derived contact energies in threading. The finding that there is a rather narrow range of energies that are optimal for folding a sequence also cautions the use of arbitrary energy Hamiltonion in minimal folding models.
在本研究中,我们利用HP聚合物的二维基本方格模型来检验从蛋白质结构中提取接触能的前提。给定一组预先指定的H-H、H-P和P-P接触能,详尽列举了各种长度的所有可能序列,以找到具有唯一最低能量构象的序列。利用宫泽-杰尼根程序和此处定义的参考态,使用这些(天然)序列的最低能量结构(或天然结构)来提取接触能。原始能量的相对大小得到了较好的恢复,但提取的接触能与初始能量的绝对值无关。鉴于蛋白质折叠的新理论框架,我们转向对天然序列能量景观的更详细表征。此类序列的可折叠性对预先指定能量的绝对值施加了两个限制:一个下限由热力学稳定性的最低要求决定,一个上限与链陷入局部最小值有关。我们发现这两个限制将预先指定的能量值限制在一个相当窄的范围内,令人惊讶的是,在所研究的所有情况下,该范围也包含提取的能量。这些结果表明,准化学近似可用于定量地将天然结构集合中各种残基-残基接触的出现与接触能联系起来。更重要的是,它们表明提取的接触能确实包含有关结构稳定性的信息,可用于估计实际的结构能量学。本研究还鼓励在穿线法中使用从结构推导的接触能。发现存在一个相当窄的能量范围对于折叠一个序列是最优的,这也警示了在最小折叠模型中使用任意能量哈密顿量的情况。