Zhang Jinfeng, Chen Rong, Liang Jie
Department of Bioengineering, University of Illinois, Chicago, Illinois, USA.
Proteins. 2006 Jun 1;63(4):949-60. doi: 10.1002/prot.20809.
An effective potential function is critical for protein structure prediction and folding simulation. Simplified protein models such as those requiring only Calpha or backbone atoms are attractive because they enable efficient search of the conformational space. We show residue-specific reduced discrete-state models can represent the backbone conformations of proteins with small RMSD values. However, no potential functions exist that are designed for such simplified protein models. In this study, we develop optimal potential functions by combining contact interaction descriptors and local sequence-structure descriptors. The form of the potential function is a weighted linear sum of all descriptors, and the optimal weight coefficients are obtained through optimization using both native and decoy structures. The performance of the potential function in a test of discriminating native protein structures from decoys is evaluated using several benchmark decoy sets. Our potential function requiring only backbone atoms or Calpha atoms have comparable or better performance than several residue-based potential functions that require additional coordinates of side-chain centers or coordinates of all side-chain atoms. By reducing the residue alphabets down to size 10 for contact descriptors, the performance of the potential function can be further improved. Our results also suggest that local sequence-structure correlation may play important role in reducing the entropic cost of protein folding.
有效势能函数对于蛋白质结构预测和折叠模拟至关重要。诸如仅需要Cα或主链原子的简化蛋白质模型很有吸引力,因为它们能够高效搜索构象空间。我们表明,特定残基的简化离散状态模型能够以较小的均方根偏差(RMSD)值表示蛋白质的主链构象。然而,目前还没有专门为这种简化蛋白质模型设计的势能函数。在本研究中,我们通过结合接触相互作用描述符和局部序列-结构描述符来开发最优势能函数。势能函数的形式是所有描述符的加权线性和,最优权重系数通过使用天然结构和诱饵结构进行优化获得。使用几个基准诱饵集评估该势能函数在区分天然蛋白质结构和诱饵结构测试中的性能。我们仅需要主链原子或Cα原子的势能函数与几种需要侧链中心的额外坐标或所有侧链原子坐标的基于残基的势能函数相比,具有相当或更好的性能。通过将接触描述符的残基字母表减少到10种,势能函数的性能可以进一步提高。我们的结果还表明,局部序列-结构相关性可能在降低蛋白质折叠的熵成本方面发挥重要作用。