Tobi D, Shafran G, Linial N, Elber R
Department of Biological Chemistry, The Hebrew University, Givat Ram, Jerusalem, Israel.
Proteins. 2000 Jul 1;40(1):71-85. doi: 10.1002/(sici)1097-0134(20000701)40:1<71::aid-prot90>3.0.co;2-3.
Pairwise interaction models to recognize native folds are designed and analyzed. Different sets of parameters are considered but the focus was on 20 x 20 contact matrices. Simultaneous solution of inequalities and minimization of the variance of the energy find matrices that recognize exactly the native folds of 572 sequences and structures from the protein data bank (PDB). The set includes many homologous pairs, which present a difficult recognition problem. Significant recognition ability is recovered with a small number of parameters (e.g., the H/P model). However, full recognition requires a complete set of amino acids. In addition to structures from the PDB, a folding program (MONSSTER) was used to generate decoy structures for 75 proteins. It is impossible to recognize all the native structures of the extended set by contact potentials. We therefore searched for a new functional form. An energy function U, which is based on a sum of general pairwise interactions limited to a resolution of 1 angstrom, is considered. This set was infeasible too. We therefore conjecture that it is not possible to find a folding potential, resolved to 1 angstrom, which is a sum of pair interactions.
设计并分析了用于识别天然折叠的成对相互作用模型。考虑了不同的参数集,但重点是20×20的接触矩阵。通过同时求解不等式和最小化能量方差,找到了能够准确识别蛋白质数据库(PDB)中572个序列和结构的天然折叠的矩阵。该集合包括许多同源对,这带来了一个困难的识别问题。通过少量参数(例如H/P模型)可以恢复显著的识别能力。然而,完全识别需要完整的氨基酸集。除了PDB中的结构外,还使用了一个折叠程序(MONSSTER)为75种蛋白质生成诱饵结构。通过接触势不可能识别扩展集合中的所有天然结构。因此,我们寻找一种新的函数形式。考虑了一种基于一般成对相互作用之和且限于1埃分辨率的能量函数U。这组也不可行。因此,我们推测不可能找到一个分辨率为1埃的折叠势,它是成对相互作用的总和。