Bahadur Ranjit Prasad, Chakrabarti Pinak
Department of Biochemistry, Bose Institute, P-1/12 CIT Scheme VIIM, Calcutta 700 054, India.
BMC Struct Biol. 2009 Dec 28;9:76. doi: 10.1186/1472-6807-9-76.
Setting the rules for the identification of a stable conformation of a protein is of utmost importance for the efficient generation of structures in computer simulation. For structure prediction, a considerable number of possible models are generated from which the best model has to be selected.
Two scoring functions, Rs and Rp, based on the consideration of packing of residues, which indicate if the conformation of an amino acid sequence is native-like, are presented. These are defined using the solvent accessible surface area (ASA) and the partner number (PN) (other residues that are within 4.5 A) of a particular residue. The two functions evaluate the deviation from the average packing properties (ASA or PN) of all residues in a polypeptide chain corresponding to a model of its three-dimensional structure. While simple in concept and computationally less intensive, both the functions are at least as efficient as any other energy functions in discriminating the native structure from decoys in a large number of standard decoy sets, as well as on models submitted for the targets of CASP7. Rs appears to be slightly more effective than Rp, as determined by the number of times the native structure possesses the minimum value for the function and its separation from the average value for the decoys.
Two parameters, Rs and Rp, are discussed that can very efficiently recognize the native fold for a sequence from an ensemble of decoy structures. Unlike many other algorithms that rely on the use of composite scoring function, these are based on a single parameter, viz., the accessible surface area (or the number of residues in contact), but still able to capture the essential attribute of the native fold.
为蛋白质稳定构象的识别设定规则对于计算机模拟中高效生成结构至关重要。对于结构预测,会生成大量可能的模型,必须从中选择最佳模型。
提出了基于残基堆积考虑的两个评分函数Rs和Rp,它们可指示氨基酸序列的构象是否类似天然构象。这两个函数是使用特定残基的溶剂可及表面积(ASA)和配体数量(PN)(距离在4.5埃以内的其他残基)来定义的。这两个函数评估与三维结构模型相对应的多肽链中所有残基的平均堆积特性(ASA或PN)的偏差。虽然这两个函数概念简单且计算强度较小,但在从大量标准诱饵集中区分天然结构和诱饵结构以及在为CASP7目标提交的模型上,它们至少与任何其他能量函数一样有效。根据天然结构具有该函数最小值的次数及其与诱饵结构平均值的差异确定,Rs似乎比Rp稍有效。
讨论了两个参数Rs和Rp,它们可以非常有效地从一组诱饵结构中识别序列的天然折叠。与许多依赖于使用复合评分函数的其他算法不同,这些参数基于单个参数,即可及表面积(或接触的残基数量),但仍能够捕捉天然折叠的基本属性。