Zhang Chi, Liu Song, Zhou Hongyi, Zhou Yaoqi
Department of Physiology and Biophysics, State University of New York at Buffalo, Buffalo, New York 14214, USA.
Biophys J. 2004 Jun;86(6):3349-58. doi: 10.1529/biophysj.103.035998.
An accurate statistical energy function that is suitable for the prediction of protein structures of all classes should be independent of the structural database used for energy extraction. Here, two high-resolution, low-sequence-identity structural databases of 333 alpha-proteins and 271 beta-proteins were built for examining the database dependence of three all-atom statistical energy functions. They are RAPDF (residue-specific all-atom conditional probability discriminatory function), atomic KBP (atomic knowledge-based potential), and DFIRE (statistical potential based on distance-scaled finite ideal-gas reference state). These energy functions differ in the reference states used for energy derivation. The energy functions extracted from the different structural databases are used to select native structures from multiple decoys of 64 alpha-proteins and 28 beta-proteins. The performance in native structure selections indicates that the DFIRE-based energy function is mostly independent of the structural database whereas RAPDF and KBP have a significant dependence. The construction of two additional structural databases of alpha/beta and alpha + beta-proteins further confirmed the weak dependence of DFIRE on the structural databases of various structural classes. The possible source for the difference between the three all-atom statistical energy functions is that the physical reference state of ideal gas used in the DFIRE-based energy function is least dependent on the structural database.
一个适用于预测所有类型蛋白质结构的精确统计能量函数应独立于用于提取能量的结构数据库。在此,构建了两个包含333个α-蛋白和271个β-蛋白的高分辨率、低序列同一性的结构数据库,用于检验三种全原子统计能量函数对数据库的依赖性。它们分别是RAPDF(残基特异性全原子条件概率判别函数)、原子KBP(基于原子知识的势)和DFIRE(基于距离缩放有限理想气体参考态的统计势)。这些能量函数在用于推导能量的参考态方面存在差异。从不同结构数据库中提取的能量函数用于从64个α-蛋白和28个β-蛋白的多个诱饵结构中选择天然结构。天然结构选择的性能表明,基于DFIRE的能量函数大多独立于结构数据库,而RAPDF和KBP则有显著依赖性。另外构建的α/β和α + β-蛋白的两个结构数据库进一步证实了DFIRE对各种结构类型的结构数据库的弱依赖性。三种全原子统计能量函数之间差异的可能来源是,基于DFIRE的能量函数中使用的理想气体物理参考态对结构数据库的依赖性最小。