Cao Hai-Bo, Wang Cai-Zhuang, Ho Kai-Ming
Department of Physics and Astronomy, Ames Laboratory-U.S. DOE, Iowa State University, Ames, Iowa 50011, USA.
Phys Rev E Stat Nonlin Soft Matter Phys. 2005 Aug;72(2 Pt 1):021907. doi: 10.1103/PhysRevE.72.021907. Epub 2005 Aug 22.
By an enumeration study, we show that the energy distributions of a lattice protein sequence on all possible compact lattice configurations can be approximated by the energy distribution of shuffled sequences on a given lattice structure. We also show that the random energy model (REM) gives a good analytical approximation for the energy distribution of shuffled sequences on lattice structures. For real proteins, when a gapped threading method is used, REM calculations systematically underestimate the mean value of the energy distributions. We found that this discrepancy can be roughly compensated by a linear correction obtained from empirical fits. This result can be used to greatly reduce the computational effort in protein threading calculations.
通过一项枚举研究,我们表明,晶格蛋白质序列在所有可能的紧凑晶格构型上的能量分布,可以由给定晶格结构上重排序列的能量分布来近似。我们还表明,随机能量模型(REM)能很好地解析近似晶格结构上重排序列的能量分布。对于真实蛋白质,当使用带空位的穿线法时,REM计算会系统性地低估能量分布的平均值。我们发现,这种差异可以通过经验拟合得到的线性校正大致补偿。这一结果可用于大幅减少蛋白质穿线计算中的计算量。