Duan Mo Jie, Zhou Yan Hong
Hubei Bioinformatics and Molecular Imaging Key Laboratory, Huazhong University of Science and Technology, Wuhan 430074, China.
Genomics Proteomics Bioinformatics. 2005 Nov;3(4):218-24. doi: 10.1016/s1672-0229(05)03030-5.
The three-dimensional (3D) structure prediction of proteins is an important task in bioinformatics. Finding energy functions that can better represent residue-residue and residue-solvent interactions is a crucial way to improve the prediction accuracy. The widely used contact energy functions mostly only consider the contact frequency between different types of residues; however, we find that the contact frequency also relates to the residue hydrophobic environment. Accordingly, we present an improved contact energy function to integrate the two factors, which can reflect the influence of hydrophobic interaction on the stabilization of protein 3D structure more effectively. Furthermore, a fold recognition (threading) approach based on this energy function is developed. The testing results obtained with 20 randomly selected proteins demonstrate that, compared with common contact energy functions, the proposed energy function can improve the accuracy of the fold template prediction from 20% to 50%, and can also improve the accuracy of the sequence-template alignment from 35% to 65%.
蛋白质的三维(3D)结构预测是生物信息学中的一项重要任务。寻找能够更好地表示残基-残基和残基-溶剂相互作用的能量函数是提高预测准确性的关键途径。广泛使用的接触能量函数大多只考虑不同类型残基之间的接触频率;然而,我们发现接触频率也与残基疏水环境有关。因此,我们提出了一种改进的接触能量函数来整合这两个因素,它可以更有效地反映疏水相互作用对蛋白质3D结构稳定性的影响。此外,基于该能量函数开发了一种折叠识别(穿线法)方法。对20个随机选择的蛋白质进行测试的结果表明,与常见的接触能量函数相比,所提出的能量函数可以将折叠模板预测的准确性从20%提高到50%,并且还可以将序列-模板比对的准确性从35%提高到65%。