Mirzaie Mehdi
Department of Applied Mathematics, Faculty of Mathematical Sciences, Tarbiat Modares University, Jalal Ale Ahmad Highway, Tehran, Iran.
School of Biological Sciences, Institute for Research in Fundamental Sciences (IPM), Tehran, Iran.
Proteins. 2018 Apr;86(4):467-474. doi: 10.1002/prot.25466. Epub 2018 Feb 5.
Evaluation of protein structures needs a trustworthy potential function. Although several knowledge-based potential functions exist, the impact of different types of amino acids in the scoring functions has not been studied yet. Previously, we have reported the importance of nonlocal interactions in scoring function (based on Delaunay tessellation) in discrimination of native structures. Then, we have questioned the structural impact of hydrophobic amino acids in protein fold recognition. Therefore, a Hydrophobic Reduced Model (HRM) was designed to reduce protein structure of FS (Full Structure) into RS (Reduced Structure). RS is considered as a reduced structure of only seven hydrophobic amino acids (L, V, F, I, A, W, Y) and all their interactions. The presented model was evaluated via four different performance metrics including the number of correctly identified natives, the Z-score of the native energy, the RMSD of the minimum score, and the Pearson correlation coefficient between the energy and the model quality. Results indicated that only nonlocal interactions between hydrophobic amino acids could be sufficient and accurate enough for protein fold recognition. Interestingly, the results of HRM is significantly close to the model that considers all amino acids (20-amino acid model) to discriminate the native structure of the proteins on eleven decoy sets. This indicates that the power of knowledge-based potential functions in protein fold recognition is mostly due to hydrophobic interactions. Hence, we suggest combining a different well-designed scoring function for non-hydrophobic interactions with HRM to achieve better performance in fold recognition.
蛋白质结构评估需要一个可靠的势函数。尽管存在几种基于知识的势函数,但评分函数中不同类型氨基酸的影响尚未得到研究。此前,我们已经报道了非局部相互作用在基于德劳内三角剖分的评分函数中对天然结构识别的重要性。然后,我们质疑了疏水氨基酸在蛋白质折叠识别中的结构影响。因此,设计了一种疏水简化模型(HRM),将FS(全结构)的蛋白质结构简化为RS(简化结构)。RS被视为仅由七种疏水氨基酸(L、V、F、I、A、W、Y)及其所有相互作用组成的简化结构。通过四个不同的性能指标对所提出的模型进行了评估,包括正确识别的天然结构数量、天然能量的Z分数、最低分数的均方根偏差以及能量与模型质量之间的皮尔逊相关系数。结果表明,仅疏水氨基酸之间的非局部相互作用就足以且准确地用于蛋白质折叠识别。有趣的是,在十一个诱饵集上,HRM的结果与考虑所有氨基酸的模型(20氨基酸模型)在区分蛋白质天然结构方面的结果非常接近。这表明基于知识的势函数在蛋白质折叠识别中的能力主要归因于疏水相互作用。因此,我们建议将一种针对非疏水相互作用精心设计的不同评分函数与HRM相结合,以在折叠识别中获得更好的性能。