Biozentrum, University of Basel, SIB Swiss Institute of Bioinformatics, Klingelbergstrasse 50-70, Basel, Switzerland.
Proteins. 2011;79 Suppl 10(Suppl 10):126-36. doi: 10.1002/prot.23174. Epub 2011 Oct 11.
Interactions between proteins and their ligands play central roles in many physiological processes. The structural details for most of these interactions, however, have not yet been characterized experientially. Therefore, various computational tools have been developed to predict the location of binding sites and the amino acid residues interacting with ligands. In this manuscript, we assess the performance of 33 methods participating in the ligand-binding site prediction category in CASP9. The overall accuracy of ligand-binding site predictions in CASP9 appears rather high (average Matthews correlation coefficient of 0.62 for the 10 top performing groups) and compared to previous experiments more groups performed equally well. However, this should be seen in context of a strong bias in the test data toward easy template-based models. Overall, the top performing methods have converged to a similar approach using ligand-binding site inference from related homologous structures, which limits their applicability for difficult de novo prediction targets. Here, we present the results of the CASP9 assessment of the ligand-binding site category, discuss examples for successful and challenging prediction targets in CASP9, and finally suggest changes in the format of the experiment to overcome the current limitations of the assessment.
蛋白质与其配体之间的相互作用在许多生理过程中起着核心作用。然而,这些相互作用的大多数结构细节尚未通过实验来描述。因此,已经开发了各种计算工具来预测结合位点的位置和与配体相互作用的氨基酸残基。在本文中,我们评估了在 CASP9 中参与配体结合位点预测类别的 33 种方法的性能。CASP9 中配体结合位点预测的整体准确性似乎相当高(10 个表现最佳的组的平均 Matthews 相关系数为 0.62),与之前的实验相比,更多的组表现相当出色。然而,这应该结合测试数据中对基于模板的简单模型的强烈偏向来看待。总体而言,表现最佳的方法已经采用了从相关同源结构推断配体结合位点的类似方法,这限制了它们在困难的从头预测目标中的适用性。在这里,我们介绍了 CASP9 对配体结合位点类别的评估结果,讨论了 CASP9 中成功和具有挑战性的预测目标的示例,并最终提出了改变实验格式的建议,以克服目前评估的局限性。