比较蛋白质-配体对接程序很困难。

Comparing protein-ligand docking programs is difficult.

作者信息

Cole Jason C, Murray Christopher W, Nissink J Willem M, Taylor Richard D, Taylor Robin

机构信息

Cambridge Crystallographic Data Centre, Cambridge, United Kingdom.

出版信息

Proteins. 2005 Aug 15;60(3):325-32. doi: 10.1002/prot.20497.

DOI:10.1002/prot.20497

PMID:15937897

Abstract

There is currently great interest in comparing protein-ligand docking programs. A review of recent comparisons shows that it is difficult to draw conclusions of general applicability. Statistical hypothesis testing is required to ensure that differences in pose-prediction success rates and enrichment rates are significant. Numerical measures such as root-mean-square deviation need careful interpretation and may profitably be supplemented by interaction-based measures and visual inspection of dockings. Test sets must be of appropriate diversity and of good experimental reliability. The effects of crystal-packing interactions may be important. The method used for generating starting ligand geometries and positions may have an appreciable effect on docking results. For fair comparison, programs must be given search problems of equal complexity (e.g. binding-site regions of the same size) and approximately equal time in which to solve them. Comparisons based on rescoring require local optimization of the ligand in the space of the new objective function. Re-implementations of published scoring functions may give significantly different results from the originals. Ostensibly minor details in methodology may have a profound influence on headline success rates.

摘要

目前，人们对比较蛋白质-配体对接程序有着浓厚的兴趣。对近期比较研究的综述表明，很难得出具有普遍适用性的结论。需要进行统计假设检验，以确保构象预测成功率和富集率的差异具有显著性。诸如均方根偏差等数值度量需要仔细解读，并且可以有益地辅以基于相互作用的度量和对接的可视化检查。测试集必须具有适当的多样性和良好的实验可靠性。晶体堆积相互作用的影响可能很重要。用于生成起始配体几何形状和位置的方法可能对对接结果有显著影响。为了进行公平比较，必须给程序提供复杂度相同的搜索问题（例如相同大小的结合位点区域），并给予它们大致相同的求解时间。基于重新评分的比较需要在新目标函数的空间中对配体进行局部优化。已发表评分函数的重新实现可能会给出与原始结果显著不同的结果。方法学中看似微小的细节可能会对总体成功率产生深远影响。