Teramoto Reiji, Fukunishi Hiroaki
Bio-IT Center and Nano Electronics Research Laboratories, NEC Corporation, 34, Miyukigaoka, Tsukuba, Ibaraki 305-8501, Japan.
J Chem Inf Model. 2008 Apr;48(4):747-54. doi: 10.1021/ci700464x. Epub 2008 Mar 5.
Since the evaluation of ligand conformations is a crucial aspect of structure-based virtual screening, scoring functions play significant roles in it. However, it is known that a scoring function does not always work well for all target proteins. When one cannot know which scoring function works best against a target protein a priori, there is no standard scoring method to know it even if 3D structure of a target protein-ligand complex is available. Therefore, development of the method to achieve high enrichments from given scoring functions and 3D structure of protein-ligand complex is a crucial and challenging task. To address this problem, we applied SCS (supervised consensus scoring), which employs a rough linear correlation between the binding free energy and the root-mean-square deviation (rmsd) of a native ligand conformations and incorporates protein-ligand binding process with docked ligand conformations using supervised learning, to virtual screening. We evaluated both the docking poses and enrichments of SCS and five scoring functions (F-Score, G-Score, D-Score, ChemScore, and PMF) for three different target proteins: thymidine kinase (TK), thrombin (thrombin), and peroxisome proliferator-activated receptor gamma (PPARgamma). Our enrichment studies show that SCS is competitive or superior to a best single scoring function at the top ranks of screened database. We found that the enrichments of SCS could be limited by a best scoring function, because SCS is obtained on the basis of the five individual scoring functions. Therefore, it is concluded that SCS works very successfully from our results. Moreover, from docking pose analysis, we revealed the connection between enrichment and average centroid distance of top-scored docking poses. Since SCS requires only one 3D structure of protein-ligand complex, SCS will be useful for identifying new ligands.
由于配体构象的评估是基于结构的虚拟筛选的关键环节,评分函数在其中起着重要作用。然而,众所周知,评分函数并非对所有靶蛋白都能始终良好发挥作用。当无法事先知晓哪种评分函数对靶蛋白效果最佳时,即便有靶蛋白 - 配体复合物的三维结构,也没有标准的评分方法来确定这一点。因此,开发一种能从给定的评分函数和蛋白质 - 配体复合物的三维结构中实现高富集度的方法是一项关键且具有挑战性的任务。为解决此问题,我们将监督一致性评分(SCS)应用于虚拟筛选,该方法利用天然配体构象的结合自由能与均方根偏差(rmsd)之间的粗略线性相关性,并通过监督学习将蛋白质 - 配体结合过程与对接的配体构象相结合。我们针对三种不同的靶蛋白:胸苷激酶(TK)、凝血酶(thrombin)和过氧化物酶体增殖物激活受体γ(PPARγ),评估了SCS以及五种评分函数(F - Score、G - Score、D - Score、ChemScore和PMF)的对接姿势和富集度。我们的富集度研究表明,在筛选数据库的前列排名中,SCS具有竞争力或优于最佳的单一评分函数。我们发现SCS的富集度可能会受到最佳评分函数的限制,因为SCS是基于五个单独的评分函数获得的。因此,从我们得到的结果可以得出结论,SCS非常成功。此外,通过对接姿势分析,我们揭示了富集度与高分对接姿势的平均质心距离之间的联系。由于SCS仅需要蛋白质 - 配体复合物的一个三维结构,SCS将有助于识别新的配体。