Dipartimento di Scienze Farmaceutiche, Università degli Studi di Milano, Via Mangiagalli, 25, I-20133 Milano, Italy.
Int J Mol Sci. 2019 Apr 26;20(9):2060. doi: 10.3390/ijms20092060.
The study proposes a novel consensus strategy based on linear combinations of different docking scores to be used in the evaluation of virtual screening campaigns. The consensus models are generated by applying the recently proposed Enrichment Factor Optimization (EFO) method, which develops the linear equations by exhaustively combining the available docking scores and by optimizing the resulting enrichment factors. The performances of such a consensus strategy were evaluated by simulating the entire Directory of Useful Decoys (DUD datasets). In detail, the poses were initially generated by the PLANTS docking program and then rescored by ReScore+ with and without the minimization of the complexes. The so calculated scores were then used to generate the mentioned consensus models including two or three different scoring functions. The reliability of the generated models was assessed by a per target validation as performed by default by the EFO approach. The encouraging performances of the here proposed consensus strategy are emphasized by the average increase of the 17% in the Top 1% enrichment factor (EF) values when comparing the single best score with the linear combination of three scores. Specifically, kinases offer a truly convincing demonstration of the efficacy of the here proposed consensus strategy since their Top 1% EF average ranges from 6.4 when using the single best performing primary score to 23.5 when linearly combining scoring functions. The beneficial effects of this consensus approach are clearly noticeable even when considering the entire DUD datasets as evidenced by the area under the curve (AUC) averages revealing a 14% increase when combining three scores. The reached AUC values compare very well with those reported in literature by an extended set of recent benchmarking studies and the three-variable models afford the highest AUC average.
该研究提出了一种新的共识策略,基于不同对接分数的线性组合,用于评估虚拟筛选活动。共识模型是通过应用最近提出的富集因子优化(EFO)方法生成的,该方法通过详尽地组合可用的对接分数并优化得到的富集因子来开发线性方程。通过模拟整个有用的诱饵目录(DUD 数据集)来评估这种共识策略的性能。具体来说,首先由 PLANTS 对接程序生成构象,然后使用 ReScore+ 进行再评分,同时可以选择不进行复合物的最小化。然后使用这些计算得到的分数生成上述共识模型,包括两种或三种不同的打分函数。生成模型的可靠性通过 EFO 方法默认执行的每个靶标验证进行评估。当将单个最佳得分与三个得分的线性组合进行比较时,所提出的共识策略的令人鼓舞的性能通过平均增加 17%的前 1%富集因子(EF)值来强调。具体来说,激酶提供了对所提出的共识策略有效性的真正令人信服的证明,因为它们的前 1%EF 平均值从使用单个表现最佳的原始分数时的 6.4 变化到线性组合打分函数时的 23.5。即使考虑整个 DUD 数据集,这种共识方法的有益效果也很明显,正如曲线下面积(AUC)平均值所证明的那样,当组合三个分数时,AUC 增加了 14%。所达到的 AUC 值与文献中通过一组扩展的最新基准研究报告的 AUC 值非常吻合,并且三变量模型提供了最高的 AUC 平均值。