Laboratory of Bioactive Research and Development (LIDeB), Department of Biological Sciences, Faculty of Exact Sciences, University of La Plata (UNLP) - 47 & 115, La Plata (1900), Buenos Aires, Argentina.
Mini Rev Med Chem. 2020;20(14):1447-1460. doi: 10.2174/1871525718666200219130229.
Since their introduction in the virtual screening field, Receiver Operating Characteristic (ROC) curve-derived metrics have been widely used for benchmarking of computational methods and algorithms intended for virtual screening applications. Whereas in classification problems, the ratio between sensitivity and specificity for a given score value is very informative, a practical concern in virtual screening campaigns is to predict the actual probability that a predicted hit will prove truly active when submitted to experimental testing (in other words, the Positive Predictive Value - PPV). Estimation of such probability is however, obstructed due to its dependency on the yield of actives of the screened library, which cannot be known a priori.
To explore the use of PPV surfaces derived from simulated ranking experiments (retrospective virtual screening) as a complementary tool to ROC curves, for both benchmarking and optimization of score cutoff values.
The utility of the proposed approach is assessed in retrospective virtual screening experiments with four datasets used to infer QSAR classifiers: inhibitors of Trypanosoma cruzi trypanothione synthetase; inhibitors of Trypanosoma brucei N-myristoyltransferase; inhibitors of GABA transaminase and anticonvulsant activity in the 6 Hz seizure model.
Besides illustrating the utility of PPV surfaces to compare the performance of machine learning models for virtual screening applications and to select an adequate score threshold, our results also suggest that ensemble learning provides models with better predictivity and more robust behavior.
PPV surfaces are valuable tools to assess virtual screening tools and choose score thresholds to be applied in prospective in silico screens. Ensemble learning approaches seem to consistently lead to improved predictivity and robustness.
自引入虚拟筛选领域以来,接收器操作特征(ROC)曲线衍生的指标已被广泛用于基准测试计算方法和算法,旨在用于虚拟筛选应用。虽然在分类问题中,给定得分值的敏感性和特异性之间的比值非常有信息量,但虚拟筛选活动中的一个实际问题是预测预测命中物在提交实验测试时实际具有活性的概率(换句话说,阳性预测值 - PPV)。然而,由于其依赖于筛选库中活性物质的产量,因此无法事先知道,因此无法估计这种概率。
探索使用从模拟排序实验(回顾性虚拟筛选)中得出的 PPV 曲面作为 ROC 曲线的补充工具,用于基准测试和优化得分截止值。
在四个数据集的回顾性虚拟筛选实验中评估了所提出方法的实用性,这些数据集用于推断 QSAR 分类器:克氏锥虫 trypanothione 合成酶抑制剂;布氏锥虫 N-豆蔻酰转移酶抑制剂;GABA 转氨酶抑制剂和 6 Hz 惊厥模型中的抗惊厥活性。
除了说明 PPV 曲面可用于比较用于虚拟筛选应用的机器学习模型的性能并选择适当的得分阈值外,我们的结果还表明,集成学习提供了具有更好预测性和更稳健行为的模型。
PPV 曲面是评估虚拟筛选工具和选择要应用于前瞻性计算机筛选的得分阈值的有价值的工具。集成学习方法似乎始终可以提高预测性和稳健性。