Suppr超能文献

阳性预测值曲面作为评估虚拟筛选方法性能的补充工具。

Positive Predictive Value Surfaces as a Complementary Tool to Assess the Performance of Virtual Screening Methods.

机构信息

Laboratory of Bioactive Research and Development (LIDeB), Department of Biological Sciences, Faculty of Exact Sciences, University of La Plata (UNLP) - 47 & 115, La Plata (1900), Buenos Aires, Argentina.

出版信息

Mini Rev Med Chem. 2020;20(14):1447-1460. doi: 10.2174/1871525718666200219130229.

Abstract

BACKGROUND

Since their introduction in the virtual screening field, Receiver Operating Characteristic (ROC) curve-derived metrics have been widely used for benchmarking of computational methods and algorithms intended for virtual screening applications. Whereas in classification problems, the ratio between sensitivity and specificity for a given score value is very informative, a practical concern in virtual screening campaigns is to predict the actual probability that a predicted hit will prove truly active when submitted to experimental testing (in other words, the Positive Predictive Value - PPV). Estimation of such probability is however, obstructed due to its dependency on the yield of actives of the screened library, which cannot be known a priori.

OBJECTIVE

To explore the use of PPV surfaces derived from simulated ranking experiments (retrospective virtual screening) as a complementary tool to ROC curves, for both benchmarking and optimization of score cutoff values.

METHODS

The utility of the proposed approach is assessed in retrospective virtual screening experiments with four datasets used to infer QSAR classifiers: inhibitors of Trypanosoma cruzi trypanothione synthetase; inhibitors of Trypanosoma brucei N-myristoyltransferase; inhibitors of GABA transaminase and anticonvulsant activity in the 6 Hz seizure model.

RESULTS

Besides illustrating the utility of PPV surfaces to compare the performance of machine learning models for virtual screening applications and to select an adequate score threshold, our results also suggest that ensemble learning provides models with better predictivity and more robust behavior.

CONCLUSION

PPV surfaces are valuable tools to assess virtual screening tools and choose score thresholds to be applied in prospective in silico screens. Ensemble learning approaches seem to consistently lead to improved predictivity and robustness.

摘要

背景

自引入虚拟筛选领域以来,接收器操作特征(ROC)曲线衍生的指标已被广泛用于基准测试计算方法和算法,旨在用于虚拟筛选应用。虽然在分类问题中,给定得分值的敏感性和特异性之间的比值非常有信息量,但虚拟筛选活动中的一个实际问题是预测预测命中物在提交实验测试时实际具有活性的概率(换句话说,阳性预测值 - PPV)。然而,由于其依赖于筛选库中活性物质的产量,因此无法事先知道,因此无法估计这种概率。

目的

探索使用从模拟排序实验(回顾性虚拟筛选)中得出的 PPV 曲面作为 ROC 曲线的补充工具,用于基准测试和优化得分截止值。

方法

在四个数据集的回顾性虚拟筛选实验中评估了所提出方法的实用性,这些数据集用于推断 QSAR 分类器:克氏锥虫 trypanothione 合成酶抑制剂;布氏锥虫 N-豆蔻酰转移酶抑制剂;GABA 转氨酶抑制剂和 6 Hz 惊厥模型中的抗惊厥活性。

结果

除了说明 PPV 曲面可用于比较用于虚拟筛选应用的机器学习模型的性能并选择适当的得分阈值外,我们的结果还表明,集成学习提供了具有更好预测性和更稳健行为的模型。

结论

PPV 曲面是评估虚拟筛选工具和选择要应用于前瞻性计算机筛选的得分阈值的有价值的工具。集成学习方法似乎始终可以提高预测性和稳健性。

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验