阳性预测值曲面作为评估虚拟筛选方法性能的补充工具。

Laboratory of Bioactive Research and Development (LIDeB), Department of Biological Sciences, Faculty of Exact Sciences, University of La Plata (UNLP) - 47 & 115, La Plata (1900), Buenos Aires, Argentina.

Mini Rev Med Chem. 2020;20(14):1447-1460. doi: 10.2174/1871525718666200219130229.

BACKGROUND

Since their introduction in the virtual screening field, Receiver Operating Characteristic (ROC) curve-derived metrics have been widely used for benchmarking of computational methods and algorithms intended for virtual screening applications. Whereas in classification problems, the ratio between sensitivity and specificity for a given score value is very informative, a practical concern in virtual screening campaigns is to predict the actual probability that a predicted hit will prove truly active when submitted to experimental testing (in other words, the Positive Predictive Value - PPV). Estimation of such probability is however, obstructed due to its dependency on the yield of actives of the screened library, which cannot be known a priori.

OBJECTIVE

To explore the use of PPV surfaces derived from simulated ranking experiments (retrospective virtual screening) as a complementary tool to ROC curves, for both benchmarking and optimization of score cutoff values.

METHODS

The utility of the proposed approach is assessed in retrospective virtual screening experiments with four datasets used to infer QSAR classifiers: inhibitors of Trypanosoma cruzi trypanothione synthetase; inhibitors of Trypanosoma brucei N-myristoyltransferase; inhibitors of GABA transaminase and anticonvulsant activity in the 6 Hz seizure model.

RESULTS

Besides illustrating the utility of PPV surfaces to compare the performance of machine learning models for virtual screening applications and to select an adequate score threshold, our results also suggest that ensemble learning provides models with better predictivity and more robust behavior.

CONCLUSION

PPV surfaces are valuable tools to assess virtual screening tools and choose score thresholds to be applied in prospective in silico screens. Ensemble learning approaches seem to consistently lead to improved predictivity and robustness.

背景

自引入虚拟筛选领域以来，接收器操作特征（ROC）曲线衍生的指标已被广泛用于基准测试计算方法和算法，旨在用于虚拟筛选应用。虽然在分类问题中，给定得分值的敏感性和特异性之间的比值非常有信息量，但虚拟筛选活动中的一个实际问题是预测预测命中物在提交实验测试时实际具有活性的概率（换句话说，阳性预测值 - PPV）。然而，由于其依赖于筛选库中活性物质的产量，因此无法事先知道，因此无法估计这种概率。

目的

探索使用从模拟排序实验（回顾性虚拟筛选）中得出的 PPV 曲面作为 ROC 曲线的补充工具，用于基准测试和优化得分截止值。

方法

在四个数据集的回顾性虚拟筛选实验中评估了所提出方法的实用性，这些数据集用于推断 QSAR 分类器：克氏锥虫 trypanothione 合成酶抑制剂；布氏锥虫 N-豆蔻酰转移酶抑制剂；GABA 转氨酶抑制剂和 6 Hz 惊厥模型中的抗惊厥活性。

结果

除了说明 PPV 曲面可用于比较用于虚拟筛选应用的机器学习模型的性能并选择适当的得分阈值外，我们的结果还表明，集成学习提供了具有更好预测性和更稳健行为的模型。

结论

PPV 曲面是评估虚拟筛选工具和选择要应用于前瞻性计算机筛选的得分阈值的有价值的工具。集成学习方法似乎始终可以提高预测性和稳健性。

相似文献

Positive Predictive Value Surfaces as a Complementary Tool to Assess the Performance of Virtual Screening Methods.

Mini Rev Med Chem. 2020;20(14):1447-1460. doi: 10.2174/1871525718666200219130229.

Ensemble learning application to discover new trypanothione synthetase inhibitors.

Mol Divers. 2021 Aug;25(3):1361-1373. doi: 10.1007/s11030-021-10265-9. Epub 2021 Jul 15.

Evaluation of QSAR Equations for Virtual Screening.

Int J Mol Sci. 2020 Oct 22;21(21):7828. doi: 10.3390/ijms21217828.

Systematic Comparison of the Performance of Different 2D and 3D Ligand-Based Virtual Screening Methodologies to Discover Anticonvulsant Drugs.

Comb Chem High Throughput Screen. 2015;18(4):387-98. doi: 10.2174/1386207318666150305151420.

Integrated machine learning, molecular docking and 3D-QSAR based approach for identification of potential inhibitors of trypanosomal N-myristoyltransferase.

Mol Biosyst. 2016 Nov 15;12(12):3711-3723. doi: 10.1039/c6mb00574h.

Quantitative structure-activity relationship models for compounds with anticonvulsant activity.

Expert Opin Drug Discov. 2019 Jul;14(7):653-665. doi: 10.1080/17460441.2019.1613368. Epub 2019 May 10.

Machine learning applications for the prediction of surgical site infection in neurological operations.

Neurosurg Focus. 2019 Aug 1;47(2):E7. doi: 10.3171/2019.5.FOCUS19241.

ALADDIN: Docking Approach Augmented by Machine Learning for Protein Structure Selection Yields Superior Virtual Screening Performance.

Mol Inform. 2020 Apr;39(4):e1900103. doi: 10.1002/minf.201900103. Epub 2019 Nov 8.

Validating the validation: reanalyzing a large-scale comparison of deep learning and machine learning models for bioactivity prediction.

J Comput Aided Mol Des. 2020 Jul;34(7):717-730. doi: 10.1007/s10822-019-00274-0. Epub 2020 Jan 20.

Applications of Quantitative Structure-Activity Relationships (QSAR) based Virtual Screening in Drug Design: A Review.

Mini Rev Med Chem. 2020;20(14):1375-1388. doi: 10.2174/1389557520666200429102334.

引用本文的文献

Multi-target Phenylpropanoids Against Epilepsy.

Curr Neuropharmacol. 2024;22(13):2168-2190. doi: 10.2174/1570159X22666240524160126.

GABA-transaminase: A Key Player and Potential Therapeutic Target for Neurological Disorders.

Cent Nerv Syst Agents Med Chem. 2024;24(1):57-67. doi: 10.2174/0118715249267700231116053516.

Suppr 超能文献

核心技术专利：CN118964589B侵权必究

相似文献

Positive Predictive Value Surfaces as a Complementary Tool to Assess the Performance of Virtual Screening Methods.

Mini Rev Med Chem. 2020;20(14):1447-1460. doi: 10.2174/1871525718666200219130229.

Ensemble learning application to discover new trypanothione synthetase inhibitors.

Mol Divers. 2021 Aug;25(3):1361-1373. doi: 10.1007/s11030-021-10265-9. Epub 2021 Jul 15.

Evaluation of QSAR Equations for Virtual Screening.

Int J Mol Sci. 2020 Oct 22;21(21):7828. doi: 10.3390/ijms21217828.

Systematic Comparison of the Performance of Different 2D and 3D Ligand-Based Virtual Screening Methodologies to Discover Anticonvulsant Drugs.

Comb Chem High Throughput Screen. 2015;18(4):387-98. doi: 10.2174/1386207318666150305151420.

Integrated machine learning, molecular docking and 3D-QSAR based approach for identification of potential inhibitors of trypanosomal N-myristoyltransferase.

Mol Biosyst. 2016 Nov 15;12(12):3711-3723. doi: 10.1039/c6mb00574h.

Quantitative structure-activity relationship models for compounds with anticonvulsant activity.

Expert Opin Drug Discov. 2019 Jul;14(7):653-665. doi: 10.1080/17460441.2019.1613368. Epub 2019 May 10.

Machine learning applications for the prediction of surgical site infection in neurological operations.

Neurosurg Focus. 2019 Aug 1;47(2):E7. doi: 10.3171/2019.5.FOCUS19241.

ALADDIN: Docking Approach Augmented by Machine Learning for Protein Structure Selection Yields Superior Virtual Screening Performance.

Mol Inform. 2020 Apr;39(4):e1900103. doi: 10.1002/minf.201900103. Epub 2019 Nov 8.

Validating the validation: reanalyzing a large-scale comparison of deep learning and machine learning models for bioactivity prediction.

J Comput Aided Mol Des. 2020 Jul;34(7):717-730. doi: 10.1007/s10822-019-00274-0. Epub 2020 Jan 20.

Applications of Quantitative Structure-Activity Relationships (QSAR) based Virtual Screening in Drug Design: A Review.

Mini Rev Med Chem. 2020;20(14):1375-1388. doi: 10.2174/1389557520666200429102334.

引用本文的文献

Multi-target Phenylpropanoids Against Epilepsy.

Curr Neuropharmacol. 2024;22(13):2168-2190. doi: 10.2174/1570159X22666240524160126.

GABA-transaminase: A Key Player and Potential Therapeutic Target for Neurological Disorders.

Cent Nerv Syst Agents Med Chem. 2024;24(1):57-67. doi: 10.2174/0118715249267700231116053516.

Positive Predictive Value Surfaces as a Complementary Tool to Assess the Performance of Virtual Screening Methods.

机构信息

出版信息

BACKGROUND

OBJECTIVE

METHODS

RESULTS

CONCLUSION

背景

目的

方法

结果

结论

相似文献

引用本文的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

相似文献

引用本文的文献