A statistical framework to evaluate virtual screening.

Author information

Zhao Wei, Hevener Kirk E, White Stephen W, Lee Richard E, Boyett James M

Affiliation

Department of Biostatistics, St Jude Children's Research Hospital, Memphis, TN, USA.

Publication information

BMC Bioinformatics. 2009 Jul 20;10:225. doi: 10.1186/1471-2105-10-225.

DOI:10.1186/1471-2105-10-225

PMID:19619306

Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC2722655/

Abstract

BACKGROUND

The receiver operating characteristic (ROC) curve is widely used to evaluate virtual screening (VS) studies. However, it fails to address the "early recognition" problem specific to VS. Although many other metrics that emphasize "early recognition", such as RIE, BEDROC, and pROC, have been proposed, there are no rigorous statistical guidelines for determining thresholds and performing significance tests. Nor have these metrics been compared under a statistical framework to better understand their performance.

RESULTS

We propose a statistical framework for evaluating VS studies, in which the threshold for deciding whether a ranking method is better than random ranking is derived by bootstrap simulations, and two ranking methods are compared by a permutation test. We found that different metrics emphasize "early recognition" differently. BEDROC and RIE are statistically equivalent metrics. Our newly proposed metric, SLR, is superior to pROC. Through extensive simulations, we observed a "seesaw effect": overemphasizing early recognition reduces the statistical power of a metric to detect true early recognition.
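The idea of a simulation-derived threshold can be illustrated with a minimal Python sketch. It is not the authors' implementation: the abstract does not specify the resampling scheme, so random ranking is simulated here by drawing the actives' ranks without replacement. The RIE metric is written as the mean exponential weight of the actives' ranks divided by its value under uniformly random ranks; the function names, `alpha=20`, the library size, and the example ranks are all illustrative assumptions.

```python
import math
import random


def rie(active_ranks, n_total, alpha=20.0):
    """RIE-style score: mean exponential weight of the actives' ranks,
    divided by the same mean taken over all ranks (its expectation when
    ranks are uniformly random). alpha=20 is an assumed setting."""
    def weight(rank):
        return math.exp(-alpha * rank / n_total)

    observed = sum(weight(r) for r in active_ranks) / len(active_ranks)
    expected = sum(weight(k) for k in range(1, n_total + 1)) / n_total
    return observed / expected


def null_threshold(metric, n_actives, n_total, type1_error=0.05,
                   n_sim=2000, seed=0):
    """Simulate random ranking and return the empirical
    (1 - type1_error) quantile of the metric under that null."""
    rng = random.Random(seed)
    all_ranks = range(1, n_total + 1)
    sims = sorted(
        metric(rng.sample(all_ranks, n_actives), n_total)
        for _ in range(n_sim)
    )
    idx = min(n_sim - 1, math.ceil((1.0 - type1_error) * n_sim) - 1)
    return sims[idx]


# Hypothetical screen: 10 actives in a library of 1000 compounds.
threshold = null_threshold(rie, n_actives=10, n_total=1000)
observed = rie([1, 2, 3, 5, 8, 13, 21, 34, 55, 89], 1000)
print("threshold:", threshold)
print("observed:", observed)
print("better than random at the 5% level:", observed > threshold)
```

Because `null_threshold` takes the metric as an argument, the same routine works for any score that can be computed from the actives' ranks, including ones whose exact null distribution is unknown.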

CONCLUSION

The statistical framework we developed and tested is applicable to any other metric as well, even if its exact distribution is unknown. Under this framework, a threshold can be easily selected according to a pre-specified type I error rate, and statistical comparisons between two ranking methods become possible. The theoretical null distribution of the SLR metric is available, so the SLR threshold can be determined exactly without resorting to bootstrap simulations, which makes it easy to use in practical virtual screening studies.
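A comparison between two ranking methods might be sketched as follows. The abstract does not describe the permutation scheme, so this example assumes a paired design: both methods rank the same actives, and under the null hypothesis each active's two ranks are exchangeable between the methods. The metric, the function names, and the rank lists are hypothetical.

```python
import math
import random


def rie(active_ranks, n_total, alpha=20.0):
    """RIE-style score: mean exponential weight of the actives' ranks,
    normalized by the same mean taken over all ranks."""
    def weight(rank):
        return math.exp(-alpha * rank / n_total)

    observed = sum(weight(r) for r in active_ranks) / len(active_ranks)
    expected = sum(weight(k) for k in range(1, n_total + 1)) / n_total
    return observed / expected


def permutation_test(ranks_a, ranks_b, n_total, metric,
                     n_perm=2000, seed=0):
    """Two-sided paired permutation test on metric(A) - metric(B).
    Each active's pair of ranks is swapped between the two methods
    with probability 0.5 (an assumed exchangeability scheme)."""
    rng = random.Random(seed)
    observed = metric(ranks_a, n_total) - metric(ranks_b, n_total)
    extreme = 0
    for _ in range(n_perm):
        perm_a, perm_b = [], []
        for rank_a, rank_b in zip(ranks_a, ranks_b):
            if rng.random() < 0.5:
                rank_a, rank_b = rank_b, rank_a
            perm_a.append(rank_a)
            perm_b.append(rank_b)
        diff = metric(perm_a, n_total) - metric(perm_b, n_total)
        if abs(diff) >= abs(observed):
            extreme += 1
    # The +1 terms count the observed labelling as one permutation.
    return observed, (extreme + 1) / (n_perm + 1)


# Hypothetical ranks of the same 10 actives under two methods.
method_a = [1, 2, 3, 5, 8, 13, 21, 34, 55, 89]
method_b = [120, 340, 560, 45, 700, 230, 810, 15, 400, 950]
diff, p_value = permutation_test(method_a, method_b, 1000, rie)
print("difference:", diff)
print("p-value:", p_value)
```

The test needs no distributional assumption about the metric, which is why it applies equally to RIE, BEDROC, pROC, or SLR.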

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6687/2722655/fce74f171c04/1471-2105-10-225-1.jpg

Similar articles

Evaluating virtual screening methods: good and bad metrics for the "early recognition" problem.

J Chem Inf Model. 2007 Mar-Apr;47(2):488-508. doi: 10.1021/ci600426e. Epub 2007 Feb 9.

Novel learning framework (knockoff technique) to evaluate metric ranking algorithms to describe human response to injury.

Traffic Inj Prev. 2018;19(sup2):S121-S126. doi: 10.1080/15389588.2018.1519805. Epub 2018 Dec 20.

A CROC stronger than ROC: measuring, visualizing and optimizing early retrieval.

Bioinformatics. 2010 May 15;26(10):1348-56. doi: 10.1093/bioinformatics/btq140. Epub 2010 Apr 7.

Rank order metrics for quantifying the association of sequence features with gene regulation.

Bioinformatics. 2003 Jan 22;19(2):212-8. doi: 10.1093/bioinformatics/19.2.212.

Work efficiency: a new criterion for comprehensive comparison and evaluation of statistical methods in large-scale identification of differentially expressed genes.

Genomics. 2011 Nov;98(5):390-9. doi: 10.1016/j.ygeno.2011.05.006. Epub 2011 Jun 30.

Differential correlation for sequencing data.

BMC Res Notes. 2017 Jan 19;10(1):54. doi: 10.1186/s13104-016-2331-9.

The power metric: a new statistically robust enrichment-type metric for virtual screening applications with early recovery capability.

J Cheminform. 2017 Feb 2;9:7. doi: 10.1186/s13321-016-0189-4. eCollection 2017.

Practical approach to determine sample size for building logistic prediction models using high-throughput data.

J Biomed Inform. 2015 Feb;53:355-62. doi: 10.1016/j.jbi.2014.12.010. Epub 2014 Dec 30.

Positive Predictive Value Surfaces as a Complementary Tool to Assess the Performance of Virtual Screening Methods.

Mini Rev Med Chem. 2020;20(14):1447-1460. doi: 10.2174/1871525718666200219130229.

Cited by

A versatile information retrieval framework for evaluating profile strength and similarity.

Nat Commun. 2025 Jun 4;16(1):5181. doi: 10.1038/s41467-025-60306-2.

Integrated Virtual Screening Approach Identifies New CYP19A1 Inhibitors.

J Chem Inf Model. 2025 Apr 14;65(7):3529-3543. doi: 10.1021/acs.jcim.5c00204. Epub 2025 Mar 19.

Protein profiling uncovers IGF-1R inhibition potential of 3-(2-furoyl)-indole scaffolds in hepatocellular carcinoma.

Future Med Chem. 2025 Mar;17(5):513-528. doi: 10.1080/17568919.2025.2467616. Epub 2025 Mar 3.

Identification and Evaluation of Natural Compounds as Potential Inhibitors of NS2B-NS3 Zika Virus Protease: A Computational Approach.

Mol Biotechnol. 2024 Dec 28. doi: 10.1007/s12033-024-01357-6.

Do Molecular Fingerprints Identify Diverse Active Drugs in Large-Scale Virtual Screening? (No).

Pharmaceuticals (Basel). 2024 Jul 26;17(8):992. doi: 10.3390/ph17080992.

Machine Learning Assisted Hit Prioritization for High Throughput Screening in Drug Discovery.

ACS Cent Sci. 2024 Mar 15;10(4):823-832. doi: 10.1021/acscentsci.3c01517. eCollection 2024 Apr 24.

A versatile information retrieval framework for evaluating profile strength and similarity.

bioRxiv. 2025 Mar 13:2024.04.01.587631. doi: 10.1101/2024.04.01.587631.

On the relevance of query definition in the performance of 3D ligand-based virtual screening.

J Comput Aided Mol Des. 2024 Apr 4;38(1):18. doi: 10.1007/s10822-024-00561-5.

Synergistic effect of potential alpha-amylase inhibitors from Egyptian propolis with acarbose using in silico and in vitro combination analysis.

BMC Complement Med Ther. 2024 Jan 30;24(1):65. doi: 10.1186/s12906-024-04348-x.

De Novo Prediction of Drug Targets and Candidates by Chemical Similarity-Guided Network-Based Inference.

Int J Mol Sci. 2022 Aug 26;23(17):9666. doi: 10.3390/ijms23179666.
