3D虚拟筛选方案的性能评估：均方根偏差比较、富集评估及诱饵选择——我们能从早期的错误中学到什么？

Evaluation of the performance of 3D virtual screening protocols: RMSD comparisons, enrichment assessments, and decoy selection--what can we learn from earlier mistakes?

作者信息

Kirchmair Johannes, Markt Patrick, Distinto Simona, Wolber Gerhard, Langer Thierry

机构信息

Inte:Ligand Software-Entwicklungs- und Consulting GmbH, Clemens Maria Hofbauer-Gasse 6, 2344, Maria Enzersdorf, Austria.

出版信息

J Comput Aided Mol Des. 2008 Mar-Apr;22(3-4):213-28. doi: 10.1007/s10822-007-9163-6. Epub 2008 Jan 15.

DOI:10.1007/s10822-007-9163-6

PMID:18196462

Abstract

Within the last few years a considerable amount of evaluative studies has been published that investigate the performance of 3D virtual screening approaches. Thereby, in particular assessments of protein-ligand docking are facing remarkable interest in the scientific community. However, comparing virtual screening approaches is a non-trivial task. Several publications, especially in the field of molecular docking, suffer from shortcomings that are likely to affect the significance of the results considerably. These quality issues often arise from poor study design, biasing, by using improper or inexpressive enrichment descriptors, and from errors in interpretation of the data output. In this review we analyze recent literature evaluating 3D virtual screening methods, with focus on molecular docking. We highlight problematic issues and provide guidelines on how to improve the quality of computational studies. Since 3D virtual screening protocols are in general assessed by their ability to discriminate between active and inactive compounds, we summarize the impact of the composition and preparation of test sets on the outcome of evaluations. Moreover, we investigate the significance of both classic enrichment parameters and advanced descriptors for the performance of 3D virtual screening methods. Furthermore, we review the significance and suitability of RMSD as a measure for the accuracy of protein-ligand docking algorithms and of conformational space sub sampling algorithms.

摘要

在过去几年里，已经发表了大量评估研究，这些研究调查了三维虚拟筛选方法的性能。因此，蛋白质-配体对接的特别评估在科学界引起了极大的兴趣。然而，比较虚拟筛选方法并非易事。一些出版物，特别是在分子对接领域，存在可能严重影响结果显著性的缺点。这些质量问题往往源于研究设计不佳、偏差（通过使用不当或无表现力的富集描述符）以及数据输出解释中的错误。在本综述中，我们分析了近期评估三维虚拟筛选方法的文献，重点是分子对接。我们突出了存在问题的方面，并提供了关于如何提高计算研究质量的指导方针。由于三维虚拟筛选方案通常通过区分活性和非活性化合物的能力来评估，我们总结了测试集的组成和制备对评估结果的影响。此外，我们研究了经典富集参数和先进描述符对三维虚拟筛选方法性能的重要性。此外，我们还综述了均方根偏差（RMSD）作为蛋白质-配体对接算法和构象空间子采样算法准确性度量的重要性和适用性。

相似文献

Evaluation of the performance of 3D virtual screening protocols: RMSD comparisons, enrichment assessments, and decoy selection--what can we learn from earlier mistakes?3D虚拟筛选方案的性能评估：均方根偏差比较、富集评估及诱饵选择——我们能从早期的错误中学到什么？

J Comput Aided Mol Des. 2008 Mar-Apr;22(3-4):213-28. doi: 10.1007/s10822-007-9163-6. Epub 2008 Jan 15.

Assessing the performance of 3D pharmacophore models in virtual screening: how good are they?评估 3D 药效团模型在虚拟筛选中的性能：它们有多好？

Curr Top Med Chem. 2013;13(9):1127-38. doi: 10.2174/1568026611313090010.

Lead finder: an approach to improve accuracy of protein-ligand docking, binding energy estimation, and virtual screening.铅离子寻找器：一种提高蛋白质-配体对接、结合能估计和虚拟筛选准确性的方法。

J Chem Inf Model. 2008 Dec;48(12):2371-85. doi: 10.1021/ci800166p.

Protein flexibility in ligand docking and virtual screening to protein kinases.用于蛋白激酶的配体对接和虚拟筛选中的蛋白质柔性

J Mol Biol. 2004 Mar 12;337(1):209-25. doi: 10.1016/j.jmb.2004.01.003.

Protein tyrosine phosphatases: Ligand interaction analysis and optimisation of virtual screening.蛋白质酪氨酸磷酸酶：配体相互作用分析与虚拟筛选的优化

J Mol Graph Model. 2014 Jul;52:114-23. doi: 10.1016/j.jmgm.2014.06.011. Epub 2014 Jul 5.

Toward fully automated high performance computing drug discovery: a massively parallel virtual screening pipeline for docking and molecular mechanics/generalized Born surface area rescoring to improve enrichment.迈向全自动高性能计算药物发现：一种大规模并行虚拟筛选管道，用于对接和分子力学/广义 Born 表面面积再评分，以提高富集度。

J Chem Inf Model. 2014 Jan 27;54(1):324-37. doi: 10.1021/ci4005145. Epub 2014 Jan 3.

Boosting Docking-Based Virtual Screening with Deep Learning.深度学习增强基于对接的虚拟筛选。

J Chem Inf Model. 2016 Dec 27;56(12):2495-2506. doi: 10.1021/acs.jcim.6b00355. Epub 2016 Nov 29.

Comparison of structure- and ligand-based virtual screening protocols considering hit list complementarity and enrichment factors.比较基于结构和配体的虚拟筛选方案，考虑命中列表互补性和富集因子。

ChemMedChem. 2010 Jan;5(1):148-58. doi: 10.1002/cmdc.200900314.

Multiple protein structures and multiple ligands: effects on the apparent goodness of virtual screening results.多种蛋白质结构与多种配体：对虚拟筛选结果表观质量的影响

J Comput Aided Mol Des. 2008 Mar-Apr;22(3-4):257-65. doi: 10.1007/s10822-008-9168-9. Epub 2008 Feb 14.

How to do an evaluation: pitfalls and traps.如何进行评估：陷阱与误区

J Comput Aided Mol Des. 2008 Mar-Apr;22(3-4):179-90. doi: 10.1007/s10822-007-9166-3. Epub 2008 Jan 23.

引用本文的文献

Computational prediction of high-risk non-synonymous SNPs in human ApoE and their structural impact on amyloid-β interaction in Alzheimer's disease pathogenesis.人类载脂蛋白E中高风险非同义单核苷酸多态性的计算预测及其在阿尔茨海默病发病机制中对淀粉样β蛋白相互作用的结构影响

PLoS One. 2025 Sep 2;20(9):e0331339. doi: 10.1371/journal.pone.0331339. eCollection 2025.

Influenza a Virus Inhibition: Evaluating Computationally Identified Cyproheptadine Through In Vitro Assessment.甲型流感病毒抑制作用：通过体外评估对计算鉴定出的赛庚啶进行评价。

Int J Mol Sci. 2025 Jun 21;26(13):5962. doi: 10.3390/ijms26135962.

-derived Signature Protein(s) in Murine Vaginal Lavage Fluid: Investigating the Underlying factors of Infertility as a Consequence of Sperm Impairing Bacterium.小鼠阴道灌洗液中衍生的标志性蛋白：探究精子损伤细菌导致不孕的潜在因素。

Biomark Insights. 2025 May 31;20:11772719251340518. doi: 10.1177/11772719251340518. eCollection 2025.

Docking-Based Classification of SGLT2 Inhibitors.基于对接的SGLT2抑制剂分类

Molecules. 2025 May 16;30(10):2179. doi: 10.3390/molecules30102179.

Unlocking the potential of Bavachin in vitamin D receptor cascade modulation for rheumatoid arthritis.挖掘补骨脂素在类风湿性关节炎维生素D受体级联调节中的潜力。

Mol Biol Rep. 2025 Apr 26;52(1):429. doi: 10.1007/s11033-025-10530-2.

Integrated Network Pharmacology, Molecular Modeling, LC-MS Profiling, and Semisynthetic Approach for the Roots of L. Metabolites in Cancer Treatment.综合网络药理学、分子建模、液相色谱-质谱分析以及半合成方法用于研究L. 根的代谢产物在癌症治疗中的作用。

ACS Omega. 2025 Mar 26;10(13):13027-13045. doi: 10.1021/acsomega.4c09853. eCollection 2025 Apr 8.

dbAMP 3.0: updated resource of antimicrobial activity and structural annotation of peptides in the post-pandemic era.dbAMP 3.0：大流行后时代抗菌肽活性和结构注释的更新资源。

Nucleic Acids Res. 2025 Jan 6;53(D1):D364-D376. doi: 10.1093/nar/gkae1019.

Targeting -Acetylglucosaminidase in with Iminosugar Inhibitors.使用亚氨基糖抑制剂靶向β-乙酰氨基葡萄糖苷酶。（注：原文中“-Acetylglucosaminidase”可能有误，推测为“β-Acetylglucosaminidase”，若不是请根据实际情况调整译文）

Antibiotics (Basel). 2024 Aug 10;13(8):751. doi: 10.3390/antibiotics13080751.

Extended Conformational Selection in the Antigen-Antibody Interaction of the PfAMA1 Protein.PfAMA1 蛋白的抗原-抗体相互作用中的扩展构象选择。

J Phys Chem B. 2024 Sep 5;128(35):8400-8408. doi: 10.1021/acs.jpcb.4c03734. Epub 2024 Aug 22.

Exploring compounds as potential inhibitors for allergen proteins: A systematic computational approach.探索作为变应原蛋白潜在抑制剂的化合物：一种系统的计算方法。

Heliyon. 2024 Jul 22;10(15):e34713. doi: 10.1016/j.heliyon.2024.e34713. eCollection 2024 Aug 15.

本文引用的文献

Fast and efficient in silico 3D screening: toward maximum computational efficiency of pharmacophore-based and shape-based approaches.快速高效的计算机模拟3D筛选：迈向基于药效团和基于形状方法的最大计算效率

J Chem Inf Model. 2007 Nov-Dec;47(6):2182-96. doi: 10.1021/ci700024q. Epub 2007 Oct 11.

CAESAR: a new conformer generation algorithm based on recursive buildup and local rotational symmetry consideration.CAESAR：一种基于递归构建和局部旋转对称性考虑的新构象生成算法。

J Chem Inf Model. 2007 Sep-Oct;47(5):1923-32. doi: 10.1021/ci700136x. Epub 2007 Aug 11.

Evaluations of molecular docking programs for virtual screening.用于虚拟筛选的分子对接程序评估。

J Chem Inf Model. 2007 Jul-Aug;47(4):1609-18. doi: 10.1021/ci7000378. Epub 2007 Jun 28.

Comparison of topological, shape, and docking methods in virtual screening.虚拟筛选中拓扑、形状和对接方法的比较

J Chem Inf Model. 2007 Jul-Aug;47(4):1504-19. doi: 10.1021/ci700052x. Epub 2007 Jun 26.

Parallel screening and activity profiling with HIV protease inhibitor pharmacophore models.利用HIV蛋白酶抑制剂药效团模型进行平行筛选和活性分析。

J Chem Inf Model. 2007 Mar-Apr;47(2):563-71. doi: 10.1021/ci600321m.

Evaluating virtual screening methods: good and bad metrics for the "early recognition" problem.评估虚拟筛选方法：针对“早期识别”问题的优劣指标

J Chem Inf Model. 2007 Mar-Apr;47(2):488-508. doi: 10.1021/ci600426e. Epub 2007 Feb 9.

Benchmarking sets for molecular docking.分子对接的基准测试集。

J Med Chem. 2006 Nov 16;49(23):6789-801. doi: 10.1021/jm0608356.

The impact of tautomer forms on pharmacophore-based virtual screening.互变异构体形式对基于药效团的虚拟筛选的影响。

J Chem Inf Model. 2006 Nov-Dec;46(6):2342-54. doi: 10.1021/ci060109b.

High-throughput structure-based pharmacophore modelling as a basis for successful parallel virtual screening.基于结构的高通量药效团建模作为成功并行虚拟筛选的基础。

J Comput Aided Mol Des. 2006 Dec;20(12):703-15. doi: 10.1007/s10822-006-9066-y. Epub 2006 Sep 29.

A critical assessment of docking programs and scoring functions.对接程序和评分函数的批判性评估。

J Med Chem. 2006 Oct 5;49(20):5912-31. doi: 10.1021/jm050362n.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验

3D虚拟筛选方案的性能评估：均方根偏差比较、富集评估及诱饵选择——我们能从早期的错误中学到什么？

Evaluation of the performance of 3D virtual screening protocols: RMSD comparisons, enrichment assessments, and decoy selection--what can we learn from earlier mistakes?

作者信息

机构信息

出版信息

相似文献

引用本文的文献

本文引用的文献

文献检索

文件翻译

深度研究

Suppr 超能文献

相似文献

引用本文的文献

本文引用的文献