• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

虚拟筛选的社区基准。

Community benchmarks for virtual screening.

作者信息

Irwin John J

机构信息

Department of Pharmaceutical Chemistry, University of California San Francisco, PO Box 2550, Byers Hall, San Francisco, CA 94158-2330, USA.

出版信息

J Comput Aided Mol Des. 2008 Mar-Apr;22(3-4):193-9. doi: 10.1007/s10822-008-9189-4. Epub 2008 Feb 14.

DOI:10.1007/s10822-008-9189-4
PMID:18273555
Abstract

Ligand enrichment among top-ranking hits is a key metric of virtual screening. To avoid bias, decoys should resemble ligands physically, so that enrichment is not attributable to simple differences of gross features. We therefore created a directory of useful decoys (DUD) by selecting decoys that resembled annotated ligands physically but not topologically to benchmark docking performance. DUD has 2950 annotated ligands and 95,316 property-matched decoys for 40 targets. It is by far the largest and most comprehensive public data set for benchmarking virtual screening programs that I am aware of. This paper outlines several ways that DUD can be improved to provide better telemetry to investigators seeking to understand both the strengths and the weaknesses of current docking methods. I also highlight several pitfalls for the unwary: a risk of over-optimization, questions about chemical space, and the proper scope for using DUD. Careful attention to both the composition of benchmarks and how they are used is essential to avoid being misled by overfitting and bias.

摘要

排名靠前的命中配体中的配体富集是虚拟筛选的关键指标。为避免偏差,诱饵应在物理上类似于配体,以使富集不归因于总体特征的简单差异。因此,我们通过选择在物理上类似于注释配体但拓扑结构不同的诱饵来创建一个有用诱饵目录(DUD),以基准化对接性能。DUD有针对40个靶点的2950个注释配体和95316个性质匹配的诱饵。据我所知,它是目前用于基准化虚拟筛选程序的最大且最全面的公共数据集。本文概述了几种改进DUD的方法,以便为试图了解当前对接方法优缺点的研究人员提供更好的遥测数据。我还强调了一些粗心者易犯的错误:过度优化的风险、关于化学空间的问题以及使用DUD的适当范围。仔细关注基准的组成及其使用方式对于避免因过度拟合和偏差而被误导至关重要。

相似文献

1
Community benchmarks for virtual screening.虚拟筛选的社区基准。
J Comput Aided Mol Des. 2008 Mar-Apr;22(3-4):193-9. doi: 10.1007/s10822-008-9189-4. Epub 2008 Feb 14.
2
Benchmarking sets for molecular docking.分子对接的基准测试集。
J Med Chem. 2006 Nov 16;49(23):6789-801. doi: 10.1021/jm0608356.
3
Directory of useful decoys, enhanced (DUD-E): better ligands and decoys for better benchmarking.有用诱饵目录增强版(DUD-E):更好的配体和诱饵,用于更好的基准测试。
J Med Chem. 2012 Jul 26;55(14):6582-94. doi: 10.1021/jm300687e. Epub 2012 Jul 5.
4
Benchmark of four popular virtual screening programs: construction of the active/decoy dataset remains a major determinant of measured performance.四种常用虚拟筛选程序的基准测试:活性/诱饵数据集的构建仍然是衡量性能的主要决定因素。
J Cheminform. 2016 Oct 17;8:56. doi: 10.1186/s13321-016-0167-x. eCollection 2016.
5
Virtual decoy sets for molecular docking benchmarks.分子对接基准的虚拟诱饵集。
J Chem Inf Model. 2011 Feb 28;51(2):196-202. doi: 10.1021/ci100374f. Epub 2011 Jan 5.
6
Binding energy landscape analysis helps to discriminate true hits from high-scoring decoys in virtual screening.结合能景观分析有助于在虚拟筛选中区分真正的命中物和高得分的伪靶标。
J Chem Inf Model. 2010 Oct 25;50(10):1855-64. doi: 10.1021/ci900463u.
7
Validation of a Field-Based Ligand Screener Using a Novel Benchmarking Data Set for Assessing 3D-Based Virtual Screening Methods.基于新型基准数据集评估 3D 虚拟筛选方法的现场配体筛选器的验证。
J Chem Inf Model. 2021 Dec 27;61(12):5841-5852. doi: 10.1021/acs.jcim.1c00866. Epub 2021 Nov 18.
8
Property-Unmatched Decoys in Docking Benchmarks.对接基准测试中的属性不匹配诱饵。
J Chem Inf Model. 2021 Feb 22;61(2):699-714. doi: 10.1021/acs.jcim.0c00598. Epub 2021 Jan 25.
9
Boosting Docking-Based Virtual Screening with Deep Learning.深度学习增强基于对接的虚拟筛选。
J Chem Inf Model. 2016 Dec 27;56(12):2495-2506. doi: 10.1021/acs.jcim.6b00355. Epub 2016 Nov 29.
10
Ligity: A Non-Superpositional, Knowledge-Based Approach to Virtual Screening. Ligity:一种非叠加的、基于知识的虚拟筛选方法。
J Chem Inf Model. 2019 Jun 24;59(6):2600-2616. doi: 10.1021/acs.jcim.8b00779. Epub 2019 Jun 4.

引用本文的文献

1
GESim: ultrafast graph-based molecular similarity calculation via von Neumann graph entropy.GESim:通过冯·诺依曼图熵进行基于图的超快速分子相似性计算。
J Cheminform. 2025 Apr 22;17(1):57. doi: 10.1186/s13321-025-01003-6.
2
Virtual screening: hope, hype, and the fine line in between.虚拟筛选:希望、炒作与二者之间的微妙界限。
Expert Opin Drug Discov. 2025 Feb;20(2):145-162. doi: 10.1080/17460441.2025.2458666. Epub 2025 Jan 27.
3
Do Molecular Fingerprints Identify Diverse Active Drugs in Large-Scale Virtual Screening? (No).分子指纹图谱能否在大规模虚拟筛选中识别出多种活性药物?(不能)

本文引用的文献

1
Free R value: a novel statistical quantity for assessing the accuracy of crystal structures.自由R值:一种用于评估晶体结构准确性的新型统计量。
Nature. 1992 Jan 30;355(6359):472-5. doi: 10.1038/355472a0.
2
Optimization of CAMD techniques 3. Virtual screening enrichment studies: a help or hindrance in tool selection?计算机辅助分子设计(CAMD)技术的优化3. 虚拟筛选富集研究:在工具选择中是助力还是阻碍?
J Comput Aided Mol Des. 2008 Mar-Apr;22(3-4):169-78. doi: 10.1007/s10822-007-9167-2. Epub 2008 Jan 9.
3
Binding MOAD, a high-quality protein-ligand database.
Pharmaceuticals (Basel). 2024 Jul 26;17(8):992. doi: 10.3390/ph17080992.
4
On the relevance of query definition in the performance of 3D ligand-based virtual screening.在 3D 基于配体的虚拟筛选性能中查询定义的相关性。
J Comput Aided Mol Des. 2024 Apr 4;38(1):18. doi: 10.1007/s10822-024-00561-5.
5
Synthesis of non-symmetric -benzylbispidinol amides and study of their inhibitory activity against the main protease of the SARS-CoV-2 virus.非对称-苄基双吡啶醇酰胺的合成及其对SARS-CoV-2病毒主要蛋白酶的抑制活性研究
Russ Chem Bull. 2023;72(1):239-247. doi: 10.1007/s11172-023-3729-x. Epub 2023 Feb 14.
6
A New Hybrid Neural Network Deep Learning Method for Protein-Ligand Binding Affinity Prediction and De Novo Drug Design.一种用于蛋白质-配体结合亲和力预测和从头药物设计的新型混合神经网络深度学习方法。
Int J Mol Sci. 2022 Nov 11;23(22):13912. doi: 10.3390/ijms232213912.
7
Confidence bands and hypothesis tests for hit enrichment curves.命中富集曲线的置信带和假设检验。
J Cheminform. 2022 Jul 28;14(1):50. doi: 10.1186/s13321-022-00629-0.
8
The Breast Cancer Protooncogenes HER2, BRCA1 and BRCA2 and Their Regulation by the iNOS/NOS2 Axis.乳腺癌原癌基因HER2、BRCA1和BRCA2及其受诱导型一氧化氮合酶/一氧化氮合酶2轴的调控
Antioxidants (Basel). 2022 Jun 17;11(6):1195. doi: 10.3390/antiox11061195.
9
Application of deep metric learning to molecular graph similarity.深度度量学习在分子图相似性中的应用。
J Cheminform. 2022 Mar 12;14(1):11. doi: 10.1186/s13321-022-00595-7.
10
A Search for Cyclin-Dependent Kinase 4/6 Inhibitors by Pharmacophore-Based Virtual Screening, Molecular Docking, and Molecular Dynamic Simulations.基于药效团的虚拟筛选、分子对接和分子动力学模拟寻找细胞周期蛋白依赖性激酶 4/6 抑制剂。
Int J Mol Sci. 2021 Dec 14;22(24):13423. doi: 10.3390/ijms222413423.
绑定MOAD,一个高质量的蛋白质-配体数据库。
Nucleic Acids Res. 2008 Jan;36(Database issue):D674-8. doi: 10.1093/nar/gkm911. Epub 2007 Nov 30.
4
High-resolution crystal structure of an engineered human beta2-adrenergic G protein-coupled receptor.一种工程化人β2-肾上腺素能G蛋白偶联受体的高分辨率晶体结构
Science. 2007 Nov 23;318(5854):1258-65. doi: 10.1126/science.1150577. Epub 2007 Oct 25.
5
Separating model optimization and model validation in statistical cross-validation as applied to crystallography.在应用于晶体学的统计交叉验证中分离模型优化和模型验证。
Acta Crystallogr D Biol Crystallogr. 2007 Sep;63(Pt 9):939-40. doi: 10.1107/S0907444907033458. Epub 2007 Aug 17.
6
Evaluations of molecular docking programs for virtual screening.用于虚拟筛选的分子对接程序评估。
J Chem Inf Model. 2007 Jul-Aug;47(4):1609-18. doi: 10.1021/ci7000378. Epub 2007 Jun 28.
7
Comparative performance of several flexible docking programs and scoring functions: enrichment studies for a diverse set of pharmaceutically relevant targets.几种柔性对接程序和评分函数的比较性能:针对多种药学相关靶点的富集研究
J Chem Inf Model. 2007 Jul-Aug;47(4):1599-608. doi: 10.1021/ci7000346. Epub 2007 Jun 23.
8
Diverse, high-quality test set for the validation of protein-ligand docking performance.用于验证蛋白质-配体对接性能的多样、高质量测试集。
J Med Chem. 2007 Feb 22;50(4):726-41. doi: 10.1021/jm061277y.
9
Virtual exploration of the chemical universe up to 11 atoms of C, N, O, F: assembly of 26.4 million structures (110.9 million stereoisomers) and analysis for new ring systems, stereochemistry, physicochemical properties, compound classes, and drug discovery.对含11个碳、氮、氧、氟原子以内的化学宇宙进行虚拟探索:2640万个结构(1.109亿个立体异构体)的组装以及对新环系、立体化学、物理化学性质、化合物类别和药物发现的分析。
J Chem Inf Model. 2007 Mar-Apr;47(2):342-53. doi: 10.1021/ci600423u. Epub 2007 Jan 30.
10
Crystallographic study of the tetrabutylammonium block to the KcsA K+ channel.四丁基铵对KcsA钾通道阻滞作用的晶体学研究。
J Mol Biol. 2007 Feb 23;366(3):806-14. doi: 10.1016/j.jmb.2006.11.081. Epub 2006 Dec 2.