• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

一种用于基于配体的虚拟筛选的无偏基准集构建方法及其在 GPCR 中的应用。

An unbiased method to build benchmarking sets for ligand-based virtual screening and its application to GPCRs.

机构信息

State Key Laboratory of Natural and Biomimetic Drugs, School of Pharmaceutical Sciences, Peking University , Beijing 100191, China.

出版信息

J Chem Inf Model. 2014 May 27;54(5):1433-50. doi: 10.1021/ci500062f. Epub 2014 May 1.

DOI:10.1021/ci500062f
PMID:24749745
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC4038372/
Abstract

Benchmarking data sets have become common in recent years for the purpose of virtual screening, though the main focus had been placed on the structure-based virtual screening (SBVS) approaches. Due to the lack of crystal structures, there is great need for unbiased benchmarking sets to evaluate various ligand-based virtual screening (LBVS) methods for important drug targets such as G protein-coupled receptors (GPCRs). To date these ready-to-apply data sets for LBVS are fairly limited, and the direct usage of benchmarking sets designed for SBVS could bring the biases to the evaluation of LBVS. Herein, we propose an unbiased method to build benchmarking sets for LBVS and validate it on a multitude of GPCRs targets. To be more specific, our methods can (1) ensure chemical diversity of ligands, (2) maintain the physicochemical similarity between ligands and decoys, (3) make the decoys dissimilar in chemical topology to all ligands to avoid false negatives, and (4) maximize spatial random distribution of ligands and decoys. We evaluated the quality of our Unbiased Ligand Set (ULS) and Unbiased Decoy Set (UDS) using three common LBVS approaches, with Leave-One-Out (LOO) Cross-Validation (CV) and a metric of average AUC of the ROC curves. Our method has greatly reduced the "artificial enrichment" and "analogue bias" of a published GPCRs benchmarking set, i.e., GPCR Ligand Library (GLL)/GPCR Decoy Database (GDD). In addition, we addressed an important issue about the ratio of decoys per ligand and found that for a range of 30 to 100 it does not affect the quality of the benchmarking set, so we kept the original ratio of 39 from the GLL/GDD.

摘要

近年来,基准数据集已成为虚拟筛选的常用方法,尽管主要关注点一直放在基于结构的虚拟筛选 (SBVS) 方法上。由于缺乏晶体结构,因此非常需要无偏基准数据集来评估各种配体为基础的虚拟筛选 (LBVS) 方法,以用于 G 蛋白偶联受体 (GPCR) 等重要药物靶标。迄今为止,这些可直接用于 LBVS 的现成基准数据集相当有限,而直接使用专为 SBVS 设计的基准数据集可能会给 LBVS 的评估带来偏差。在此,我们提出了一种构建 LBVS 基准数据集的无偏方法,并在多种 GPCR 靶标上对其进行了验证。具体而言,我们的方法可以 (1) 确保配体的化学多样性,(2) 保持配体和诱饵之间的物理化学相似性,(3) 使诱饵在化学拓扑上与所有配体不同,以避免假阴性,以及 (4) 最大化配体和诱饵的空间随机分布。我们使用三种常见的 LBVS 方法,通过留一法交叉验证 (LOO-CV) 和 ROC 曲线平均 AUC 的度量标准,评估了我们的无偏配体集 (ULS) 和无偏诱饵集 (UDS) 的质量。我们的方法大大降低了已发表的 GPCR 基准数据集,即 GPCR 配体库 (GLL)/GPCR 诱饵数据库 (GDD) 的“人为富集”和“类似物偏差”。此外,我们解决了关于诱饵与配体比例的一个重要问题,发现对于 30 到 100 的范围,它不会影响基准数据集的质量,因此我们保留了 GLL/GDD 中的原始比例 39。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/eed0/4038372/205d9c37562e/ci-2014-00062f_0010.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/eed0/4038372/133d89ef9156/ci-2014-00062f_0011.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/eed0/4038372/8e9edbd33e9a/ci-2014-00062f_0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/eed0/4038372/642d36d7dfaa/ci-2014-00062f_0005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/eed0/4038372/01ca6ebaa202/ci-2014-00062f_0006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/eed0/4038372/0f28b42cd642/ci-2014-00062f_0007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/eed0/4038372/ed4cc8fbbd50/ci-2014-00062f_0008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/eed0/4038372/5cbd647f0d13/ci-2014-00062f_0009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/eed0/4038372/205d9c37562e/ci-2014-00062f_0010.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/eed0/4038372/133d89ef9156/ci-2014-00062f_0011.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/eed0/4038372/8e9edbd33e9a/ci-2014-00062f_0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/eed0/4038372/642d36d7dfaa/ci-2014-00062f_0005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/eed0/4038372/01ca6ebaa202/ci-2014-00062f_0006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/eed0/4038372/0f28b42cd642/ci-2014-00062f_0007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/eed0/4038372/ed4cc8fbbd50/ci-2014-00062f_0008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/eed0/4038372/5cbd647f0d13/ci-2014-00062f_0009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/eed0/4038372/205d9c37562e/ci-2014-00062f_0010.jpg

相似文献

1
An unbiased method to build benchmarking sets for ligand-based virtual screening and its application to GPCRs.一种用于基于配体的虚拟筛选的无偏基准集构建方法及其在 GPCR 中的应用。
J Chem Inf Model. 2014 May 27;54(5):1433-50. doi: 10.1021/ci500062f. Epub 2014 May 1.
2
Benchmarking methods and data sets for ligand enrichment assessment in virtual screening.虚拟筛选中配体富集评估的基准测试方法和数据集
Methods. 2015 Jan;71:146-57. doi: 10.1016/j.ymeth.2014.11.015. Epub 2014 Dec 3.
3
Comparative modeling and benchmarking data sets for human histone deacetylases and sirtuin families.人类组蛋白去乙酰化酶和沉默调节蛋白家族的比较建模与基准数据集。
J Chem Inf Model. 2015 Feb 23;55(2):374-88. doi: 10.1021/ci5005515. Epub 2015 Feb 9.
4
Extended template-based modeling and evaluation method using consensus of binding mode of GPCRs for virtual screening.基于模板的扩展建模和评估方法,利用 GPCR 结合模式的共识进行虚拟筛选。
J Chem Inf Model. 2014 Nov 24;54(11):3153-61. doi: 10.1021/ci500499j. Epub 2014 Nov 11.
5
Ligand and decoy sets for docking to G protein-coupled receptors.配体和诱饵集,用于与 G 蛋白偶联受体对接。
J Chem Inf Model. 2012 Jan 23;52(1):1-6. doi: 10.1021/ci200412p. Epub 2011 Dec 14.
6
An efficient multistep ligand-based virtual screening approach for GPR40 agonists.一种高效的基于配体的多步虚拟筛选方法,用于寻找 GPR40 激动剂。
Mol Divers. 2014 Feb;18(1):183-93. doi: 10.1007/s11030-013-9493-3. Epub 2013 Dec 5.
7
Benchmarking Data Sets for the Evaluation of Virtual Ligand Screening Methods: Review and Perspectives.用于虚拟配体筛选方法评估的基准数据集:综述与展望。
J Chem Inf Model. 2015 Jul 27;55(7):1297-307. doi: 10.1021/acs.jcim.5b00090. Epub 2015 Jun 18.
8
The Development of Target-Specific Pose Filter Ensembles To Boost Ligand Enrichment for Structure-Based Virtual Screening.用于增强基于结构的虚拟筛选中配体富集的目标特异性姿态过滤器集成的开发。
J Chem Inf Model. 2017 Jun 26;57(6):1414-1425. doi: 10.1021/acs.jcim.6b00749. Epub 2017 Jun 1.
9
Development of New Methods Needs Proper Evaluation-Benchmarking Sets for Machine Learning Experiments for Class A GPCRs.发展新方法需要适当的评估——用于 A 类 GPCR 机器学习实验的基准数据集。
J Chem Inf Model. 2019 Dec 23;59(12):4974-4992. doi: 10.1021/acs.jcim.9b00689. Epub 2019 Nov 22.
10
Novel approach for efficient pharmacophore-based virtual screening: method and applications.基于药效团的高效虚拟筛选新方法:方法与应用
J Chem Inf Model. 2009 Oct;49(10):2333-43. doi: 10.1021/ci900263d.

引用本文的文献

1
Building shape-focused pharmacophore models for effective docking screening.构建以形状为重点的药效团模型用于有效的对接筛选。
J Cheminform. 2024 Aug 9;16(1):97. doi: 10.1186/s13321-024-00857-6.
2
Multimodal data fusion for supervised learning-based identification of USP7 inhibitors: a systematic comparison.基于监督学习的USP7抑制剂识别的多模态数据融合:系统比较
J Cheminform. 2023 Jan 11;15(1):5. doi: 10.1186/s13321-022-00675-8.
3
Combination of Docking-Based and Pharmacophore-Based Virtual Screening Identifies Novel Agonists That Target the Urotensin Receptor.

本文引用的文献

1
Benchmarking ligand-based virtual High-Throughput Screening with the PubChem database.基于配体的虚拟高通量筛选与 PubChem 数据库的基准测试。
Molecules. 2013 Jan 8;18(1):735-56. doi: 10.3390/molecules18010735.
2
G protein-coupled receptors--recent advances.G蛋白偶联受体——最新进展
Acta Biochim Pol. 2012;59(4):515-29. Epub 2012 Dec 18.
3
FINDSITE(comb): a threading/structure-based, proteomic-scale virtual ligand screening approach.FINDSITE(组合):一种基于配体穿线/结构的蛋白质组学规模的虚拟配体筛选方法。
基于对接和基于药效团的虚拟筛选相结合,鉴定靶向尾加压素受体的新型激动剂。
Molecules. 2022 Dec 8;27(24):8692. doi: 10.3390/molecules27248692.
4
Comprehensive Survey of Recent Drug Discovery Using Deep Learning.深度学习在药物发现中的最新应用综述
Int J Mol Sci. 2021 Sep 15;22(18):9983. doi: 10.3390/ijms22189983.
5
FRAGSITE: A Fragment-Based Approach for Virtual Ligand Screening.FRAGSITE:基于片段的虚拟配体筛选方法。
J Chem Inf Model. 2021 Apr 26;61(4):2074-2089. doi: 10.1021/acs.jcim.0c01160. Epub 2021 Mar 16.
6
Property-Unmatched Decoys in Docking Benchmarks.对接基准测试中的属性不匹配诱饵。
J Chem Inf Model. 2021 Feb 22;61(2):699-714. doi: 10.1021/acs.jcim.0c00598. Epub 2021 Jan 25.
7
A unique ligand-steered strategy for CC chemokine receptor 2 homology modeling to facilitate structure-based virtual screening.一种独特的配体导向策略,用于 CC 趋化因子受体 2 同源建模,以促进基于结构的虚拟筛选。
Chem Biol Drug Des. 2021 Apr;97(4):944-961. doi: 10.1111/cbdd.13820. Epub 2021 Jan 16.
8
Virtual screening and drug repurposing experiments to identify potential novel selective MAO-B inhibitors for Parkinson's disease treatment.虚拟筛选和药物再利用实验,以鉴定潜在的新型选择性 MAO-B 抑制剂,用于治疗帕金森病。
Mol Divers. 2021 Aug;25(3):1775-1794. doi: 10.1007/s11030-020-10155-6. Epub 2020 Nov 25.
9
Structure-Based Discovery of a Selective KDM5A Inhibitor that Exhibits Anti-Cancer Activity via Inducing Cell Cycle Arrest and Senescence in Breast Cancer Cell Lines.基于结构发现一种选择性KDM5A抑制剂,该抑制剂通过诱导乳腺癌细胞系的细胞周期停滞和衰老表现出抗癌活性。
Cancers (Basel). 2019 Jan 15;11(1):92. doi: 10.3390/cancers11010092.
10
Deep learning and virtual drug screening.深度学习与虚拟药物筛选。
Future Med Chem. 2018 Nov;10(21):2557-2567. doi: 10.4155/fmc-2018-0314. Epub 2018 Oct 5.
J Chem Inf Model. 2013 Jan 28;53(1):230-40. doi: 10.1021/ci300510n. Epub 2012 Dec 28.
4
The GPCR Network: a large-scale collaboration to determine human GPCR structure and function.G 蛋白偶联受体网络:一个旨在确定人类 G 蛋白偶联受体结构和功能的大型合作项目。
Nat Rev Drug Discov. 2013 Jan;12(1):25-34. doi: 10.1038/nrd3859. Epub 2012 Dec 14.
5
Identification of structural determinants of ligand selectivity in 5-HT₂ receptor subtypes on the basis of protein-ligand interactions.基于蛋白-配体相互作用,鉴定 5-HT₂ 受体亚型中配体选择性的结构决定因素。
J Mol Graph Model. 2012 Sep;38:342-53. doi: 10.1016/j.jmgm.2012.06.006. Epub 2012 Jul 10.
6
A new era of GPCR structural and chemical biology.GPCR 结构与化学生物学的新纪元。
Nat Chem Biol. 2012 Jul 18;8(8):670-3. doi: 10.1038/nchembio.1025.
7
Directory of useful decoys, enhanced (DUD-E): better ligands and decoys for better benchmarking.有用诱饵目录增强版(DUD-E):更好的配体和诱饵,用于更好的基准测试。
J Med Chem. 2012 Jul 26;55(14):6582-94. doi: 10.1021/jm300687e. Epub 2012 Jul 5.
8
Comparative analysis of pharmacophore screening tools.药效团筛选工具的比较分析。
J Chem Inf Model. 2012 Jun 25;52(6):1607-20. doi: 10.1021/ci2005274. Epub 2012 Jun 13.
9
ZINC: a free tool to discover chemistry for biology.ZINC:一款用于生物学的免费化学发现工具。
J Chem Inf Model. 2012 Jul 23;52(7):1757-68. doi: 10.1021/ci3001277. Epub 2012 Jun 15.
10
Lead Finder docking and virtual screening evaluation with Astex and DUD test sets.Lead Finder 对接和虚拟筛选评估与 Astex 和 DUD 测试集。
J Comput Aided Mol Des. 2012 Jun;26(6):725-35. doi: 10.1007/s10822-012-9549-y. Epub 2012 May 9.