• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于协同过滤的隐式描述符配体虚拟筛选。

Implicit-descriptor ligand-based virtual screening by means of collaborative filtering.

作者信息

Srinivas Raghuram, Klimovich Pavel V, Larson Eric C

机构信息

Department of Computer Science and Engineering, Bobby B. Lyle School of Engineering, Southern Methodist University, 3145 Dyer Street, Dallas, TX, 75205, USA.

DataScience@SMU, Dallas, 75205, TX, USA.

出版信息

J Cheminform. 2018 Nov 22;10(1):56. doi: 10.1186/s13321-018-0310-y.

DOI:10.1186/s13321-018-0310-y
PMID:30467684
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC6755561/
Abstract

Current ligand-based machine learning methods in virtual screening rely heavily on molecular fingerprinting for preprocessing, i.e., explicit description of ligands' structural and physicochemical properties in a vectorized form. Of particular importance to current methods are the extent to which molecular fingerprints describe a particular ligand and what metric sufficiently captures similarity among ligands. In this work, we propose and evaluate methods that do not require explicit feature vectorization through fingerprinting, but, instead, provide implicit descriptors based only on other known assays. Our methods are based upon well known collaborative filtering algorithms used in recommendation systems. Our implicit descriptor method does not require any fingerprint similarity search, which makes the method free of the bias arising from the empirical nature of the fingerprint models. We show that implicit methods significantly outperform traditional machine learning methods, and the main strengths of implicit methods are their resilience to target-ligand sparsity and high potential for spotting promiscuous ligands.

摘要

当前虚拟筛选中基于配体的机器学习方法在很大程度上依赖于分子指纹进行预处理,即以矢量化形式明确描述配体的结构和物理化学性质。对于当前方法而言,特别重要的是分子指纹描述特定配体的程度以及何种度量能够充分捕捉配体之间的相似性。在这项工作中,我们提出并评估了一些方法,这些方法不需要通过指纹进行明确的特征矢量化,而是仅基于其他已知测定提供隐式描述符。我们的方法基于推荐系统中使用的著名协同过滤算法。我们的隐式描述符方法不需要任何指纹相似性搜索,这使得该方法不受指纹模型经验性质所产生的偏差影响。我们表明,隐式方法显著优于传统机器学习方法,并且隐式方法的主要优势在于它们对靶标 - 配体稀疏性的耐受性以及发现混杂配体的高潜力。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/de29/6755561/35ea87a19f97/13321_2018_310_Fig15_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/de29/6755561/2b949d673f69/13321_2018_310_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/de29/6755561/5425886259c4/13321_2018_310_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/de29/6755561/6f346a34d026/13321_2018_310_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/de29/6755561/c29e75168da1/13321_2018_310_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/de29/6755561/34851238f250/13321_2018_310_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/de29/6755561/1165d502898a/13321_2018_310_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/de29/6755561/be0cd22af608/13321_2018_310_Fig7_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/de29/6755561/621dad690d24/13321_2018_310_Fig8_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/de29/6755561/7e4710de8a0c/13321_2018_310_Fig9_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/de29/6755561/e223faa0e9a7/13321_2018_310_Fig10_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/de29/6755561/6d4af1e84cbc/13321_2018_310_Fig11_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/de29/6755561/9e23d2d55e52/13321_2018_310_Fig12_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/de29/6755561/9d762dec9564/13321_2018_310_Fig13_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/de29/6755561/567ed3ec714a/13321_2018_310_Fig14_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/de29/6755561/35ea87a19f97/13321_2018_310_Fig15_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/de29/6755561/2b949d673f69/13321_2018_310_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/de29/6755561/5425886259c4/13321_2018_310_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/de29/6755561/6f346a34d026/13321_2018_310_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/de29/6755561/c29e75168da1/13321_2018_310_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/de29/6755561/34851238f250/13321_2018_310_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/de29/6755561/1165d502898a/13321_2018_310_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/de29/6755561/be0cd22af608/13321_2018_310_Fig7_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/de29/6755561/621dad690d24/13321_2018_310_Fig8_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/de29/6755561/7e4710de8a0c/13321_2018_310_Fig9_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/de29/6755561/e223faa0e9a7/13321_2018_310_Fig10_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/de29/6755561/6d4af1e84cbc/13321_2018_310_Fig11_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/de29/6755561/9e23d2d55e52/13321_2018_310_Fig12_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/de29/6755561/9d762dec9564/13321_2018_310_Fig13_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/de29/6755561/567ed3ec714a/13321_2018_310_Fig14_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/de29/6755561/35ea87a19f97/13321_2018_310_Fig15_HTML.jpg

相似文献

1
Implicit-descriptor ligand-based virtual screening by means of collaborative filtering.基于协同过滤的隐式描述符配体虚拟筛选。
J Cheminform. 2018 Nov 22;10(1):56. doi: 10.1186/s13321-018-0310-y.
2
Deep Learning-Based Ligand Design Using Shared Latent Implicit Fingerprints from Collaborative Filtering.基于深度学习的配体设计,利用协同过滤的共享潜在隐式指纹图谱。
J Chem Inf Model. 2021 May 24;61(5):2159-2174. doi: 10.1021/acs.jcim.0c01355. Epub 2021 Apr 26.
3
Comparison of ligand- and structure-based virtual screening on the DUD data set.基于配体和结构的虚拟筛选在DUD数据集上的比较。
J Chem Inf Model. 2009 Feb;49(2):209-31. doi: 10.1021/ci800303k.
4
Consensus queries in ligand-based virtual screening experiments.基于配体的虚拟筛选实验中的一致性查询。
J Cheminform. 2017 Nov 28;9(1):60. doi: 10.1186/s13321-017-0248-5.
5
Molecular graph convolutions: moving beyond fingerprints.分子图卷积:超越指纹图谱
J Comput Aided Mol Des. 2016 Aug;30(8):595-608. doi: 10.1007/s10822-016-9938-8. Epub 2016 Aug 24.
6
MOST: most-similar ligand based approach to target prediction.MOST:基于最相似配体的靶点预测方法。
BMC Bioinformatics. 2017 Mar 11;18(1):165. doi: 10.1186/s12859-017-1586-z.
7
Using Domain-Specific Fingerprints Generated Through Neural Networks to Enhance Ligand-Based Virtual Screening.利用神经网络生成的领域特定指纹增强基于配体的虚拟筛选。
J Chem Inf Model. 2021 Feb 22;61(2):664-675. doi: 10.1021/acs.jcim.0c01208. Epub 2021 Jan 26.
8
How similar are similarity searching methods? A principal component analysis of molecular descriptor space.相似性搜索方法的相似程度如何?分子描述符空间的主成分分析。
J Chem Inf Model. 2009 Jan;49(1):108-19. doi: 10.1021/ci800249s.
9
Structure-based drug screening and ligand-based drug screening with machine learning.基于结构的药物筛选以及结合机器学习的基于配体的药物筛选。
Comb Chem High Throughput Screen. 2009 May;12(4):397-408. doi: 10.2174/138620709788167890.
10
LigMate: A Multifeature Integration Algorithm for Ligand-Similarity-Based Virtual Screening.LigMate:一种基于配体相似性的虚拟筛选的多特征集成算法。
J Chem Inf Model. 2020 Dec 28;60(12):6044-6053. doi: 10.1021/acs.jcim.9b01210. Epub 2020 Nov 16.

引用本文的文献

1
Development and Evaluation of Health Recommender Systems: Systematic Scoping Review and Evidence Mapping.健康推荐系统的开发和评估:系统范围的综述和证据绘图。
J Med Internet Res. 2023 Jan 19;25:e38184. doi: 10.2196/38184.
2
Key Topics in Molecular Docking for Drug Design.药物设计中的分子对接关键主题。
Int J Mol Sci. 2019 Sep 15;20(18):4574. doi: 10.3390/ijms20184574.
3
Exploring the Potential of Spherical Harmonics and PCVM for Compounds Activity Prediction.探索球谐函数和 PCVM 在化合物活性预测中的应用潜力。

本文引用的文献

1
Deep learning and virtual drug screening.深度学习与虚拟药物筛选。
Future Med Chem. 2018 Nov;10(21):2557-2567. doi: 10.4155/fmc-2018-0314. Epub 2018 Oct 5.
2
MoleculeNet: a benchmark for molecular machine learning.分子网络:分子机器学习的一个基准
Chem Sci. 2017 Oct 31;9(2):513-530. doi: 10.1039/c7sc02664a. eCollection 2018 Jan 14.
3
Beyond the hype: deep neural networks outperform established methods using a ChEMBL bioactivity benchmark set.超越炒作:使用ChEMBL生物活性基准集,深度神经网络优于现有方法。
Int J Mol Sci. 2019 May 2;20(9):2175. doi: 10.3390/ijms20092175.
J Cheminform. 2017 Aug 14;9(1):45. doi: 10.1186/s13321-017-0232-0.
4
Shallow Representation Learning via Kernel PCA Improves QSAR Modelability.通过核主成分分析的浅层表示学习提高定量构效关系模型能力。
J Chem Inf Model. 2017 Aug 28;57(8):1859-1867. doi: 10.1021/acs.jcim.6b00694. Epub 2017 Aug 7.
5
Comparison of the Predictive Performance and Interpretability of Random Forest and Linear Models on Benchmark Data Sets.随机森林和线性模型在基准数据集上的预测性能与可解释性比较
J Chem Inf Model. 2017 Aug 28;57(8):1773-1792. doi: 10.1021/acs.jcim.6b00753. Epub 2017 Aug 2.
6
Active learning for computational chemogenomics.计算化学生物基因组学的主动学习。
Future Med Chem. 2017 Mar;9(4):381-402. doi: 10.4155/fmc-2016-0197. Epub 2017 Mar 6.
7
The impact of in silico screening in the discovery of novel and safer drug candidates.计算机筛选在新型、更安全药物候选物发现中的影响。
Pharmacol Ther. 2017 Jul;175:47-66. doi: 10.1016/j.pharmthera.2017.02.034. Epub 2017 Feb 14.
8
Computational chemistry at Janssen.杨森公司的计算化学
J Comput Aided Mol Des. 2017 Mar;31(3):267-273. doi: 10.1007/s10822-016-9998-9. Epub 2016 Dec 19.
9
Profiling Prediction of Kinase Inhibitors: Toward the Virtual Assay.激酶抑制剂的特征预测:迈向虚拟分析
J Med Chem. 2017 Jan 12;60(1):474-485. doi: 10.1021/acs.jmedchem.6b01611. Epub 2016 Dec 14.
10
Extreme Gradient Boosting as a Method for Quantitative Structure-Activity Relationships.极端梯度提升在定量构效关系中的应用。
J Chem Inf Model. 2016 Dec 27;56(12):2353-2360. doi: 10.1021/acs.jcim.6b00591. Epub 2016 Dec 13.