MetMaxStruct: A Tversky-Similarity-Based Strategy for Analysing the (Sub)Structural Similarities of Drugs and Endogenous Metabolites.

Authors

O'Hagan Steve, Kell Douglas B

Affiliations

School of Chemistry, The University of Manchester, Manchester, UK; The Manchester Institute of Biotechnology, The University of Manchester, Manchester, UK; Manchester Centre for Synthetic Biology of Fine and Speciality Chemicals, The University of Manchester, Manchester, UK.

Publication information

Front Pharmacol. 2016 Aug 22;7:266. doi: 10.3389/fphar.2016.00266. eCollection 2016.

DOI:10.3389/fphar.2016.00266

PMID:27597830

Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC4992690/

Abstract

BACKGROUND

Previous studies compared the molecular similarity of marketed drugs and endogenous human metabolites (endogenites), using a series of fingerprint-type encodings, variously ranked and clustered using the Tanimoto (Jaccard) similarity coefficient (TS). Because this gives equal weight to all parts of the encoding (and hence to different substructures in the molecule), it may not be optimal, since in many cases not all parts of the molecule will bind to their macromolecular targets. Unsupervised methods alone cannot uncover this. We here explore the kinds of differences that may be observed when the TS is replaced, in a manner more equivalent to semi-supervised learning, by variants of the asymmetric Tversky (TV) similarity, which includes α and β parameters.
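As a rough illustration of how the Tversky similarity generalizes the Tanimoto coefficient, the sketch below treats each fingerprint as a set of "on" bit indices and computes |A∩B| / (|A∩B| + α|A\B| + β|B\A|). The function name `tversky` and the toy bit sets are hypothetical and chosen only for illustration; they are not the authors' workflow or data. With α = β = 1 the expression reduces to Tanimoto, and with α = β = 0.5 it reduces to the Dice coefficient.

```python
def tversky(a: set, b: set, alpha: float, beta: float) -> float:
    """Tversky similarity between two sets of 'on' fingerprint bits.

    alpha weights features found only in `a`; beta weights features
    found only in `b`. Returns 0.0 when the denominator is zero.
    """
    common = len(a & b)   # bits set in both fingerprints
    only_a = len(a - b)   # bits set only in a
    only_b = len(b - a)   # bits set only in b
    denom = common + alpha * only_a + beta * only_b
    return common / denom if denom else 0.0


# Toy fingerprints (hypothetical): the "metabolite" is a pure
# substructure of the "drug" in bit terms.
drug = {1, 2, 3, 4, 5, 6, 7, 8}
metabolite = {1, 2, 3, 4}

tanimoto = tversky(drug, metabolite, 1.0, 1.0)        # symmetric, Tanimoto case
dice = tversky(drug, metabolite, 0.5, 0.5)            # symmetric, Dice case
ignore_drug_extras = tversky(drug, metabolite, 0.0, 1.0)        # drug-only bits not penalized
ignore_metabolite_extras = tversky(drug, metabolite, 1.0, 0.0)  # metabolite-only bits not penalized

print(tanimoto, dice, ignore_drug_extras, ignore_metabolite_extras)
```

Setting α to 0 removes the penalty for features present only in the first molecule, so a metabolite that is wholly contained in a drug scores as a perfect match even though its Tanimoto similarity is modest. This is the asymmetry that the abstract refers to.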

RESULTS

Dramatic differences are observed in (i) the drug-endogenite similarity heatmaps, (ii) the cumulative "greatest similarity" curves, and (iii) the fraction of drugs with a Tversky similarity to a metabolite exceeding a given value when the Tversky α and β parameters are varied from their Tanimoto values. The same is true when the sum of the α and β parameters is varied. A clear trend toward increased endogenite-likeness of marketed drugs is observed when α or β adopt values nearer the extremes of their range, and when their sum is smaller. The kinds of molecules exhibiting the greatest similarity to two interrogating drug molecules (chlorpromazine and clozapine) also vary in both nature and the values of their similarity as α and β are varied. The same is true for the converse, when drugs are interrogated with an endogenite. The fraction of drugs with a Tversky similarity to a molecule in a library exceeding a given value depends on the contents of that library, and α and β may be "tuned" accordingly, in a semi-supervised manner. At some values of α and β drug discovery library candidates or natural products can "look" much more like (i.e., have a numerical similarity much closer to) drugs than do even endogenites.
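The quantity described in (iii) can be sketched as follows, again under the assumption that fingerprints are represented as sets of bit indices. For each drug, the greatest Tversky similarity to any metabolite is taken, and the fraction of drugs whose best score exceeds a threshold is reported. The helper `fraction_above`, the threshold, and the toy data are hypothetical and are not taken from the paper.

```python
def tversky(a: set, b: set, alpha: float, beta: float) -> float:
    """Tversky similarity between two sets of 'on' fingerprint bits."""
    common = len(a & b)
    denom = common + alpha * len(a - b) + beta * len(b - a)
    return common / denom if denom else 0.0


def fraction_above(drugs, metabolites, alpha, beta, threshold):
    """Fraction of drugs whose greatest similarity to any metabolite
    exceeds `threshold`."""
    best = [
        max(tversky(d, m, alpha, beta) for m in metabolites)
        for d in drugs
    ]
    return sum(score > threshold for score in best) / len(best)


# Toy libraries (hypothetical bit sets, for illustration only).
drugs = [
    {1, 2, 3, 4, 5, 6},
    {7, 8, 9, 10},
    {1, 2, 11, 12, 13, 14, 15, 16},
]
metabolites = [
    {1, 2, 3},
    {7, 8, 9, 10, 11, 12},
]

frac_tanimoto = fraction_above(drugs, metabolites, 1.0, 1.0, 0.6)
frac_small_sum = fraction_above(drugs, metabolites, 0.1, 0.1, 0.6)

print(frac_tanimoto, frac_small_sum)
```

In this toy case, lowering both α and β (a smaller sum) raises the fraction of drugs above the threshold, which matches the direction of the trend reported in the abstract. The magnitudes here carry no meaning beyond the example.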

CONCLUSIONS

Overall, the Tversky similarity metrics provide a more useful range of examples of molecular similarity than does the simpler Tanimoto similarity, and help to draw attention to molecular similarities that would not be recognized if Tanimoto alone were used. Hence, the Tversky similarity metrics are likely to be of significant value in many general problems in cheminformatics.

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b220/4992690/4b574f8a23ae/fphar-07-00266-g0001.jpg

Similar articles

Analysis of drug-endogenous human metabolite similarities in terms of their maximum common substructures.

J Cheminform. 2017 Mar 9;9:18. doi: 10.1186/s13321-017-0198-y. eCollection 2017.

Understanding the foundations of the structural similarities between marketed drugs and endogenous human metabolites.

Front Pharmacol. 2015 May 13;6:105. doi: 10.3389/fphar.2015.00105. eCollection 2015.

Do not hesitate to use Tversky-and other hints for successful active analogue searches with feature count descriptors.

J Chem Inf Model. 2013 Jul 22;53(7):1543-62. doi: 10.1021/ci400106g. Epub 2013 Jun 13.

Mar Drugs. 2020 Nov 23;18(11):582. doi: 10.3390/md18110582.

The apparent permeabilities of Caco-2 cells to marketed drugs: magnitude, and independence from both biophysical properties and endogenite similarities.

PeerJ. 2015 Nov 17;3:e1405. doi: 10.7717/peerj.1405. eCollection 2015.

A 'rule of 0.5' for the metabolite-likeness of approved pharmaceutical drugs.

Metabolomics. 2015;11(2):323-339. doi: 10.1007/s11306-014-0733-z. Epub 2014 Sep 19.

Design of chemical space networks on the basis of Tversky similarity.

J Comput Aided Mol Des. 2016 Jan;30(1):1-12. doi: 10.1007/s10822-015-9891-y. Epub 2015 Dec 22.

Drug repositioning for enzyme modulator based on human metabolite-likeness.

BMC Bioinformatics. 2017 May 31;18(Suppl 7):226. doi: 10.1186/s12859-017-1637-5.

Development of a compound class-directed similarity coefficient that accounts for molecular complexity effects in fingerprint searching.

J Chem Inf Model. 2009 Jun;49(6):1369-76. doi: 10.1021/ci900108d.

Cited by

Evidence for the Role of the Mitochondrial ABC Transporter MDL1 in the Uptake of Clozapine and Related Molecules into the Yeast .

Pharmaceuticals (Basel). 2024 Jul 13;17(7):938. doi: 10.3390/ph17070938.

The Transporter-Mediated Cellular Uptake and Efflux of Pharmaceutical Drugs and Biotechnology Products: How and Why Phospholipid Bilayer Transport Is Negligible in Real Biomembranes.

Molecules. 2021 Sep 16;26(18):5629. doi: 10.3390/molecules26185629.

Shape-Restrained Modeling of Protein-Small-Molecule Complexes with High Ambiguity Driven DOCKing.

J Chem Inf Model. 2021 Sep 27;61(9):4807-4818. doi: 10.1021/acs.jcim.1c00796. Epub 2021 Aug 26.

Applications of Virtual Screening in Bioprospecting: Facts, Shifts, and Perspectives to Explore the Chemo-Structural Diversity of Natural Products.

Front Chem. 2021 Apr 29;9:662688. doi: 10.3389/fchem.2021.662688. eCollection 2021.

FragNet, a Contrastive Learning-Based Transformer Model for Clustering, Interpreting, Visualizing, and Navigating Chemical Space.

Molecules. 2021 Apr 3;26(7):2065. doi: 10.3390/molecules26072065.

A palette of fluorophores that are differentially accumulated by wild-type and mutant strains of : surrogate ligands for profiling bacterial membrane transporters.

Microbiology (Reading). 2021 Feb;167(2). doi: 10.1099/mic.0.001016.

Mar Drugs. 2020 Nov 23;18(11):582. doi: 10.3390/md18110582.

VAE-Sim: A Novel Molecular Similarity Measure Based on a Variational Autoencoder.

Molecules. 2020 Jul 29;25(15):3446. doi: 10.3390/molecules25153446.

The biology of ergothioneine, an antioxidant nutraceutical.

Nutr Res Rev. 2020 Dec;33(2):190-217. doi: 10.1017/S0954422419000301. Epub 2020 Feb 13.

Generation of a Small Library of Natural Products Designed to Cover Chemical Space Inexpensively.

Pharm Front. 2019;1(1):e190005. doi: 10.20900/pf20190005. Epub 2019 Aug 9.
