Differential Consistency Analysis: Which Similarity Measures can be Applied in Drug Discovery?

Affiliations

Department of Chemistry, University of Florida, Gainesville, FL 32603, USA.

Medicinal Chemistry Research Group, Research Centre for Natural Sciences, Magyar tudósok krt. 2, 1117, Budapest, Hungary.

Publication information

Mol Inform. 2021 Jul;40(7):e2060017. doi: 10.1002/minf.202060017. Epub 2021 Apr 23.

DOI:10.1002/minf.202060017
PMID:33891369
Abstract

Similarity measures are widely used in various areas from taxonomy to cheminformatics. To this end, a large number of similarity and distance measures (or, collectively, comparative measures) have been introduced, with only a few studies directed at revealing their inner relationships. We present a thorough analytical study of the conditions leading to two comparative measures providing equivalent results over a given set of molecules. A key part of this work is the introduction of a novel way to study the consistency between comparative measures: the differential consistency analysis (DCA). This tool reveals how the consistency can be established in an analytical way with minimal (or no) assumptions. We found that the consensus between the Tanimoto and Cosine coefficients improved by choosing a reference whose similarity to the rest of the molecules varies less, or by representing the molecules in a way that does not depend strongly on their size (i.e., bit frequency in the chosen fingerprint representation). The presented derivations are just some generic examples; DCA can be applied widely and for all binary similarity coefficients introduced so far, independently of the molecular representations.
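As a rough numerical illustration of what "consistency" between two comparative measures means, the sketch below checks how often the Tanimoto and Cosine coefficients order pairs of molecules the same way relative to a reference. This is not the analytical DCA derivation from the paper; it is only an empirical check under stated assumptions. The fingerprints are hypothetical toy sets of on-bit positions, and the helper name `consistency` and its pairwise-agreement definition (ties counted as agreement) are assumptions made for this example.

```python
from itertools import combinations


def tanimoto(a: set, b: set) -> float:
    """Tanimoto coefficient for two fingerprints given as sets of on-bit positions."""
    common = len(a & b)
    denom = len(a) + len(b) - common
    return common / denom if denom else 0.0


def cosine(a: set, b: set) -> float:
    """Cosine coefficient for two binary fingerprints given as sets of on-bit positions."""
    common = len(a & b)
    denom = (len(a) * len(b)) ** 0.5
    return common / denom if denom else 0.0


def consistency(reference: set, molecules: list, f, g) -> float:
    """Fraction of molecule pairs that f and g rank in the same order
    relative to the reference (hypothetical helper; ties count as agreement)."""
    scores = [(f(reference, m), g(reference, m)) for m in molecules]
    pairs = list(combinations(scores, 2))
    agree = sum(1 for (f1, g1), (f2, g2) in pairs if (f1 - f2) * (g1 - g2) >= 0)
    return agree / len(pairs)


# Hypothetical toy fingerprints (sets of on-bit indices).
reference = {0, 1, 2, 3}

# All molecules have the same number of on bits: both coefficients depend
# only on the number of shared bits, so the rankings coincide.
equal_size = [{0, 1, 2, 4}, {0, 1, 4, 5}, {0, 4, 5, 6}, {4, 5, 6, 7}]

# Molecules with very different numbers of on bits: the two coefficients
# disagree on the ordering of the first two molecules.
varied_size = [{0}, {0, 1, 4, 5, 6}, {0, 1, 2, 3}]

print(consistency(reference, equal_size, tanimoto, cosine))   # 1.0
print(consistency(reference, varied_size, tanimoto, cosine))  # 2 of 3 pairs agree
```

In the equal-size set, the two coefficients are both monotonic in the number of shared bits, which matches the abstract's remark that consensus improves when the representation does not depend strongly on molecular size. In the varied-size set, the single-bit molecule scores higher than the five-bit molecule under Cosine but lower under Tanimoto.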


Similar articles

1
Differential Consistency Analysis: Which Similarity Measures can be Applied in Drug Discovery?
Mol Inform. 2021 Jul;40(7):e2060017. doi: 10.1002/minf.202060017. Epub 2021 Apr 23.
2
Extended similarity indices: the benefits of comparing more than two objects simultaneously. Part 2: speed, consistency, diversity selection.
J Cheminform. 2021 Apr 23;13(1):33. doi: 10.1186/s13321-021-00504-4.
3
Why is Tanimoto index an appropriate choice for fingerprint-based similarity calculations?
J Cheminform. 2015 May 20;7:20. doi: 10.1186/s13321-015-0069-3. eCollection 2015.
4
Life beyond the Tanimoto coefficient: similarity measures for interaction fingerprints.
J Cheminform. 2018 Oct 4;10(1):48. doi: 10.1186/s13321-018-0302-y.
5
ccbmlib - a Python package for modeling Tanimoto similarity value distributions.
F1000Res. 2020 Feb 10;9. doi: 10.12688/f1000research.22292.2. eCollection 2020.
6
Comparing structural fingerprints using a literature-based similarity benchmark.
J Cheminform. 2016 Jul 5;8:36. doi: 10.1186/s13321-016-0148-0. eCollection 2016.
7
Extended similarity indices: the benefits of comparing more than two objects simultaneously. Part 1: Theory and characteristics.
J Cheminform. 2021 Apr 23;13(1):32. doi: 10.1186/s13321-021-00505-3.
8
Development of a compound class-directed similarity coefficient that accounts for molecular complexity effects in fingerprint searching.
J Chem Inf Model. 2009 Jun;49(6):1369-76. doi: 10.1021/ci900108d.
9
Distance phenomena in high-dimensional chemical descriptor spaces: consequences for similarity-based approaches.
J Comput Chem. 2009 Nov 15;30(14):2285-96. doi: 10.1002/jcc.21218.
10
Modeling Tanimoto Similarity Value Distributions and Predicting Search Results.
Mol Inform. 2017 Jul;36(7). doi: 10.1002/minf.201600131. Epub 2016 Dec 29.

Cited by

1
Molecular similarity: Theory, applications, and perspectives.
Artif Intell Chem. 2024 Dec;2(2). doi: 10.1016/j.aichem.2024.100077. Epub 2024 Aug 31.
2
Is Tanimoto a metric?
bioRxiv. 2025 Feb 23:2025.02.18.638904. doi: 10.1101/2025.02.18.638904.
3
In Silico Evaluation of Some Computer-Designed Fluoroquinolone-Glutamic Acid Hybrids as Potential Topoisomerase II Inhibitors with Anti-Cancer Effect.
Pharmaceuticals (Basel). 2024 Nov 26;17(12):1593. doi: 10.3390/ph17121593.
4
Alternative weighting schemes for fine-tuned extended similarity indices.
J Chemom. 2024 Sep;38(9). doi: 10.1002/cem.3558. Epub 2024 May 11.
5
Peptide hemolytic activity analysis using visual data mining of similarity-based complex networks.
NPJ Syst Biol Appl. 2024 Oct 4;10(1):115. doi: 10.1038/s41540-024-00429-2.
6
Pharmacophore-Based Study: An In Silico Perspective for the Identification of Potential New Delhi Metallo-β-lactamase-1 (NDM-1) Inhibitors.
Pharmaceuticals (Basel). 2024 Sep 9;17(9):1183. doi: 10.3390/ph17091183.
7
Protein Retrieval via Integrative Molecular Ensembles (PRIME) through Extended Similarity Indices.
J Chem Theory Comput. 2024 Jul 23;20(14):6303-6315. doi: 10.1021/acs.jctc.4c00362. Epub 2024 Jul 8.
8
Comparative study of the mechanism of natural compounds with similar structures using docking and transcriptome data for improving in silico herbal medicine experimentations.
Brief Bioinform. 2023 Sep 22;24(6). doi: 10.1093/bib/bbad344.
9
Molecular Dynamics Simulations and Diversity Selection by Extended Continuous Similarity Indices.
J Chem Inf Model. 2022 Jul 25;62(14):3415-3425. doi: 10.1021/acs.jcim.2c00433. Epub 2022 Jul 14.
10
Icotinib, Almonertinib, and Olmutinib: A 2D Similarity/Docking-Based Study to Predict the Potential Binding Modes and Interactions into EGFR.
Molecules. 2021 Oct 24;26(21):6423. doi: 10.3390/molecules26216423.