• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

特征树:一种基于树匹配的新分子相似性度量方法。

Feature trees: a new molecular similarity measure based on tree matching.

作者信息

Rarey M, Dixon J S

机构信息

German National Research Center for Information Technology (GMD), Institute for Algorithms and Scientific Computing (SCAI), Sankt Augustin, Germany.

出版信息

J Comput Aided Mol Des. 1998 Sep;12(5):471-90. doi: 10.1023/a:1008068904628.

DOI:10.1023/a:1008068904628
PMID:9834908
Abstract

In this paper we present a new method for evaluating molecular similarity between small organic compounds. Instead of a linear representation like fingerprints, a more complex description, a feature tree, is calculated for a molecule. A feature tree represents hydrophobic fragments and functional groups of the molecule and the way these groups are linked together. Each node in the tree is labeled with a set of features representing chemical properties of the part of the molecule corresponding to the node. The comparison of feature trees is based on matching subtrees of two feature trees onto each other. Two algorithms for tackling the matching problem are described throughout this paper. On a dataset of about 1000 molecules, we demonstrate the ability of our approach to identify molecules belonging to the same class of inhibitors. With a second dataset of 58 molecules with known binding modes taken from the Brookhaven Protein Data Bank, we show that the matchings produced by our algorithms are compatible with the relative orientation of the molecules in the active site in 61% of the test cases. The average computation time for a pair comparison is about 50 ms on a current workstation.

摘要

在本文中,我们提出了一种评估小有机化合物之间分子相似性的新方法。与指纹等线性表示方式不同,我们为分子计算一种更复杂的描述——特征树。特征树表示分子的疏水片段和官能团以及这些基团连接在一起的方式。树中的每个节点都用一组特征进行标记,这些特征代表与该节点对应的分子部分的化学性质。特征树的比较基于将两个特征树的子树相互匹配。本文描述了两种解决匹配问题的算法。在一个约1000个分子的数据集上,我们展示了我们的方法识别属于同一类抑制剂的分子的能力。使用从布鲁克海文蛋白质数据库获取的58个具有已知结合模式的分子的第二个数据集,我们表明在61%的测试案例中,我们算法产生的匹配与活性位点中分子的相对取向兼容。在当前工作站上,一对比较的平均计算时间约为50毫秒。

相似文献

1
Feature trees: a new molecular similarity measure based on tree matching.特征树:一种基于树匹配的新分子相似性度量方法。
J Comput Aided Mol Des. 1998 Sep;12(5):471-90. doi: 10.1023/a:1008068904628.
2
SwiFT: an index structure for reduced graph descriptors in virtual screening and clustering.SwiFT:一种用于虚拟筛选和聚类中简化图形描述符的索引结构。
J Chem Inf Model. 2007 Jul-Aug;47(4):1341-53. doi: 10.1021/ci700007b. Epub 2007 Jun 14.
3
Similarity metrics for ligands reflecting the similarity of the target proteins.反映靶蛋白相似性的配体相似性度量。
J Chem Inf Comput Sci. 2003 Mar-Apr;43(2):391-405. doi: 10.1021/ci025569t.
4
Engineering Aspects of Olfaction嗅觉的工程学方面
5
On the quality of tree-based protein classification.论基于树的蛋白质分类的质量。
Bioinformatics. 2005 May 1;21(9):1876-90. doi: 10.1093/bioinformatics/bti244. Epub 2005 Jan 12.
6
Multiple-ligand-based virtual screening: methods and applications of the MTree approach.基于多配体的虚拟筛选:MTree方法的原理与应用
J Med Chem. 2005 Oct 20;48(21):6575-84. doi: 10.1021/jm050078w.
7
Docking of hydrophobic ligands with interaction-based matching algorithms.使用基于相互作用的匹配算法对接疏水配体。
Bioinformatics. 1999 Mar;15(3):243-50. doi: 10.1093/bioinformatics/15.3.243.
8
SimG: an alignment based method for evaluating the similarity of small molecules and binding sites.SimG:一种基于对齐的方法,用于评估小分子和结合位点的相似性。
J Chem Inf Model. 2013 Aug 26;53(8):2103-15. doi: 10.1021/ci400139j. Epub 2013 Aug 9.
9
Noncontiguous atom matching structural similarity function.非连续原子匹配结构相似度函数。
J Chem Inf Model. 2013 Oct 28;53(10):2511-24. doi: 10.1021/ci400324u. Epub 2013 Oct 8.
10
Improving similarity-driven library design: customized matching and regioselective feature trees.改进基于相似度的库设计:定制匹配和区域选择性特征树。
J Chem Inf Model. 2011 Sep 26;51(9):2156-63. doi: 10.1021/ci200014g. Epub 2011 Aug 18.

引用本文的文献

1
Allosteric targeting of RIPK1: discovery of novel inhibitors parallel virtual screening and structure-guided optimization.RIPK1的变构靶向:新型抑制剂的发现、平行虚拟筛选及结构导向优化
RSC Med Chem. 2025 Sep 17. doi: 10.1039/d5md00317b.
2
A Benchmark Set of Bioactive Molecules for Diversity Analysis of Compound Libraries and Combinatorial Chemical Spaces.用于化合物库和组合化学空间多样性分析的生物活性分子基准集。
J Chem Inf Model. 2025 Sep 8;65(17):9097-9124. doi: 10.1021/acs.jcim.5c00719. Epub 2025 Aug 20.
3
CoLiNN: A Tool for Fast Chemical Space Visualization of Combinatorial Libraries Without Enumeration.

本文引用的文献

1
Time-efficient flexible superposition of medium-sized molecules.中等大小分子的高效灵活叠加
J Comput Aided Mol Des. 1997 Jul;11(4):357-68. doi: 10.1023/a:1007959729800.
2
A genetic algorithm for flexible molecular overlay and pharmacophore elucidation.一种用于柔性分子叠合和药效团阐释的遗传算法。
J Comput Aided Mol Des. 1995 Dec;9(6):532-49. doi: 10.1007/BF00124324.
3
A fast flexible docking method using an incremental construction algorithm.一种使用增量构建算法的快速灵活对接方法。
CoLiNN:一种无需枚举即可快速可视化组合库化学空间的工具。
Mol Inform. 2025 Mar;44(3):e202400263. doi: 10.1002/minf.202400263.
4
An Open-Source Implementation of the Scaffold Identification and Naming System (SCINS) and Example Applications.开源的支架识别和命名系统(SCINS)实现及应用示例。
J Chem Inf Model. 2024 Oct 28;64(20):7905-7916. doi: 10.1021/acs.jcim.4c01314. Epub 2024 Oct 15.
5
Amine-containing donepezil analogues as potent acetylcholinesterase inhibitors with increased polarity.含胺多奈哌齐类似物作为具有更高极性的强效乙酰胆碱酯酶抑制剂。
RSC Med Chem. 2024 Apr 12;15(6):2037-2044. doi: 10.1039/d3md00635b. eCollection 2024 Jun 19.
6
t-SMILES: a fragment-based molecular representation framework for de novo ligand design.t-SMILES:一种用于从头设计配体的基于片段的分子表示框架。
Nat Commun. 2024 Jun 11;15(1):4993. doi: 10.1038/s41467-024-49388-6.
7
A Multi-view Molecular Pre-training with Generative Contrastive Learning.多视图分子预训练与生成对比学习。
Interdiscip Sci. 2024 Sep;16(3):741-754. doi: 10.1007/s12539-024-00632-z. Epub 2024 May 6.
8
Advances in virtual screening.虚拟筛选的进展。
Drug Discov Today Technol. 2006 Winter;3(4):405-411. doi: 10.1016/j.ddtec.2006.12.002. Epub 2007 Jan 12.
9
AI is a viable alternative to high throughput screening: a 318-target study.人工智能是高通量筛选的可行替代方案:一项 318 靶点研究。
Sci Rep. 2024 Apr 2;14(1):7526. doi: 10.1038/s41598-024-54655-z.
10
Artificial intelligence-powered discovery of small molecules inhibiting CTLA-4 in cancer.利用人工智能发现抑制癌症中CTLA-4的小分子
BJC Rep. 2024;2. doi: 10.1038/s44276-023-00035-5. Epub 2024 Jan 23.
J Mol Biol. 1996 Aug 23;261(3):470-89. doi: 10.1006/jmbi.1996.0477.
4
Molecular similarity based on DOCK-generated fingerprints.
J Med Chem. 1996 Aug 16;39(17):3401-8. doi: 10.1021/jm950800y.
5
Searching for pharmacophoric patterns in databases of three-dimensional chemical structures.在三维化学结构数据库中搜索药效团模式。
J Mol Recognit. 1995 Sep-Oct;8(5):290-303. doi: 10.1002/jmr.300080503.
6
New method for rapid characterization of molecular shapes: applications in drug design.
J Chem Inf Comput Sci. 1993 Jan-Feb;33(1):79-85. doi: 10.1021/ci00011a012.
7
New molecular shape descriptors: application in database screening.
J Comput Aided Mol Des. 1995 Feb;9(1):1-12. doi: 10.1007/BF00117274.
8
Different approaches toward an automatic structural alignment of drug molecules: applications to sterol mimics, thrombin and thermolysin inhibitors.药物分子自动结构比对的不同方法:在甾醇模拟物、凝血酶和嗜热菌蛋白酶抑制剂中的应用
J Comput Aided Mol Des. 1994 Dec;8(6):751-78. doi: 10.1007/BF00124019.
9
A fast and efficient method for 2D and 3D molecular shape description.
J Comput Aided Mol Des. 1992 Dec;6(6):607-28. doi: 10.1007/BF00126218.
10
The Protein Data Bank: a computer-based archival file for macromolecular structures.蛋白质数据库:一个基于计算机的大分子结构存档文件。
J Mol Biol. 1977 May 25;112(3):535-42. doi: 10.1016/s0022-2836(77)80200-3.