• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

MultiMCS:一种用于多个分子最大公共子结构问题的快速算法。

MultiMCS: a fast algorithm for the maximum common substructure problem on multiple molecules.

机构信息

Strand Life Sciences, Fifth Floor, Kirloskar Business Park, Bellary Road, Hebbal, Bangalore 560024, India.

出版信息

J Chem Inf Model. 2011 Apr 25;51(4):788-806. doi: 10.1021/ci100297y. Epub 2011 Mar 29.

DOI:10.1021/ci100297y
PMID:21446748
Abstract

Several efficient correspondence graph-based algorithms for determining the maximum common substructure (MCS) of a pair of molecules have been published in the literature. The extension of the problem to three or more molecules is however nontrivial; heuristics used to increase the efficiency in the two-molecule case are either inapplicable to the many-molecule case or do not provide significant speedups. Our specific algorithmic contribution is two-fold. First, we show how the correspondence graph approach for the two-molecule case can be generalized to obtain an algorithm that is guaranteed to find the optimum connected MCS of multiple molecules, and that runs fast on most families of molecules using a new divide-and-conquer strategy that has hitherto not been reported in this context. Second, we provide a characterization of those compound families for which the algorithm might run slowly, along with a heuristic for speeding up computations on these families. We also extend the above algorithm to a heuristic algorithm to find the disconnected MCS of multiple molecules and to an algorithm for clustering molecules into groups, with each group sharing a substantial MCS. Our methods are flexible in that they provide exquisite control on various matching criteria used to define a common substructure.

摘要

已经有一些基于有效配对比对图的算法被发表出来,用于确定一对分子的最大公共子结构(MCS)。然而,将问题扩展到三个或更多分子并不简单;在二分子情况下用于提高效率的启发式方法要么不适用于多分子情况,要么不能提供显著的加速。我们的具体算法贡献有两方面。首先,我们展示了如何将二分子情况下的配对比对图方法推广,以获得一种保证能够找到多个分子最优连接 MCS 的算法,并且使用一种新的分治策略在大多数分子家族上快速运行,而这种策略在以前的相关研究中并未被报道。其次,我们对算法可能运行缓慢的化合物家族进行了特征描述,并提供了一种启发式方法来加速这些家族的计算。我们还将上述算法扩展到一个启发式算法,用于寻找多个分子的不连接 MCS,以及一个用于将分子聚类成具有共享大量 MCS 的组的算法。我们的方法具有灵活性,它们提供了对用于定义公共子结构的各种匹配标准的精确控制。

相似文献

1
MultiMCS: a fast algorithm for the maximum common substructure problem on multiple molecules.MultiMCS:一种用于多个分子最大公共子结构问题的快速算法。
J Chem Inf Model. 2011 Apr 25;51(4):788-806. doi: 10.1021/ci100297y. Epub 2011 Mar 29.
2
Build-up algorithm for atomic correspondence between chemical structures.化学结构间原子对应关系的构建算法。
J Chem Inf Model. 2011 Aug 22;51(8):1775-87. doi: 10.1021/ci2001023. Epub 2011 Jul 18.
3
On the impact of dissimilarity measure in k-modes clustering algorithm.关于差异度量在k-模式聚类算法中的影响。
IEEE Trans Pattern Anal Mach Intell. 2007 Mar;29(3):503-7. doi: 10.1109/TPAMI.2007.53.
4
Graphical models and point pattern matching.图形模型与点模式匹配。
IEEE Trans Pattern Anal Mach Intell. 2006 Oct;28(10):1646-63. doi: 10.1109/TPAMI.2006.207.
5
LEGClust- a clustering algorithm based on layered entropic subgraphs.LEGClust——一种基于分层熵子图的聚类算法。
IEEE Trans Pattern Anal Mach Intell. 2008 Jan;30(1):62-75. doi: 10.1109/TPAMI.2007.1142.
6
Dynamic graph cuts for efficient inference in Markov Random Fields.用于马尔可夫随机场高效推理的动态图割
IEEE Trans Pattern Anal Mach Intell. 2007 Dec;29(12):2079-88. doi: 10.1109/TPAMI.2007.1128.
7
Representing clusters using a maximum common edge substructure algorithm applied to reduced graphs and molecular graphs.使用应用于简化图和分子图的最大公共边子结构算法来表示簇。
J Chem Inf Model. 2007 Mar-Apr;47(2):354-66. doi: 10.1021/ci600444g. Epub 2007 Feb 20.
8
Weighted graph cuts without eigenvectors a multilevel approach.无需特征向量的加权图割:一种多级方法。
IEEE Trans Pattern Anal Mach Intell. 2007 Nov;29(11):1944-57. doi: 10.1109/TPAMI.2007.1115.
9
Learning graph matching.学习图匹配。
IEEE Trans Pattern Anal Mach Intell. 2009 Jun;31(6):1048-58. doi: 10.1109/TPAMI.2009.28.
10
Generalizing Swendsen-Wang to sampling arbitrary posterior probabilities.将斯文森-王算法推广到对任意后验概率进行采样。
IEEE Trans Pattern Anal Mach Intell. 2005 Aug;27(8):1239-53. doi: 10.1109/TPAMI.2005.161.

引用本文的文献

1
"Molecular Anatomy": a new multi-dimensional hierarchical scaffold analysis tool.“分子解剖学”:一种新型的多维分层支架分析工具。
J Cheminform. 2021 Jul 23;13:54. doi: 10.1186/s13321-021-00526-y. eCollection 2021.
2
Analysis of drug-endogenous human metabolite similarities in terms of their maximum common substructures.基于最大公共子结构分析药物与人内源性代谢物的相似性。
J Cheminform. 2017 Mar 9;9:18. doi: 10.1186/s13321-017-0198-y. eCollection 2017.
3
Discovery of novel polyamine analogs with anti-protozoal activity by computer guided drug repositioning.
通过计算机辅助药物重新定位发现具有抗原生动物活性的新型多胺类似物。
J Comput Aided Mol Des. 2016 Apr;30(4):305-21. doi: 10.1007/s10822-016-9903-6. Epub 2016 Feb 18.
4
Large-Scale Computational Screening Identifies First in Class Multitarget Inhibitor of EGFR Kinase and BRD4.大规模计算筛选鉴定出首个EGFR激酶和BRD4的多靶点抑制剂。
Sci Rep. 2015 Nov 24;5:16924. doi: 10.1038/srep16924.
5
Identification of levothyroxine antichagasic activity through computer-aided drug repurposing.通过计算机辅助药物重新利用鉴定左甲状腺素的抗恰加斯病活性。
ScientificWorldJournal. 2014 Jan 30;2014:279618. doi: 10.1155/2014/279618. eCollection 2014.
6
Development of conformation independent computational models for the early recognition of breast cancer resistance protein substrates.开发构象非依赖型计算模型以早期识别乳腺癌耐药蛋白底物。
Biomed Res Int. 2013;2013:863592. doi: 10.1155/2013/863592. Epub 2013 Aug 1.
7
The CARLSBAD database: a confederated database of chemical bioactivities.CARLSBAD 数据库:一个化学生物活性的联合数据库。
Database (Oxford). 2013 Jun 21;2013:bat044. doi: 10.1093/database/bat044. Print 2013.