Suppr超能文献

用于枚举具有给定路径频率的树状化学图的改进算法。

Improved algorithms for enumerating tree-like chemical graphs with given path frequency.

作者信息

Ishida Yusuke, Zhao Liang, Nagamochi Hiroshi, Akutsu Tatsuya

机构信息

Department of Applied Mathematics and Physics, Graduate School of Informatics, Kyoto University, Yoshida, Kyoto, Japan.

出版信息

Genome Inform. 2008;21:53-64.

Abstract

This paper considers the problem of enumerating all non-isomorphic tree-like chemical graphs with given path frequency, where "tree-like" means that the graph can be viewed as a tree if multiple edges (i.e., edges with the same end points) and a benzene ring are treated as one edge and one vertex, respectively, and "path frequency" is a vector of the numbers of specified vertex-labeled paths that must be realized in every output. This and related problems have several potential applications such as classification of chemical compounds, structure determination using mass-spectrum and/or NMR and design of novel chemical compounds. For this problem, several studies have been done. Recently, Fujiwara et al. (2008) showed two formulations and for each of them, they gave a branch-and-bound algorithm, which combined efficient enumeration of non-isomorphic trees with bounding operations based on the path frequency and the atom-atom bonds to avoid the generation of invalid trees. In this paper, based on their work and a result of Nagamochi (2006), we introduce two new bounding operations, the detachment-cut and the H-cut, to further reduce the size of the search space. We performed computational experiments to compare our proposed algorithms with those of Fujiwara et al. (2008) using some chemical compound data obtained from the KEGG LIGAND database (http://www.genome.jp/kegg/ligand.html). The results show that our proposed algorithms are much faster than their algorithms.

摘要

本文考虑了枚举所有具有给定路径频率的非同构树状化学图的问题,其中“树状”意味着如果将多重边(即具有相同端点的边)和苯环分别视为一条边和一个顶点,那么该图可被视为一棵树,并且“路径频率”是一个向量,其表示在每个输出中必须实现的指定顶点标记路径的数量。此问题及相关问题有若干潜在应用,如化合物分类、利用质谱和/或核磁共振进行结构测定以及新型化合物设计。针对这个问题,已经开展了多项研究。最近,藤原等人(2008年)给出了两种公式表述,并且针对每种表述都给出了一种分支定界算法,该算法将非同构树的高效枚举与基于路径频率和原子间键的定界操作相结合,以避免生成无效树。在本文中,基于他们的工作以及长内(2006年)的一个结果,我们引入了两种新的定界操作,即分离切割和H切割,以进一步减小搜索空间的大小。我们使用从KEGG配体数据库(http://www.genome.jp/kegg/ligand.html)获取的一些化合物数据进行了计算实验,将我们提出的算法与藤原等人(2008年)的算法进行比较。结果表明,我们提出的算法比他们的算法快得多。

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验