Suzuki Masaki, Nagamochi Hiroshi, Akutsu Tatsuya
Department of Applied Mathematics and Physics, Graduate School of Informatics, Kyoto University Yoshida, 606-8501 Kyoto, Japan.
Bioinformatics Center, Institute for Chemical Research, Kyoto University, Gokasho, Uji, 611-0011 Kyoto, Japan.
J Cheminform. 2014 May 30;6:31. doi: 10.1186/1758-2946-6-31. eCollection 2014.
The enumeration of chemical graphs (molecular graphs) satisfying given constraints is one of the fundamental problems in chemoinformatics and bioinformatics because it leads to a variety of useful applications including structure determination and development of novel chemical compounds.
We consider the problem of enumerating chemical graphs with monocyclic structure (a graph structure that contains exactly one cycle) from a given set of feature vectors, where a feature vector represents the frequency of the prescribed paths in a chemical compound to be constructed and the set is specified by a pair of upper and lower feature vectors. To enumerate all tree-like (acyclic) chemical graphs from a given set of feature vectors, Shimizu et al. and Suzuki et al. proposed efficient branch-and-bound algorithms based on a fast tree enumeration algorithm. In this study, we devise a novel method for extending these algorithms to enumeration of chemical graphs with monocyclic structure by designing a fast algorithm for testing uniqueness. The results of computational experiments reveal that the computational efficiency of the new algorithm is as good as those for enumeration of tree-like chemical compounds.
We succeed in expanding the class of chemical graphs that are able to be enumerated efficiently.
枚举满足给定约束条件的化学图(分子图)是化学信息学和生物信息学中的基本问题之一,因为它会带来包括结构确定和新型化合物开发在内的各种有用应用。
我们考虑从给定的一组特征向量中枚举具有单环结构(恰好包含一个环的图结构)的化学图的问题,其中特征向量表示要构建的化合物中规定路径的频率,并且该集合由一对上下特征向量指定。为了从给定的一组特征向量中枚举所有树状(无环)化学图,Shimizu等人和Suzuki等人基于快速树枚举算法提出了高效的分支定界算法。在本研究中,我们通过设计一种用于测试唯一性的快速算法,设计了一种将这些算法扩展到枚举具有单环结构的化学图的新方法。计算实验结果表明,新算法的计算效率与枚举树状化合物的算法相当。
我们成功地扩展了能够有效枚举的化学图的类别。