Graduate School of Informatics, Kyoto University, Yoshida, Kyoto 606-8501, Japan.
BMC Bioinformatics. 2011 Dec 14;12 Suppl 14(Suppl 14):S3. doi: 10.1186/1471-2105-12-S14-S3.
Enumeration of chemical graphs satisfying given constraints is one of the fundamental problems in chemoinformatics and bioinformatics since it leads to a variety of useful applications including structure determination of novel chemical compounds and drug design.
In this paper, we consider the problem of enumerating all tree-like chemical graphs from a given set of feature vectors, which is specified by a pair of upper and lower feature vectors, where a feature vector represents the frequency of prescribed paths in a chemical compound to be constructed. This problem can be solved by applying the algorithm proposed by Ishida et al. to each single feature vector in the given set, but this method may take much computation time because in general there are many feature vectors in a given set. We propose a new exact branch-and-bound algorithm for the problem so that all the feature vectors in a given set are handled directly. Since we cannot use the bounding operation proposed by Ishida et al. due to upper and lower constraints, we introduce new bounding operations based on upper and lower feature vectors, a bond constraint, and a detachment condition.
Our proposed algorithm is useful for enumerating tree-like chemical graphs with given upper and lower bounds on path frequencies.
枚举满足给定约束的化学图是化学生物信息学和生物信息学中的基本问题之一,因为它可以应用于各种有用的领域,包括新型化合物的结构确定和药物设计。
在本文中,我们考虑了从给定的特征向量集合中枚举所有树状化学图的问题,该集合由一对上限和下限特征向量指定,其中特征向量表示要构建的化学化合物中规定路径的频率。可以通过将 Ishida 等人提出的算法应用于给定集合中的每个单个特征向量来解决此问题,但由于给定集合中通常有许多特征向量,因此此方法可能需要花费很多计算时间。我们提出了一种用于该问题的新的精确分支定界算法,以便直接处理给定集合中的所有特征向量。由于我们由于上限和下限约束而无法使用 Ishida 等人提出的绑定操作,因此我们基于上限和下限特征向量、键约束和分离条件引入了新的绑定操作。
我们提出的算法对于枚举具有给定路径频率上限和下限的树状化学图很有用。