Chen Wenbin, Hendrix William, Samatova Nagiza F
1 Department of Computer Science, Guangzhou University , Guangzhou, China .
2 Shanghai Key Laboratory of Intelligent Information Processing, Fudan University , Shanghai, China .
J Comput Biol. 2017 Dec;24(12):1195-1211. doi: 10.1089/cmb.2017.0064. Epub 2017 Sep 11.
The problem of aligning multiple metabolic pathways is one of very challenging problems in computational biology. A metabolic pathway consists of three types of entities: reactions, compounds, and enzymes. Based on similarities between enzymes, Tohsato et al. gave an algorithm for aligning multiple metabolic pathways. However, the algorithm given by Tohsato et al. neglects the similarities among reactions, compounds, enzymes, and pathway topology. How to design algorithms for the alignment problem of multiple metabolic pathways based on the similarity of reactions, compounds, and enzymes? It is a difficult computational problem. In this article, we propose an algorithm for the problem of aligning multiple metabolic pathways based on the similarities among reactions, compounds, enzymes, and pathway topology. First, we compute a weight between each pair of like entities in different input pathways based on the entities' similarity score and topological structure using Ay et al.'s methods. We then construct a weighted k-partite graph for the reactions, compounds, and enzymes. We extract a mapping between these entities by solving the maximum-weighted k-partite matching problem by applying a novel heuristic algorithm. By analyzing the alignment results of multiple pathways in different organisms, we show that the alignments found by our algorithm correctly identify common subnetworks among multiple pathways.
比对多条代谢途径的问题是计算生物学中极具挑战性的问题之一。一条代谢途径由三种类型的实体组成:反应、化合物和酶。基于酶之间的相似性,户里里等人给出了一种比对多条代谢途径的算法。然而,户里里等人给出的算法忽略了反应、化合物、酶以及途径拓扑结构之间的相似性。如何基于反应、化合物和酶的相似性来设计针对多条代谢途径比对问题的算法呢?这是一个困难的计算问题。在本文中,我们提出了一种基于反应、化合物、酶以及途径拓扑结构之间的相似性来比对多条代谢途径问题的算法。首先,我们使用阿伊等人的方法,基于实体的相似性得分和拓扑结构,计算不同输入途径中每对同类实体之间的权重。然后,我们为反应、化合物和酶构建一个加权k部图。通过应用一种新颖的启发式算法解决最大加权k部匹配问题,我们提取这些实体之间的一个映射。通过分析不同生物体中多条途径的比对结果,我们表明我们的算法所找到的比对能够正确识别多条途径之间的共同子网。