Lin Guohui, Tegos Theodore, Chen Zhi-Zhong
Bioinformatics Research Group, Department of Computing Science, University of Alberta, Edmonton, Alberta T6G 2E8, Canada.
J Bioinform Comput Biol. 2005 Dec;3(6):1331-50. doi: 10.1142/s0219720005001570.
The constrained bipartite matching (CBM) problem is a variant of the classical bipartite matching problem that has been well studied in the Combinatorial Optimization community. The input to CBM is an edge-weighted complete bipartite graph in which there are a same number of vertices on both sides and vertices on one side are sequentially ordered while vertices on the other side are partitioned and connected into disjoint directed paths. In a feasible matching, a path must be mapped to consecutive vertices on the other side. The optimization goal is to find a maximum or a minimum weight perfect matching. Such an optimization problem has its applications to scheduling and protein Nuclear Magnetic Resonance peak assignment. It has been shown to be NP-hard and MAX SNP-hard if the perfectness requirement is dropped. In this paper, more results on the inapproximability are presented and IDA*, a memory efficient variant of the well known A* search algorithm, is utilized to solve the problem. Accordingly, search heuristics and a set of heuristic evaluation functions are developed to assist the search, whose effectiveness is demonstrated by a simulation study using real protein NMR backbone resonance assignment instances.
约束二分匹配(Constrained Bipartite Matching,CBM)问题是经典二分匹配问题的一个变体,组合优化领域对此已有深入研究。CBM的输入是一个边带权的完全二分图,其两侧顶点数量相同,一侧顶点按顺序排列,另一侧顶点被划分并连接成不相交的有向路径。在可行匹配中,一条路径必须映射到另一侧的连续顶点上。优化目标是找到一个最大或最小权重完美匹配。这样一个优化问题在调度和蛋白质核磁共振峰分配中有应用。如果去掉完美性要求,已证明该问题是NP难和MAX SNP难的。本文给出了更多关于不可近似性的结果,并利用IDA*(著名的A*搜索算法的一个内存高效变体)来解决该问题。相应地,开发了搜索启发式方法和一组启发式评估函数来辅助搜索,使用真实蛋白质核磁共振主链共振分配实例进行的模拟研究证明了其有效性。