有界最大度矩阵的间隔连续1属性问题的复杂性

The complexity of the gapped consecutive-ones property problem for matrices of bounded maximum degree.

作者信息

Maňuch Ján, Patterson Murray

机构信息

Department of Mathematics, Simon Fraser University, Burnaby, Canada.

出版信息

J Comput Biol. 2011 Sep;18(9):1243-53. doi: 10.1089/cmb.2011.0128.

DOI:10.1089/cmb.2011.0128

PMID:21899429

Abstract

The Gapped Consecutive-Ones Property (C1P) Problem, or the (k, δ)-C1P Problem is: given a binary matrix M and integers k and δ, decide if the columns of M can be ordered such that each row contains at most k blocks of 1's, and no two neighboring blocks of 1's are separated by a gap of more than δ 0's. This problem was introduced by Chauve et al. ( 2009b ). The classical polynomial-time solvable C1P Problem is equivalent to the (1, 0)-C1P problem. It has been shown that, for every unbounded or bounded k ≥ 2 and unbounded or bounded δ ≥ 1, except when (k, δ) = (2, 1), the (k, δ)-C1P Problem is NP-complete (Maňuch et al., 2011 ; Goldberg et al., 1995 ). In this article, we study the Gapped C1P Problem with a third parameter d, namely the bound on the maximum number of 1's in any row of M, or the bound on the maximum degree of M. This is motivated by the reconstruction of ancestral genomes (Ma et al., 2006 ; Chauve and Tannier, 2008 ), where, in binary matrices obtained from the experiments of Chauve and Tannier ( 2008 ), we have observed that the majority of the rows have low degree, while each high degree row contains many rows of low degree. The (d, k, δ)-C1P Problem has been shown to be polynomial-time solvable when all three parameters are fixed (Chauve et al., 2009b ). Since fixing d also fixes k (k ≤ d), the only case left to consider is the case when δ is unbounded, or the (d, k, ∞)-C1P Problem. Here we show that for every d > k ≥ 2, the (d, k, ∞)-C1P Problem is NP-complete.

摘要

间隔连续1属性（C1P）问题，即（k, δ）-C1P问题是：给定一个二进制矩阵M以及整数k和δ，判断是否可以对M的列进行排序，使得每行最多包含k个由1组成的块，并且任意两个相邻的由1组成的块之间被0隔开的间隔不超过δ个。这个问题由肖夫等人（2009年b）提出。经典的多项式时间可解的C1P问题等同于（1, 0）-C1P问题。已经证明，对于每个无界或有界的k≥2以及无界或有界的δ≥1，除了（k, δ） = （2, 1）的情况外，（k, δ）-C1P问题是NP完全问题（马努赫等人，2011年；戈德堡等人，1995年）。在本文中，我们研究带有第三个参数d的间隔C1P问题，即M的任意一行中1的最大数量的界限，或者M的最大度数的界限。这是受祖先基因组重建问题的启发（马等人，2006年；肖夫和坦尼尔，2008年），在从肖夫和坦尼尔（2008年）的实验中得到的二进制矩阵里，我们观察到大多数行的度数较低，而每个高度数的行包含许多低度数的行。当所有三个参数都固定时，（d, k, δ）-C1P问题已被证明是多项式时间可解的（肖夫等人，2009年b）。由于固定d也会固定k（k≤d），剩下唯一需要考虑的情况是δ无界的情况，即（d, k, ∞）-C1P问题。在这里我们证明，对于每个d > k≥2，（d, k, ∞）-C1P问题是NP完全问题。