Tannier Eric, Zheng Chunfang, Sankoff David
INRIA Rhône-Alpes, Inovallée, Montbonnot, Saint Ismier Cedex, France.
BMC Bioinformatics. 2009 Apr 22;10:120. doi: 10.1186/1471-2105-10-120.
Genome median and genome halving are combinatorial optimization problems that aim at reconstructing ancestral genomes as well as the evolutionary events leading from the ancestor to extant species. Exploring complexity issues is a first step towards devising efficient algorithms. The complexity of the median problem for unichromosomal genomes (permutations) has been settled for both the breakpoint distance and the reversal distance. Although the multichromosomal case has often been assumed to be a simple generalization of the unichromosomal case, it is also a relaxation so that complexity in this context does not follow from existing results, and is open for all distances.
We settle here the complexity of several genome median and halving problems, including a surprising polynomial result for the breakpoint median and guided halving problems in genomes with circular and linear chromosomes, showing that the multichromosomal problem is actually easier than the unichromosomal problem. Still other variants of these problems are NP-complete, including the DCJ double distance problem, previously mentioned as an open question. We list the remaining open problems.
This theoretical study clears up a wide swathe of the algorithmical study of genome rearrangements with multiple multichromosomal genomes.
基因组中位数和基因组减半是组合优化问题,旨在重建祖先基因组以及从祖先到现存物种的进化事件。探索复杂性问题是设计高效算法的第一步。单染色体基因组(排列)中位数问题在断点距离和反转距离方面的复杂性已经得到解决。尽管多染色体情况通常被认为是单染色体情况的简单推广,但它也是一种松弛,因此在此背景下的复杂性并非从现有结果得出,并且对于所有距离都是开放的。
我们在此解决了几个基因组中位数和减半问题的复杂性,包括在具有环状和线性染色体的基因组中,断点中位数和引导减半问题得出的令人惊讶的多项式结果,表明多染色体问题实际上比单染色体问题更容易。这些问题的其他变体仍然是NP完全问题,包括之前作为开放问题提到的DCJ双距离问题。我们列出了其余的开放问题。
这项理论研究澄清了多个多染色体基因组的基因组重排算法研究中的一大片问题。