IEEE/ACM Trans Comput Biol Bioinform. 2017 May-Jun;14(3):587-599. doi: 10.1109/TCBB.2015.2511761. Epub 2015 Dec 23.
Duplication-Transfer-Loss (DTL) reconciliation has emerged as a powerful technique for studying gene family evolution in the presence of horizontal gene transfer. DTL reconciliation takes as input a gene family phylogeny and the corresponding species phylogeny, and reconciles the two by postulating speciation, gene duplication, horizontal gene transfer, and gene loss events. Efficient algorithms exist for finding optimal DTL reconciliations when the gene tree is binary. However, gene trees are frequently non-binary. With such non-binary gene trees, the reconciliation problem seeks to find a binary resolution of the gene tree that minimizes the reconciliation cost. Given the prevalence of non-binary gene trees, many efficient algorithms have been developed for this problem in the context of the simpler Duplication-Loss (DL) reconciliation model. Yet, no efficient algorithms exist for DTL reconciliation with non-binary gene trees and the complexity of the problem remains unknown. In this work, we resolve this open question by showing that the problem is, in fact, NP-hard. Our reduction applies to both the dated and undated formulations of DTL reconciliation. By resolving this long-standing open problem, this work will spur the development of both exact and heuristic algorithms for this important problem.
复制-转移-缺失(DTL)协调已成为研究水平基因转移存在下基因家族进化的强大技术。DTL 协调将基因家族系统发育和相应的物种系统发育作为输入,并通过假设物种形成、基因复制、水平基因转移和基因缺失事件来协调两者。当基因树为二进制时,存在用于找到最优 DTL 协调的有效算法。然而,基因树经常是非二进制的。对于这种非二进制的基因树,协调问题旨在找到基因树的二进制分辨率,以最小化协调成本。鉴于非二进制基因树的普遍性,已经为这个问题在更简单的复制-缺失(DL)协调模型的背景下开发了许多有效的算法。然而,对于具有非二进制基因树的 DTL 协调,还没有有效的算法,并且该问题的复杂性仍然未知。在这项工作中,我们通过证明该问题实际上是 NP 难的来解决这个悬而未决的问题。我们的归约适用于 DTL 协调的有日期和无日期公式。通过解决这个长期存在的开放性问题,这项工作将激发针对这个重要问题的精确和启发式算法的开发。