Hellmuth Marc, Huber Katharina T, Moulton Vincent
Institute of Mathematics and Computer Science, University of Greifswald, Greifswald, Germany.
Center for Bioinformatics, Saarland University, Saarbrücken, Germany.
J Math Biol. 2019 Oct;79(5):1885-1925. doi: 10.1007/s00285-019-01414-8. Epub 2019 Aug 13.
Phylogenomics commonly aims to construct evolutionary trees from genomic sequence information. One way to approach this problem is to first estimate event-labeled gene trees (i.e., rooted trees whose non-leaf vertices are labeled by speciation or gene duplication events), and to then look for a species tree which can be reconciled with this tree through a reconciliation map between the trees. In practice, however, it can happen that there is no such map from a given event-labeled tree to any species tree. An important situation where this might arise is where the species evolution is better represented by a network instead of a tree. In this paper, we therefore consider the problem of reconciling event-labeled trees with species networks. In particular, we prove that any event-labeled gene tree can be reconciled with some network and that, under certain mild assumptions on the gene tree, the network can even be assumed to be multi-arc free. To prove this result, we show that we can always reconcile the gene tree with some multi-labeled (MUL-)tree, which can then be "folded up" to produce the desired reconciliation and network. In addition, we study the interplay between reconciliation maps from event-labeled gene trees to MUL-trees and networks. Our results could be useful for understanding how genomes have evolved after undergoing complex evolutionary events such as polyploidy.
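As an illustration of the objects named in the abstract, the following is a minimal sketch of an event-labeled gene tree, assuming a simple in-memory representation that is not taken from the paper. The names `Node`, `is_event_labeled`, and `species_multiplicity` are hypothetical, and each leaf is assumed to carry the name of the species its gene belongs to. The sketch checks that every non-leaf vertex is labeled by a speciation or duplication event, and counts how often each species label occurs among the leaves, which is the kind of repeated labeling that a multi-labeled (MUL-)tree permits.

```python
from __future__ import annotations

from collections import Counter
from dataclasses import dataclass, field

# Event labels for non-leaf vertices (naming is an assumption of this sketch).
SPECIATION = "speciation"
DUPLICATION = "duplication"
EVENTS = (SPECIATION, DUPLICATION)


@dataclass
class Node:
    # For a non-leaf vertex: an event label.
    # For a leaf: the species of the gene (an assumption of this sketch).
    label: str
    children: list[Node] = field(default_factory=list)

    def is_leaf(self) -> bool:
        return not self.children


def is_event_labeled(node: Node) -> bool:
    """True if every non-leaf vertex carries an event label and no leaf does."""
    if node.is_leaf():
        return node.label not in EVENTS
    return node.label in EVENTS and all(is_event_labeled(c) for c in node.children)


def species_multiplicity(node: Node) -> Counter:
    """Count how many leaves carry each species label."""
    if node.is_leaf():
        return Counter([node.label])
    total: Counter = Counter()
    for child in node.children:
        total += species_multiplicity(child)
    return total


# A duplication at the root followed by two speciations; species "A"
# appears twice among the leaves.
gene_tree = Node(DUPLICATION, [
    Node(SPECIATION, [Node("A"), Node("B")]),
    Node(SPECIATION, [Node("A"), Node("C")]),
])
```

Calling `is_event_labeled(gene_tree)` validates the labeling, and `species_multiplicity(gene_tree)` reports which species labels occur more than once. The sketch does not construct a reconciliation map or a network; it only fixes the data that such a construction would take as input.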