School of Mathematics and Statistics / Melbourne Integrative Genomics, The University of Melbourne, Melbourne, Australia.
School of BioSciences, The University of Melbourne, Melbourne, Australia.
J Theor Biol. 2019 Jul 7;472:54-66. doi: 10.1016/j.jtbi.2019.04.001. Epub 2019 Apr 3.
The phylogenetic trees of genes and the species which they belong to are similar, but distinct due to various evolutionary processes which affect genes but do not create new species. Reconciliations map the gene tree into the species tree, explaining the discrepancies by events including gene duplications and losses. However, when duplicate genes undergo recombination (a phenomenon known as paralog exchange, or non-allelic homologous recombination), the phylogeny of the genes becomes a network, not a tree. In this paper, we explore how to reconcile a gene network to a species tree with duplications and losses. We propose an extension of the lowest common ancestor (LCA) mapping which solves the problem for tree-child gene networks, show that a restricted version of the problem is polynomial-time solvable and bounds the optimal position of each gene node in the full problem, and show that the full problem is fixed-parameter tractable in the level of the gene network. This provides a formal foundation for the development of efficient algorithms to solve this problem.
基因与其所属物种的系统发育树相似,但由于各种影响基因但不产生新物种的进化过程而有所不同。调和将基因树映射到物种树上,通过包括基因重复和丢失在内的事件来解释差异。然而,当重复基因发生重组(一种称为基因对交换或非等位基因同源重组的现象)时,基因的系统发育就变成了一个网络,而不是一棵树。在本文中,我们探讨了如何将基因网络与具有重复和丢失的物种树进行调和。我们提出了一种最低共同祖先(LCA)映射的扩展,解决了树-子基因网络的问题,表明该问题的一个受限版本是多项式时间可解的,并限制了完整问题中每个基因节点的最佳位置,表明完整问题在基因网络的级别上是固定参数可解的。这为开发解决该问题的有效算法提供了一个正式的基础。