Serdoz Stuart, Egri-Nagy Attila, Sumner Jeremy, Holland Barbara R, Jarvis Peter D, Tanaka Mark M, Francis Andrew R
Centre for Research in Mathematics, Western Sydney University, Australia.
Centre for Research in Mathematics, Western Sydney University, Australia; Akita International University, Japan.
J Theor Biol. 2017 Jun 21;423:31-40. doi: 10.1016/j.jtbi.2017.04.015. Epub 2017 Apr 20.
Accurate estimation of evolutionary distances between taxa is important for many phylogenetic reconstruction methods. Distances can be estimated using a range of different evolutionary models, from single nucleotide polymorphisms to large-scale genome rearrangements. Corresponding corrections for genome rearrangement distances fall into 3 categories: Empirical computational studies, Bayesian/MCMC approaches, and combinatorial approaches. Here, we introduce a maximum likelihood estimator for the inversion distance between a pair of genomes, using a group-theoretic approach to modelling inversions introduced recently. This MLE functions as a corrected distance: in particular, we show that because of the way sequences of inversions interact with each other, it is quite possible for minimal distance and MLE distance to differently order the distances of two genomes from a third. The second aspect tackles the problem of accounting for the symmetries of circular arrangements. While, generally, a frame of reference is locked, and all computation made accordingly, this work incorporates the action of the dihedral group so that distance estimates are free from any a priori frame of reference. The philosophy of accounting for symmetries can be applied to any existing correction method, for which examples are offered.
准确估计分类单元之间的进化距离对于许多系统发育重建方法都很重要。可以使用一系列不同的进化模型来估计距离,从单核苷酸多态性到大规模基因组重排。基因组重排距离的相应校正可分为三类:实证计算研究、贝叶斯/马尔可夫链蒙特卡罗方法和组合方法。在这里,我们使用最近引入的一种群论方法来对倒位进行建模,从而引入了一种用于估计一对基因组之间倒位距离的最大似然估计器。这种最大似然估计器起到校正距离的作用:特别是,我们表明,由于倒位序列相互作用的方式,最小距离和最大似然估计器距离很可能会以不同的顺序排列两个基因组与第三个基因组之间的距离。第二个方面解决了考虑环状排列对称性的问题。虽然通常会锁定一个参考框架,并据此进行所有计算,但这项工作纳入了二面体群的作用,以便距离估计不受任何先验参考框架的影响。考虑对称性的理念可以应用于任何现有的校正方法,并给出了相关示例。