Kück Patrick, Wilkinson Mark, Groß Christian, Foster Peter G, Wägele Johann W
Zoologisches Forschungsmuseum Alexander Koenig, Bonn, 53113, Germany.
The Natural History Museum, London, SW7 5BD, United Kingdom.
PLoS One. 2017 Aug 25;12(8):e0183393. doi: 10.1371/journal.pone.0183393. eCollection 2017.
Systematic biases such as long branch attraction can mislead commonly relied upon model-based (i.e. maximum likelihood and Bayesian) phylogenetic methods when, as is usually the case with empirical data, there is model misspecification. We present PhyQuart, a new method for evaluating the three possible binary trees for any quartet of taxa. PhyQuart was developed through a process of reciprocal illumination between a priori considerations and the results of extensive simulations. It is based on identification of site-patterns that can be considered to support a particular quartet tree taking into account the Hennigian distinction between apomorphic and plesiomorphic similarity, and employing corrections to the raw observed frequencies of site-patterns that exploit expectations from maximum likelihood estimation. We demonstrate through extensive simulation experiments that, whereas maximum likeilihood estimation performs well in many cases, it can be outperformed by PhyQuart in cases where it fails due to extreme branch length asymmetries producing long-branch attraction artefacts where there is only very minor model misspecification.
当出现模型设定错误时(经验数据通常如此),诸如长枝吸引等系统偏差会误导常用的基于模型的(即最大似然法和贝叶斯法)系统发育方法。我们提出了PhyQuart,这是一种用于评估任何四个分类单元的三种可能二叉树的新方法。PhyQuart是在先验考虑和广泛模拟结果之间的相互启发过程中开发出来的。它基于对位点模式的识别,这些位点模式可被视为支持特定的四重奏树,同时考虑到同源和近源相似性之间的亨尼希区别,并对利用最大似然估计期望的位点模式原始观察频率进行校正。我们通过广泛的模拟实验证明,虽然最大似然估计在许多情况下表现良好,但在由于极端分支长度不对称导致长枝吸引假象而失败的情况下,PhyQuart可能会优于最大似然估计,而此时模型设定错误非常小。