Nei M, Kumar S, Takahashi K
Institute of Molecular Evolutionary Genetics and Department of Biology, Pennsylvania State University, University Park, PA 16802-5301, USA.
Proc Natl Acad Sci U S A. 1998 Oct 13;95(21):12390-7. doi: 10.1073/pnas.95.21.12390.
In the maximum parsimony (MP) and minimum evolution (ME) methods of phylogenetic inference, evolutionary trees are constructed by searching for the topology that shows the minimum number of mutational changes required (M) and the smallest sum of branch lengths (S), respectively, whereas in the maximum likelihood (ML) method the topology showing the highest maximum likelihood (A) of observing a given data set is chosen. However, the theoretical basis of the optimization principle remains unclear. We therefore examined the relationships of M, S, and A for the MP, ME, and ML trees with those for the true tree by using computer simulation. The results show that M and S are generally greater for the true tree than for the MP and ME trees when the number of nucleotides examined (n) is relatively small, whereas A is generally lower for the true tree than for the ML tree. This finding indicates that the optimization principle tends to give incorrect topologies when n is small. To deal with this disturbing property of the optimization principle, we suggest that more attention should be given to testing the statistical reliability of an estimated tree rather than to finding the optimal tree with excessive efforts. When a reliability test is conducted, simplified MP, ME, and ML algorithms such as the neighbor-joining method generally give conclusions about phylogenetic inference very similar to those obtained by the more extensive tree search algorithms.
在系统发育推断的最大简约法(MP)和最小进化法(ME)中,构建进化树时分别寻找显示所需最少突变变化数(M)的拓扑结构和最小分支长度总和(S)的拓扑结构,而在最大似然法(ML)中,则选择显示观察给定数据集的最高最大似然值(A)的拓扑结构。然而,优化原则的理论基础仍不清楚。因此,我们通过计算机模拟研究了MP、ME和ML树的M、S和A与真实树的M、S和A之间的关系。结果表明,当所检查的核苷酸数量(n)相对较小时,真实树的M和S通常大于MP和ME树,而真实树的A通常低于ML树。这一发现表明,当n较小时,优化原则往往会给出错误的拓扑结构。为了应对优化原则的这一令人不安的特性,我们建议应更多地关注测试估计树的统计可靠性,而不是过度努力地寻找最优树。当进行可靠性测试时,诸如邻接法等简化的MP、ME和ML算法通常会得出与通过更广泛的树搜索算法得出的系统发育推断结论非常相似的结论。