Tateno Y, Tajima F
J Mol Evol. 1986;23(4):354-61. doi: 10.1007/BF02100645.
The statistical properties of three molecular tree construction methods--the unweighted pair-group arithmetic average clustering (UPG), Farris, and modified Farris methods--are examined under the neutral mutation model of evolution. The methods are compared for accuracy in construction of the topology and estimation of the branch lengths, using statistics of these two aspects. The distribution of the statistic concerning topological construction is shown to be as important as its mean and variance for the comparison. Of the three methods, the UPG method constructs the tree topology with the least variation. The modified Farris method, however, gives the best performance when the two aspects are considered simultaneously. It is also shown that a topology based on two genes is much more accurate than that based on one gene. There is a tendency to accept published molecular trees, but uncritical acceptance may lead one to spurious conclusions. It should always be kept in mind that a tree is a statistical result that is affected strongly by the stochastic error of nucleotide substitution and the error intrinsic to the tree construction method itself.
在中性突变进化模型下,研究了三种分子树构建方法——非加权配对组算术平均聚类法(UPG)、法里斯法和改良法里斯法——的统计特性。利用这两个方面的统计数据,比较了这些方法在构建拓扑结构和估计分支长度方面的准确性。结果表明,在比较中,关于拓扑结构构建的统计量的分布与其均值和方差同样重要。在这三种方法中,UPG法构建的树拓扑结构变化最小。然而,当同时考虑这两个方面时,改良法里斯法表现最佳。研究还表明,基于两个基因的拓扑结构比基于一个基因的拓扑结构要准确得多。人们倾向于接受已发表的分子树,但不加批判地接受可能会导致得出虚假的结论。应该始终牢记,一棵树是一个统计结果,它受到核苷酸替换的随机误差和树构建方法本身固有的误差的强烈影响。