Rzhetsky A, Nei M
Institute of Molecular Evolutionary Genetics, Pennsylvania State University, University Park 16802.
J Mol Evol. 1992 Oct;35(4):367-75. doi: 10.1007/BF00161174.
Statistical properties of the ordinary least-squares (OLS), generalized least-squares (GLS), and minimum-evolution (ME) methods of phylogenetic inference were studied by considering the case of four DNA sequences. Analytical study has shown that all three methods are statistically consistent in the sense that as the number of nucleotides examined (m) increases they tend to choose the true tree as long as the evolutionary distances used are unbiased. When evolutionary distances (dij's) are large and sequences under study are not very long, however, the OLS criterion is often biased and may choose an incorrect tree more often than expected under random choice. It is also shown that the variance-covariance matrix of dij's becomes singular as dij's approach zero and thus the GLS may not be applicable when dij's are small. The ME method suffers from neither of these problems, and the ME criterion is statistically unbiased. Computer simulation has shown that the ME method is more efficient in obtaining the true tree than the OLS and GLS methods and that the OLS is more efficient than the GLS when dij's are small, but otherwise the GLS is more efficient.
通过考虑四条DNA序列的情况,研究了系统发育推断的普通最小二乘法(OLS)、广义最小二乘法(GLS)和最小进化法(ME)的统计特性。分析研究表明,只要所使用的进化距离无偏,随着所检测核苷酸数量(m)的增加,这三种方法在统计上都是一致的,即它们倾向于选择真实的树。然而,当进化距离(dij)较大且所研究的序列不是很长时,OLS准则往往存在偏差,并且可能比随机选择情况下预期的更频繁地选择错误的树。研究还表明,当dij接近零时,dij的方差 - 协方差矩阵会变得奇异,因此当dij较小时,GLS可能不适用。ME方法不存在这些问题,并且ME准则在统计上是无偏的。计算机模拟表明,ME方法在获得真实树方面比OLS和GLS方法更有效,并且当dij较小时,OLS比GLS更有效,但在其他情况下GLS更有效。