Jin L, Nei M
Center for Demographic and Population Genetics, Graduate School of Biomedical Sciences, University of Texas Health Science Center, Houston 77225.
Mol Biol Evol. 1990 Jan;7(1):82-102. doi: 10.1093/oxfordjournals.molbev.a040588.
Lake's evolutionary parsimony (EP) method of constructing a phylogenetic tree is primarily applied to four DNA sequences. In this method, three quantities--X, Y, and Z--that correspond to three possible unrooted trees are computed, and an invariance property of these quantities is used for choosing the best tree. However, Lake's method depends on a number of unrealistic assumptions. We therefore examined the theoretical basis of his method and reached the following conclusions: (1) When the rates of two transversional changes from a nucleotide are unequal, his invariance property breaks down. (2) Even if the rates of two transversional changes are equal, the invariance property requires some additional conditions. (3) When Kimura's two-parameter model of nucleotide substitution applies and the rate of nucleotide substitution varies greatly with branch, the EP method is generally better than the standard maximum-parsimony (MP) method in recovering the correct tree but is inferior to the neighbor-joining (NJ) and a few other distance matrix methods. (4) When the rate of nucleotide substitution is the same or nearly the same for all branches, the EP method is inferior to the MP method even if the proportion of transitional changes is high. (5) When Lake's assumptions fail, his chi2 test may identify an erroneous tree as the correct tree. This happens because the test is not for comparing different trees. (6) As long as a proper distance measure is used, the NJ method is better than the EP and MP methods whether there is a transition/transversion bias or whether there is variation in substitution rate among different nucleotide sites.
莱克构建系统发育树的进化简约法(EP)主要应用于四条DNA序列。在该方法中,计算与三种可能的无根树相对应的三个量——X、Y和Z,并利用这些量的不变性来选择最佳树。然而,莱克的方法依赖于一些不现实的假设。因此,我们研究了他的方法的理论基础并得出以下结论:(1)当核苷酸的两种颠换变化率不相等时,他的不变性就会失效。(2)即使两种颠换变化率相等,不变性也需要一些附加条件。(3)当应用木村二参数核苷酸替换模型且核苷酸替换率随分支变化很大时,在恢复正确的树方面,EP方法通常比标准的最大简约法(MP)更好,但不如邻接法(NJ)和其他一些距离矩阵方法。(4)当所有分支的核苷酸替换率相同或几乎相同时,即使转换变化的比例很高,EP方法也不如MP方法。(5)当莱克的假设不成立时,他的卡方检验可能会将错误的树识别为正确的树。出现这种情况是因为该检验不是用于比较不同的树。(6)只要使用适当的距离度量,无论是否存在转换/颠换偏差,也无论不同核苷酸位点之间的替换率是否存在差异,NJ方法都比EP和MP方法更好。