Felsenstein Joseph
Department of Genetics SK-50, University of Washington, Seattle, WA, 98195, USA.
Evolution. 1992 Feb;46(1):159-173. doi: 10.1111/j.1558-5646.1992.tb01991.x.
Restriction sites data can be analyzed by maximum likelihood to obtain estimates of phylogenies. The likelihood methods of Smouse and Li, who were able to compute likelihoods for up to four species under a simplified model of base change, can be extended numerically to deal with any number of species. The computational methods for doing so are outlined. The resulting algorithms are slow but take multiple gains and losses of restriction sites fully into account, unlike parsimony methods. They allow for the failure to observe potential sites that are absent from all species. Analysis of the five-species hominoid data of Ferris and coworkers confirms the pattern found by Smouse and Li with four species-that a chimpanzee-gorilla clade is favored, but not statistically significantly over other tree topologies. A large data set produced by computer simulation has also been analyzed to confirm that the method works properly. The methods used here do not allow for different rates of transitions and transversions. They can be extended to do so, but only at a cost of considerably slower computations. The present method is available in a computer program.
限制酶切位点数据可通过最大似然法进行分析,以获得系统发育的估计值。斯茅斯和李的似然方法能够在简化的碱基变化模型下计算多达四个物种的似然值,该方法可通过数值扩展来处理任意数量的物种。文中概述了这样做的计算方法。由此产生的算法虽然速度较慢,但与简约法不同,它充分考虑了限制酶切位点的多次获得和丢失情况。它们允许考虑所有物种中都不存在的潜在位点未被观察到的情况。对费里斯及其同事的五种类人猿数据的分析证实了斯茅斯和李在四个物种数据中发现的模式——即黑猩猩 - 大猩猩分支受到支持,但在统计学上并不显著优于其他树形拓扑结构。还对通过计算机模拟产生的一个大数据集进行了分析,以确认该方法运行正常。这里使用的方法不考虑转换和颠换的不同速率。它们可以扩展以考虑这一点,但代价是计算速度会大幅减慢。本方法有相应的计算机程序可用。