Liu Liang, Yu Lili, Pearl Dennis K
Department of Organismic and Evolutionary Biology, Harvard University, 26 Oxford Street, Cambridge, MA 02138, USA.
J Math Biol. 2010 Jan;60(1):95-106. doi: 10.1007/s00285-009-0260-0. Epub 2009 Mar 13.
We propose a model-based approach that uses multiple gene trees to estimate the species tree. Under the coalescent process, gene divergences occur earlier than species divergences whenever there is any polymorphism in the ancestral species. Speciation times are therefore constrained to be smaller than the corresponding gene split times. The maximum tree (MT) is the tree with the largest possible speciation times in the space of species trees restricted by the available gene trees. If all populations have the same population size, the MT is the maximum likelihood estimate of the species tree. It can be shown that the MT is a consistent estimator of the species tree even when it is built from estimates of the true gene trees, provided the gene tree estimates are statistically consistent. The MT converges in probability to the true species tree at an exponential rate.
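The definition of the MT can be illustrated with a short Python sketch. This is one possible reading of the abstract, not the authors' implementation. It assumes each gene tree is ultrametric, has one sampled allele per species, and is supplied as a dictionary of pairwise coalescence times in the same units as the species tree. The function names `min_coalescence_times` and `maximum_tree` are hypothetical. The sketch takes, for each species pair, the smallest coalescence time over all genes as an upper bound on the speciation time, then applies single-linkage clustering, which returns the largest ultrametric tree that respects those bounds.

```python
from itertools import combinations


def min_coalescence_times(gene_times):
    """Per species pair, the smallest coalescence time seen in any gene tree.

    gene_times: list of dicts mapping (species_a, species_b) -> time.
    (Assumed input format; every species pair must appear at least once.)
    """
    bound = {}
    for times in gene_times:
        for pair, t in times.items():
            key = frozenset(pair)
            bound[key] = min(t, bound.get(key, float("inf")))
    return bound


def maximum_tree(species, gene_times):
    """Single-linkage clustering on the minimum coalescence times.

    Returns a nested tuple (left, right, speciation_time); leaves are names.
    """
    bound = min_coalescence_times(gene_times)
    # Map each cluster (a set of species) to its subtree built so far.
    clusters = {frozenset([s]): s for s in species}
    while len(clusters) > 1:
        best = None
        for a, b in combinations(clusters, 2):
            # Tightest bound between any member of a and any member of b.
            t = min(bound[frozenset((x, y))] for x in a for y in b)
            if best is None or t < best[0]:
                best = (t, a, b)
        t, a, b = best
        clusters[a | b] = (clusters.pop(a), clusters.pop(b), t)
    return next(iter(clusters.values()))
```

For example, with two gene trees on species A, B and C, the first giving times AB = 1.0, AC = BC = 3.0 and the second giving BC = 2.0, AB = AC = 2.5, the bounds are AB = 1.0, BC = 2.0 and AC = 2.5. The sketch joins A and B at time 1.0 and then joins C at time 2.0, so every speciation time is as large as the gene trees allow.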