Edwards S V, Beerli P
Department of Zoology and Burke Museum, University of Washington, Seattle 98195, USA.
Evolution. 2000 Dec;54(6):1839-54. doi: 10.1111/j.0014-3820.2000.tb01231.x.
Molecular methods as applied to the biogeography of single species (phylogeography) or multiple codistributed species (comparative phylogeography) have been productively and extensively used to elucidate common historical features in the diversification of the Earth's biota. However, only recently have methods for estimating population divergence times or their confidence limits while taking into account the critical effects of genetic polymorphism in ancestral species become available, and earlier methods for doing so are underutilized. We review models that address the crucial distinction between the gene divergence, the parameter that is typically recovered in molecular phylogeographic studies, and the population divergence, which is in most cases the parameter of interest and will almost always postdate the gene divergence. Assuming that population sizes of ancestral species are distributed similarly to those of extant species, we show that phylogeographic studies in vertebrates suggest that divergence of alleles in ancestral species can comprise from less than 10% to over 50% of the total divergence between sister species, suggesting that the problem of ancestral polymorphism in dating population divergence can be substantial. The variance in the number of substitutions (among loci for a given species or among species for a given gene) resulting from the stochastic nature of DNA change is generally smaller than the variance due to substitutions along allelic lines whose coalescence times vary due to genetic drift in the ancestral population. Whereas the former variance can be reduced by further DNA sequencing at a single locus, the latter cannot. Contrary to phylogeographic intuition, dating population divergence times when allelic lines have achieved reciprocal monophyly is in some ways more challenging than when allelic lines have not achieved monophyly, because in the former case critical data on ancestral population size provided by residual ancestral polymorphism is lost. In the former case differences in coalescence time between species pairs can in principle be explained entirely by differences in ancestral population size without resorting to explanations involving differences in divergence time. Furthermore, the confidence limits on population divergence times are severely underestimated when those for number of substitutions per site in the DNA sequences examined are used as a proxy. This uncertainty highlights the importance of multilocus data in estimating population divergence times; multilocus data can in principle distinguish differences in coalescence time (T) resulting from differences in population divergence time and differences in T due to differences in ancestral population sizes and will reduce the confidence limits on the estimates. We analyze the contribution of ancestral population size (theta) to T and the effect of uncertainty in theta on estimates of population divergence (tau) for single loci under reciprocal monophyly using a simple Bayesian extension of Takahata and Satta's and Yang's recent coalescent methods. The confidence limits on tau decrease when the range over which ancestral population size theta is assumed to be distributed decreases and when tau increases; they generally exclude zero when tau/(4Ne) > 1. We also apply a maximum-likelihood method to several single and multilocus data sets. With multilocus data, the criterion for excluding tau = 0 is roughly that l tau/(4Ne) > 1, where l is the number of loci. Our analyses corroborate recent suggestions that increasing the number of loci is critical to decreasing the uncertainty in estimates of population divergence time.
应用于单一物种生物地理学(系统发育地理学)或多个共分布物种(比较系统发育地理学)的分子方法,已被有效地广泛用于阐明地球生物群多样化过程中的共同历史特征。然而,直到最近,才出现了在考虑祖先物种遗传多态性的关键影响的同时估计种群分化时间或其置信区间的方法,而早期的此类方法未得到充分利用。我们回顾了一些模型,这些模型解决了基因分化(分子系统发育地理学研究中通常恢复的参数)与种群分化(在大多数情况下是感兴趣的参数,并且几乎总是晚于基因分化)之间的关键区别。假设祖先物种的种群大小与现存物种的种群大小分布相似,我们表明脊椎动物的系统发育地理学研究表明,祖先物种中等位基因的分化可能占姐妹物种之间总分化的不到10%至超过50%,这表明在确定种群分化时间时祖先多态性问题可能相当严重。由于DNA变化的随机性导致的替换数(对于给定物种的不同位点或对于给定基因的不同物种之间)的方差通常小于由于沿着等位基因线的替换导致的方差,这些等位基因线的合并时间因祖先种群中的遗传漂变而不同。虽然前一种方差可以通过在单个位点进行进一步的DNA测序来降低,但后一种方差则不能。与系统发育地理学的直觉相反,当等位基因线达到相互单系性时确定种群分化时间在某些方面比等位基因线未达到单系性时更具挑战性,因为在前一种情况下,由残余祖先多态性提供的关于祖先种群大小的关键数据会丢失。在前一种情况下,物种对之间合并时间的差异原则上可以完全由祖先种群大小的差异来解释,而无需诉诸涉及分化时间差异的解释。此外,当将所检查的DNA序列中每个位点的替换数的置信区间用作替代时,种群分化时间的置信区间会被严重低估。这种不确定性凸显了多位点数据在估计种群分化时间方面的重要性;多位点数据原则上可以区分由于种群分化时间差异导致的合并时间(T)差异和由于祖先种群大小差异导致的T差异,并将降低估计的置信区间。我们使用高桥和佐藤以及杨最近的合并方法的简单贝叶斯扩展,分析了祖先种群大小(θ)对T的贡献以及θ的不确定性对相互单系性下单个位点的种群分化(τ)估计的影响。当假设祖先种群大小θ的分布范围减小且τ增加时,τ的置信区间会减小;当τ/(4Ne) > 1时,它们通常不包括零。我们还将最大似然法应用于几个单一位点和多位点数据集。对于多位点数据,排除τ = 0的标准大致是l τ/(4Ne) > 1,其中l是位点的数量。我们的分析证实了最近的建议,即增加位点数量对于降低种群分化时间估计的不确定性至关重要。