Slatkin M, Hudson R R
Department of Integrative Biology, University of California, Berkeley 94720.
Genetics. 1991 Oct;129(2):555-62. doi: 10.1093/genetics/129.2.555.
We consider the distribution of pairwise sequence differences of mitochondrial DNA or of other nonrecombining portions of the genome in a population that has been of constant size and in a population that has been growing in size exponentially for a long time. We show that, in a population of constant size, the sample distribution of pairwise differences will typically deviate substantially from the geometric distribution expected, because the history of coalescent events in a single sample of genes imposes a substantial correlation on pairwise differences. Consequently, a goodness-of-fit test of observed pairwise differences to the geometric distribution, which assumes that each pairwise comparison is independent, is not a valid test of the hypothesis that the genes were sampled from a panmictic population of constant size. In an exponentially growing population in which the product of the current population size and the growth rate is substantially larger than one, our analytical and simulation results show that most coalescent events occur relatively early and in a restricted range of times. Hence, the "gene tree" will be nearly a "star phylogeny" and the distribution of pairwise differences will be nearly a Poisson distribution. In that case, it is possible to estimate r, the population growth rate, if the mutation rate, mu, and current population size, N0, are assumed known. The estimate of r is the solution to ri/mu = ln(N0r) - gamma, where i is the average pairwise difference and gamma approximately 0.577 is Euler's constant.
我们考虑了在一个大小恒定的种群以及一个长期呈指数增长的种群中,线粒体DNA或基因组其他非重组部分的成对序列差异的分布情况。我们表明,在大小恒定的种群中,成对差异的样本分布通常会显著偏离预期的几何分布,因为单个基因样本中的合并事件历史会给成对差异带来显著的相关性。因此,对观察到的成对差异与几何分布进行拟合优度检验(该检验假定每个成对比较都是独立的),并非是对基因取自大小恒定的随机交配种群这一假设的有效检验。在当前种群大小与增长率的乘积远大于1的指数增长种群中,我们的分析和模拟结果表明,大多数合并事件发生得相对较早,且发生时间范围有限。因此,“基因树”将近乎是一棵“星状系统发育树”,而成对差异的分布将近乎是泊松分布。在这种情况下,如果假定突变率μ和当前种群大小N0已知,就有可能估计种群增长率r。r的估计值是ri/μ = ln(N0r) - γ的解,其中i是平均成对差异,γ约为0.577是欧拉常数。