Departments of Biological Statistics and Computational Biology and Molecular Biology and Genetics, Cornell University, Ithaca, NY 14853.
Proc Natl Acad Sci U S A. 2014 Jan 14;111(2):757-62. doi: 10.1073/pnas.1310398110. Epub 2013 Dec 30.
Human populations have experienced dramatic growth since the Neolithic revolution. Recent studies that sequenced a very large number of individuals observed an extreme excess of rare variants and provided clear evidence of recent rapid growth in effective population size, although estimates have varied greatly among studies. All these studies were based on protein-coding genes, in which variants are also impacted by natural selection. In this study, we introduce targeted sequencing data for studying recent human history with minimal confounding by natural selection. We sequenced loci far from genes that meet a wide array of additional criteria such that mutations in these loci are putatively neutral. As population structure also skews allele frequencies, we sequenced 500 individuals of relatively homogeneous ancestry by first analyzing the population structure of 9,716 European Americans. We used very high coverage sequencing to reliably call rare variants and fit an extensive array of models of recent European demographic history to the site frequency spectrum. The best-fit model estimates ∼ 3.4% growth per generation during the last ∼ 140 generations, resulting in a population size increase of two orders of magnitude. This model fits the data very well, largely due to our observation that assumptions of more ancient demography can impact estimates of recent growth. This observation and results also shed light on the discrepancy in demographic estimates among recent studies.
自新石器时代革命以来,人类人口经历了显著的增长。最近的研究对大量个体进行了测序,观察到了罕见变异的极端过剩,并提供了近期有效种群规模快速增长的明确证据,尽管不同研究之间的估计差异很大。所有这些研究都基于编码蛋白质的基因,其中变异也受到自然选择的影响。在这项研究中,我们引入了靶向测序数据,以最小的自然选择干扰来研究近期人类历史。我们对远离基因的基因座进行了测序,这些基因座符合广泛的其他标准,因此这些基因座中的突变被假定为中性。由于群体结构也会扭曲等位基因频率,我们通过首先分析 9716 名欧洲裔美国人的群体结构,对 500 名相对同质祖先的个体进行了测序。我们使用非常高的覆盖测序来可靠地调用罕见变异,并拟合了广泛的近期欧洲人口历史模型,以适应位点频率谱。最佳拟合模型估计在过去的 140 代中,每代增长约 3.4%,导致种群规模增加了两个数量级。该模型非常符合数据,这在很大程度上是由于我们观察到,更古老的人口统计学假设会影响对近期增长的估计。这一观察结果和结果也揭示了近期研究中人口统计学估计值的差异。