School of Life Sciences, École Polytechnique Fédérale de Lausanne (EPFL), Lausanne, Switzerland.
School of Basic Sciences, École Polytechnique Fédérale de Lausanne (EPFL), Lausanne, Switzerland.
Genetics. 2018 Jan;208(1):323-338. doi: 10.1534/genetics.117.300499. Epub 2017 Nov 10.
Nonequilibrium demography impacts coalescent genealogies leaving detectable, well-studied signatures of variation. However, similar genomic footprints are also expected under models of large reproductive skew, posing a serious problem when trying to make inference. Furthermore, current approaches consider only one of the two processes at a time, neglecting any genomic signal that could arise from their simultaneous effects, preventing the possibility of jointly inferring parameters relating to both offspring distribution and population history. Here, we develop an extended Moran model with exponential population growth, and demonstrate that the underlying ancestral process converges to a time-inhomogeneous psi-coalescent. However, by applying a nonlinear change of time scale-analogous to the Kingman coalescent-we find that the ancestral process can be rescaled to its time-homogeneous analog, allowing the process to be simulated quickly and efficiently. Furthermore, we derive analytical expressions for the expected site-frequency spectrum under the time-inhomogeneous psi-coalescent, and develop an approximate-likelihood framework for the joint estimation of the coalescent and growth parameters. By means of extensive simulation, we demonstrate that both can be estimated accurately from whole-genome data. In addition, not accounting for demography can lead to serious biases in the inferred coalescent model, with broad implications for genomic studies ranging from ecology to conservation biology. Finally, we use our method to analyze sequence data from Japanese sardine populations, and find evidence of high variation in individual reproductive success, but few signs of a recent demographic expansion.
非平衡人口统计学对合并谱系产生影响,留下可检测的、经过充分研究的变异特征。然而,在大生殖偏斜模型下,也预计会出现类似的基因组足迹,这在尝试进行推断时是一个严重的问题。此外,当前的方法一次只考虑两个过程中的一个,忽略了任何可能由于它们的同时影响而产生的基因组信号,从而无法联合推断与后代分布和种群历史有关的参数。在这里,我们开发了一个具有指数种群增长的扩展 Moran 模型,并证明了基础祖先过程收敛到非时齐 psi 合并。然而,通过应用非线性时间尺度变换——类似于 Kingman 合并——我们发现祖先过程可以被重新缩放为其时齐类似物,从而可以快速有效地模拟该过程。此外,我们推导出了非时齐 psi 合并下预期的位点频率谱的解析表达式,并为合并和增长参数的联合估计开发了一个近似似然框架。通过广泛的模拟,我们证明了这两个参数都可以从全基因组数据中准确估计。此外,不考虑人口统计学可能会导致合并模型推断的严重偏差,这对从生态学到保护生物学的基因组研究都有广泛的影响。最后,我们使用我们的方法分析了来自日本沙丁鱼种群的序列数据,发现个体生殖成功存在高度变异的证据,但最近的人口扩张迹象很少。