Zhao Lei, Gossmann Toni I, Waxman David
Centre for Computational Systems Biology, Fudan University, 220 Handan Road, Shanghai 200433, People׳s Republic of China.
Department of Animal and Plant Sciences, University of Sheffield, Sheffield S10 2TN, United Kingdom.
J Theor Biol. 2016 Mar 21;393:218-28. doi: 10.1016/j.jtbi.2016.01.002. Epub 2016 Jan 12.
The Wright-Fisher model is an important model in evolutionary biology and population genetics. It has been applied in numerous analyses of finite populations with discrete generations. It is recognised that real populations can behave, in some key aspects, as though their size that is not the census size, N, but rather a smaller size, namely the effective population size, Ne. However, in the Wright-Fisher model, there is no distinction between the effective and census population sizes. Equivalently, we can say that in this model, Ne coincides with N. The Wright-Fisher model therefore lacks an important aspect of biological realism. Here, we present a method that allows Ne to be directly incorporated into the Wright-Fisher model. The modified model involves matrices whose size is determined by Ne. Thus apart from increased biological realism, the modified model also has reduced computational complexity, particularly so when Ne⪡N. For complex problems, it may be hard or impossible to numerically analyse the most commonly-used approximation of the Wright-Fisher model that incorporates Ne, namely the diffusion approximation. An alternative approach is simulation. However, the simulations need to be sufficiently detailed that they yield an effective size that is different to the census size. Simulations may also be time consuming and have attendant statistical errors. The method presented in this work may then be the only alternative to simulations, when Ne differs from N. We illustrate the straightforward application of the method to some problems involving allele fixation and the determination of the equilibrium site frequency spectrum. We then apply the method to the problem of fixation when three alleles are segregating in a population. This latter problem is significantly more complex than a two allele problem and since the diffusion equation cannot be numerically solved, the only other way Ne can be incorporated into the analysis is by simulation. We have achieved good accuracy in all cases considered. In summary, the present work extends the realism and tractability of an important model of evolutionary biology and population genetics.
赖特 - 费希尔模型是进化生物学和群体遗传学中的一个重要模型。它已被应用于对具有离散世代的有限群体的众多分析中。人们认识到,实际群体在某些关键方面的表现,就好像其大小不是普查规模(N),而是一个更小的规模,即有效群体大小(Ne)。然而,在赖特 - 费希尔模型中,有效群体大小和普查群体大小之间没有区别。等效地,我们可以说在这个模型中,(Ne)与(N)一致。因此,赖特 - 费希尔模型缺乏生物学现实性的一个重要方面。在这里,我们提出一种方法,使(Ne)能够直接纳入赖特 - 费希尔模型。修改后的模型涉及大小由(Ne)确定的矩阵。因此,除了增加生物学现实性外,修改后的模型还降低了计算复杂度,当(Ne\ll N)时尤其如此。对于复杂问题,可能很难或无法对纳入(Ne)的赖特 - 费希尔模型最常用的近似,即扩散近似进行数值分析。一种替代方法是模拟。然而,模拟需要足够详细,以产生与普查规模不同的有效大小。模拟也可能很耗时且存在伴随的统计误差。当(Ne)与(N)不同时,本文提出的方法可能是模拟的唯一替代方法。我们说明了该方法在一些涉及等位基因固定和平衡位点频率谱确定问题上的直接应用。然后我们将该方法应用于群体中三个等位基因分离时的固定问题。后一个问题比两个等位基因的问题复杂得多,并且由于扩散方程无法进行数值求解,将(Ne)纳入分析的唯一其他方法是通过模拟。在所有考虑的情况下,我们都取得了良好的精度。总之,本工作扩展了进化生物学和群体遗传学一个重要模型的现实性和可处理性。