关于统计物理学在进化生物学中的应用。

On the application of statistical physics to evolutionary biology.

作者信息

Barton N H, Coe J B

机构信息

Institute of Evolutionary Biology, School of Biological Sciences, University of Edinburgh, Kings Buildings, Edinburgh EH9 3JT, UK.

出版信息

J Theor Biol. 2009 Jul 21;259(2):317-24. doi: 10.1016/j.jtbi.2009.03.019. Epub 2009 Apr 5.

DOI:10.1016/j.jtbi.2009.03.019

PMID:19348811

Abstract

There is a close analogy between statistical thermodynamics and the evolution of allele frequencies under mutation, selection and random drift. Wright's formula for the stationary distribution of allele frequencies is analogous to the Boltzmann distribution in statistical physics. Population size, 2N, plays the role of the inverse temperature, 1/kT, and determines the magnitude of random fluctuations. Log mean fitness, log(W), tends to increase under selection, and is analogous to a (negative) energy; a potential function, U, increases under mutation in a similar way. An entropy, S(H), can be defined which measures the deviation from the distribution of allele frequencies expected under random drift alone; the sum G=E[log(W)+U+S(H)] gives a free fitness that increases as the population evolves towards its stationary distribution. Usually, we observe the distribution of a few quantitative traits that depend on the frequencies of very many alleles. The mean and variance of such traits are analogous to observable quantities in statistical thermodynamics. Thus, we can define an entropy, S(Omega), which measures the volume of allele frequency space that is consistent with the observed trait distribution. The stationary distribution of the traits is exp[2N(log(W)+U+S(Omega))]; this applies with arbitrary epistasis and dominance. The entropies S(Omega), S(H) are distinct, but converge when there are so many alleles that traits fluctuate close to their expectations. Populations tend to evolve towards states that can be realised in many ways (i.e., large S(Omega)), which may lead to a substantial drop below the adaptive peak; we illustrate this point with a simple model of genetic redundancy. This analogy with statistical thermodynamics brings together previous ideas in a general framework, and justifies a maximum entropy approximation to the dynamics of quantitative traits.

摘要

统计热力学与在突变、选择和随机漂变作用下等位基因频率的演化之间存在紧密的类比关系。赖特关于等位基因频率平稳分布的公式类似于统计物理学中的玻尔兹曼分布。种群大小(2N)起到逆温度(1/kT)的作用，并决定随机波动的幅度。对数平均适应度(\log(W))在选择作用下往往会增加，类似于（负的）能量；一个势函数(U)在突变作用下以类似的方式增加。可以定义一个熵(S(H))，它衡量与仅在随机漂变下预期的等位基因频率分布的偏差；总和(G = E[\log(W) + U + S(H)])给出一个自由适应度，随着种群向其平稳分布演化而增加。通常，我们观察到一些数量性状的分布，这些性状取决于非常多等位基因的频率。此类性状的均值和方差类似于统计热力学中的可观测数量。因此，我们可以定义一个熵(S(\Omega))，它衡量与观察到的性状分布一致的等位基因频率空间的体积。性状的平稳分布为(\exp[2N(\log(W) + U + S(\Omega))])；这适用于任意上位性和显性情况。熵(S(\Omega))、(S(H))是不同的，但当等位基因数量足够多以至于性状波动接近其预期值时会收敛。种群倾向于朝着可以通过多种方式实现的状态（即大的(S(\Omega))）演化，这可能导致在适应峰以下大幅下降；我们用一个简单的基因冗余模型来说明这一点。这种与统计热力学的类比将先前的观点整合到一个通用框架中，并为数量性状动力学的最大熵近似提供了依据。