Gravner Janko, Pitman Damien, Gavrilets Sergey
Department of Mathematics, University of California, Davis, CA 95616, USA.
J Theor Biol. 2007 Oct 21;248(4):627-45. doi: 10.1016/j.jtbi.2007.07.009. Epub 2007 Jul 18.
We study how correlations in the random fitness assignment may affect the structure of fitness landscapes, in three classes of fitness models. The first is a phenotype space in which individuals are characterized by a large number n of continuously varying traits. In a simple model of random fitness assignment, viable phenotypes are likely to form a giant connected cluster percolating throughout the phenotype space provided the viability probability is larger than 1/2(n). The second model explicitly describes genotype-to-phenotype and phenotype-to-fitness maps, allows for neutrality at both phenotype and fitness levels, and results in a fitness landscape with tunable correlation length. Here, phenotypic neutrality and correlation between fitnesses can reduce the percolation threshold, and correlations at the point of phase transition between local and global are most conducive to the formation of the giant cluster. In the third class of models, particular combinations of alleles or values of phenotypic characters are "incompatible" in the sense that the resulting genotypes or phenotypes have zero fitness. This setting can be viewed as a generalization of the canonical Bateson-Dobzhansky-Muller model of speciation and is related to K-SAT problems, prominent in computer science. We analyze the conditions for the existence of viable genotypes, their number, as well as the structure and the number of connected clusters of viable genotypes. We show that analysis based on expected values can easily lead to wrong conclusions, especially when fitness correlations are strong. We focus on pairwise incompatibilities between diallelic loci, but we also address multiple alleles, complex incompatibilities, and continuous phenotype spaces. In the case of diallelic loci, the number of clusters is stochastically bounded and each cluster contains a very large sub-cube. Finally, we demonstrate that the discrete NK model shares some signature properties of models with high correlations.
我们研究了随机适应度分配中的相关性如何影响三类适应度模型中的适应度景观结构。第一类是一个表型空间,其中个体由大量(n)个连续变化的性状表征。在一个随机适应度分配的简单模型中,只要生存概率大于(1/2(n)),可行的表型很可能形成一个贯穿表型空间的巨大连通簇。第二个模型明确描述了基因型到表型以及表型到适应度的映射,允许在表型和适应度水平上存在中性,并产生一个具有可调相关长度的适应度景观。在这里,表型中性和适应度之间的相关性可以降低渗流阈值,并且在局部和全局之间的相变点处的相关性最有利于巨大簇的形成。在第三类模型中,等位基因的特定组合或表型特征的值在产生的基因型或表型具有零适应度的意义上是“不相容的”。这种设置可以被视为物种形成的经典贝茨森 - 多布赞斯基 - 穆勒模型的推广,并且与计算机科学中突出的(K - SAT)问题相关。我们分析了可行基因型存在的条件、它们的数量,以及可行基因型的连通簇的结构和数量。我们表明,基于期望值的分析很容易导致错误的结论,特别是当适应度相关性很强时。我们专注于双等位基因位点之间的成对不相容性,但我们也处理多等位基因、复杂不相容性和连续表型空间。在双等位基因位点的情况下,簇的数量是随机有界的,并且每个簇包含一个非常大的子立方体。最后,我们证明离散(NK)模型具有一些高相关性模型的特征属性。