Fontana W, Schuster P
Institut für Theoretische Chemie, Universität Wien, Wien, A-1090, Austria.
J Theor Biol. 1998 Oct 21;194(4):491-515. doi: 10.1006/jtbi.1998.0771.
Understanding which phenotypes are accessible from which genotypes is fundamental for understanding the evolutionary process. This notion of accessibility can be used to define a relation of nearness among phenotypes, independently of their similarity. Because of neutrality, phenotypes denote equivalence classes of genotypes. The definition of neighborhood relations among phenotypes relies, therefore, on the statistics of neighborhood relations among equivalence classes of genotypes in genotype space. The folding of RNA sequence (genotypes) into secondary structures (phenotypes) is an ideal case to implement these concepts. We study the extent to which the folding of RNA sequence induces a "statistical topology" on the set of minimum free energy secondary structures. The resulting nearness relation suggests a notion of "continuous" structure transformation. We can, then rationalize major transitions in evolutionary trajectories at the level of RNA structures by identifying those transformations which are irreducibly discontinuous. This is shown by means of computer simulations. The statistical topology organizing the set of RNA shapes explains why neutral drift in sequence space plays a key role in evolutionary optimization.
理解从哪些基因型可获得哪些表型是理解进化过程的基础。这种可达性的概念可用于定义表型之间的接近关系,而与它们的相似性无关。由于中性,表型表示基因型的等价类。因此,表型之间邻域关系的定义依赖于基因型空间中等价类之间邻域关系的统计。RNA序列(基因型)折叠成二级结构(表型)是实现这些概念的理想情况。我们研究RNA序列折叠在最小自由能二级结构集上诱导“统计拓扑”的程度。由此产生的接近关系暗示了“连续”结构转变的概念。然后,我们可以通过识别那些不可简化的不连续转变,在RNA结构水平上合理化进化轨迹中的主要转变。这通过计算机模拟得到了证明。组织RNA形状集的统计拓扑解释了序列空间中的中性漂移在进化优化中为何起关键作用。