Lercher Martin J, Hurst Laurence D
Department of Biology and Biochemistry, University of Bath, Claverton Down, Bath, Somerset BA2 7AY, UK.
Gene. 2002 Oct 30;300(1-2):53-8. doi: 10.1016/s0378-1119(02)01040-5.
One of the most abiding controversies in evolutionary biology concerns the role of neutral processes in molecular evolution. A main focus of the debate has been the evolution of isochores, the strong and systematic variation of base composition in mammalian genomes. One set of hypotheses argue that regions of similar GC are owing to localised mutational biases coupled with neutral evolution. The alternatives point to either selection or biased gene conversion as mechanisms to preferentially remove A or T bases, favouring G and C instead. Using a novel method, we compare models including such fixation biases to models based on mutation bias alone, under the assumption that non-coding, non-repetitive human DNA is at compositional equilibrium. While failing to fully explain the allele frequency distributions of recent single nucleotide polymorphism data, we show that the data are best fitted if the mutation bias is assumed to be constant across the genome, while fixation bias varies with GC content. We also attempt to estimate the strength of fixation bias, which increases linearly with increasing GC. Our approximation suggests that this force exists within the necessary parameter range: it is not so weak as to be drowned by random drift, but not so strong as to lead to exclusive use of G and C alone. Together these results demonstrate that mutation bias fails to explain the evolution of isochores, and suggest that either selection or biased gene conversion are involved.
进化生物学中最持久的争议之一涉及中性过程在分子进化中的作用。争论的一个主要焦点是等密度区的进化,即哺乳动物基因组中碱基组成的强烈而系统的变化。一组假设认为,相似GC含量的区域是由于局部突变偏差以及中性进化所致。另一种观点则指出,选择或偏向性基因转换是优先去除A或T碱基、转而青睐G和C碱基的机制。我们采用一种新方法,在非编码、非重复的人类DNA处于组成平衡的假设下,将包含这种固定偏差的模型与仅基于突变偏差的模型进行比较。虽然未能完全解释近期单核苷酸多态性数据的等位基因频率分布,但我们表明,如果假设突变偏差在全基因组中是恒定的,而固定偏差随GC含量变化,那么数据能得到最佳拟合。我们还试图估计固定偏差的强度,它随GC含量的增加呈线性增加。我们的近似结果表明,这种作用力存在于必要的参数范围内:它不至于弱到被随机漂变淹没,但也不至于强到导致仅使用G和C碱基。这些结果共同表明,突变偏差无法解释等密度区的进化,并暗示选择或偏向性基因转换参与其中。