Elhaik Eran, Landan Giddy, Graur Dan
Department of Biology and Biochemistry, University of Houston, TX, USA.
Mol Biol Evol. 2009 Aug;26(8):1829-33. doi: 10.1093/molbev/msp100. Epub 2009 May 14.
The isochore theory depicts the genomes of warm-blooded vertebrates as a mosaic of long genomic regions that are characterized by relatively homogeneous GC content. In the absence of genomic data, the GC content at third-codon positions of protein-coding genes (GC3) was commonly used as a proxy for the GC content of isochores. Oddly, in the postgenomic era, GC3 is still sometimes used as a proxy for the GC composition of isochores. Here, we use genic and genomic sequences from human, chimpanzee, cow, mouse, rat, chicken, and zebrafish to show that GC3 only explains a very small proportion of the variation in GC content of long genomic sequences flanking the genes (GCf), and what little correlation there is between GC3 and GCf was found to decay rapidly with distance from the gene. The coefficient of variation of GC3 was found to be much larger than that of GCf and, therefore, GC3 and GCf values are not comparable with each other. Comparisons of orthologous gene pairs from 1) human and chimpanzee and 2) mouse and rat show strong correlations between their GC3 values, but very weak correlations between their GCf values. We conclude that the GC content of third-codon position cannot be used as stand-in for isochoric composition.
等容线理论将温血脊椎动物的基因组描绘为由具有相对均一GC含量的长基因组区域组成的镶嵌体。在缺乏基因组数据的情况下,蛋白质编码基因第三密码子位置的GC含量(GC3)通常被用作等容线GC含量的替代指标。奇怪的是,在后基因组时代,GC3有时仍被用作等容线GC组成的替代指标。在这里,我们使用来自人类、黑猩猩、牛、小鼠、大鼠、鸡和斑马鱼的基因和基因组序列表明,GC3仅解释了基因侧翼长基因组序列(GCf)GC含量变异的很小一部分,并且发现GC3与GCf之间的微弱相关性会随着与基因距离的增加而迅速衰减。发现GC3的变异系数远大于GCf的变异系数,因此,GC3和GCf值彼此不可比。对来自1)人类和黑猩猩以及2)小鼠和大鼠的直系同源基因对的比较表明,它们的GC3值之间存在强相关性,但它们的GCf值之间的相关性非常弱。我们得出结论,第三密码子位置的GC含量不能用作等容线组成的替代指标。