Forsdyke D R
Department of Biochemistry, Queen's University, Kingston, Ontario, Canada.
J Mol Evol. 1995 Nov;41(5):573-81. doi: 10.1007/BF00175815.
To an approximation Chargaff's rule (%A = %T; %G = %C) applies to single-stranded DNA. In long sequences, not only complementary bases but also complementary oligonucleotides are present in approximately equal frequencies. This applies to all species studied. However, species usually differ in base composition. With the goal of understanding the evolutionary forces involved, I have compared the frequencies of trinucleotides in long sequences and their shuffled counterparts. Among the 32 complementary trinucleotide pairs there is a hierarchy of frequencies which is influenced both by base composition (not affected by shuffling the order of the bases) and by base order (affected by shuffling). The influence of base order is greatest in DNA of 50% G + C and seems to reflect a more fundamental hierarchy of dinucleotide frequencies. Thus if TpA is at low frequency, all eight TpA-containing trinucleotides are at low frequency. Mammals and their viruses share similar hierarchies, with intra- and intergenomic differences being mainly associated with differences in base composition (percentage G + C). E. coli and, to a lesser extent, Drosophila melanogaster hierarchies differ from mammalian hierarchies; this is associated with differences both in base composition and in base order. It is proposed that Chargaff's rule applies to single-stranded DNA because there has been an evolutionary selection pressure favoring mutations that generate complementary oligonucleotides in close proximity, thus creating a potential to form stem-loops. These are dispersed throughout genomes and are rate-limiting in recombination. Differences in (G + C)% between species would impair interspecies recombination by interfering with stem-loop interactions.
近似地说,查加夫法则(%A = %T;%G = %C)适用于单链DNA。在长序列中,不仅互补碱基,而且互补寡核苷酸的出现频率也大致相等。这适用于所有已研究的物种。然而,不同物种的碱基组成通常存在差异。为了理解其中涉及的进化力量,我比较了长序列及其重排对应序列中三核苷酸的频率。在32对互补三核苷酸对中,存在一个频率层次结构,它既受碱基组成(不受碱基顺序重排的影响)影响,也受碱基顺序(受重排影响)影响。碱基顺序的影响在G + C含量为50%的DNA中最大,似乎反映了二核苷酸频率更基本的层次结构。因此,如果TpA频率较低,所有含TpA的八个三核苷酸频率也都较低。哺乳动物及其病毒具有相似的层次结构,基因组内和基因组间的差异主要与碱基组成(G + C百分比)的差异有关。大肠杆菌以及在较小程度上黑腹果蝇的层次结构与哺乳动物的层次结构不同;这与碱基组成和碱基顺序的差异都有关。有人提出查加夫法则适用于单链DNA,是因为存在一种进化选择压力,有利于产生紧密相邻的互补寡核苷酸的突变,从而创造形成茎环的可能性。这些茎环分散在整个基因组中,是重组的限速因素。物种间(G + C)%的差异会通过干扰茎环相互作用而损害种间重组。