Larhammar D, Chatzidimitriou-Dreismann C A
Department of Medical Genetics, Uppsala University, Sweden.
Nucleic Acids Res. 1993 Nov 11;21(22):5167-70. doi: 10.1093/nar/21.22.5167.
The occurrence of certain long-range correlations between nucleotides in DNA sequences of living organisms has recently been reported. The biological origin of these correlations was unknown. The correlations were proposed to be related to fractal structure and to differences between intron-containing and intron-less sequences. We and others have reported that no consistent difference exists between intron-containing and intron-less sequences. In agreement with this, we demonstrate here that the long-range correlations are trivially equivalent to the varying ratio R between pyrimidines and purines (or any other nucleotide combination) in different regions of a DNA sequence. Moreover, we show that this variation in R has simple biological explanations: differences in base composition occur along most DNA sequences and are associated with (i) simple repeats, (ii) differences in codon composition (due to the amino acid composition of the encoded protein), (iii) changes in the direction of transcription (and thus also translation), and (iv) differences between protein- and rRNA-encoding segments. Seven biological examples are given.
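The central claim, that a regionally varying pyrimidine/purine ratio R is enough to produce apparent long-range structure, can be illustrated with a minimal sketch. It is not the authors' procedure: the function names, the window size, the toy sequence, and the cumulative walk (+1 per pyrimidine, -1 per purine) are all assumptions made here for illustration.

```python
from itertools import accumulate

PYRIMIDINES = set("CT")
PURINES = set("AG")


def windowed_ratio(seq, window):
    """Pyrimidine/purine ratio R in consecutive non-overlapping windows."""
    seq = seq.upper()
    ratios = []
    for start in range(0, len(seq) - window + 1, window):
        chunk = seq[start:start + window]
        n_pyr = sum(base in PYRIMIDINES for base in chunk)
        n_pur = sum(base in PURINES for base in chunk)
        # A window without purines has no finite ratio.
        ratios.append(n_pyr / n_pur if n_pur else float("inf"))
    return ratios


def cumulative_walk(seq):
    """Running sum: +1 per pyrimidine, -1 per purine, 0 for other symbols."""
    steps = [
        1 if base in PYRIMIDINES else -1 if base in PURINES else 0
        for base in seq.upper()
    ]
    return list(accumulate(steps))


# Hypothetical toy sequence (not real data): a pyrimidine-rich region
# followed by a purine-rich region of the same length.
toy = "CTCTTCCTAC" * 5 + "AGGAAGAGTG" * 5

ratios = windowed_ratio(toy, 25)
walk = cumulative_walk(toy)

print(ratios)      # R is well above 1 in the first half, well below 1 in the second
print(max(walk))   # the walk drifts upward, then back down
```

In this toy case the walk climbs steadily through the pyrimidine-rich region and falls through the purine-rich one. The drift follows from the regional difference in R alone, with no additional correlation mechanism.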