Bernaola-Galván P, Carpena P, Román-Roldán R, Oliver J L
Departamento de Física Aplicada II, E.T.S.I. de Telecomunicación, Universidad de Málaga, Málaga, Spain.
Gene. 2002 Oct 30;300(1-2):105-15. doi: 10.1016/s0378-1119(02)01037-5.
Here we present a study of statistical correlations among different positions in DNA sequences and their implications by directly using the autocorrelation function. Such an analysis is possible now because of the availability of large sequences or even complete genomes of many organisms. After describing the way in which the autocorrelation function can be applied to DNA-sequence analysis, we show that long-range correlations, implying scale independence, appear in several bacterial genomes as well as in long human chromosome contigs. The source for such correlations in bacteria, which may extend up to 60 kb in Bacillus subtilis, may be related to massive lateral transfer of compositionally biased genes from other genomes. In the human genome, correlations extend for more than five decades and may be related to the evolution of the 'neogenome', a modern evolutionary acquisition composed by GC-rich isochores displaying long-range correlations and scale invariance.
在此,我们通过直接使用自相关函数,对DNA序列中不同位置之间的统计相关性及其影响进行了一项研究。由于现在有许多生物体的大量序列甚至完整基因组可供使用,这样的分析成为可能。在描述了自相关函数可应用于DNA序列分析的方式之后,我们表明,暗示尺度独立性的长程相关性出现在几个细菌基因组以及人类长染色体重叠群中。细菌中这种相关性的来源,在枯草芽孢杆菌中可能延伸至60 kb,可能与来自其他基因组的成分偏向基因的大量横向转移有关。在人类基因组中,相关性延伸超过五个数量级,可能与“新基因组”的进化有关,“新基因组”是一种现代进化产物,由显示长程相关性和尺度不变性的富含GC的等臂组成。