Peng C K, Buldyrev S V, Goldberger A L, Havlin S, Sciortino F, Simons M, Stanley H E
Center for Polymer Studies, Boston University, Massachusetts 02215.
Nature. 1992 Mar 12;356(6365):168-70. doi: 10.1038/356168a0.
DNA sequences have been analysed using models, such as an n-step Markov chain, that incorporate the possibility of short-range nucleotide correlations. We propose here a method for studying the stochastic properties of nucleotide sequences by constructing a 1:1 map of the nucleotide sequence onto a walk, which we term a 'DNA walk'. We then use the mapping to provide a quantitative measure of the correlation between nucleotides over long distances along the DNA chain. Thus we uncover in the nucleotide sequence a remarkably long-range power law correlation that implies a new scale-invariant property of DNA. We find such long-range correlations in intron-containing genes and in nontranscribed regulatory DNA sequences, but not in complementary DNA sequences or intron-less genes.
DNA序列已使用诸如n步马尔可夫链等模型进行了分析,这些模型考虑了短程核苷酸相关性的可能性。我们在此提出一种方法,通过构建核苷酸序列与一种游走的1:1映射(我们称之为“DNA游走”)来研究核苷酸序列的随机特性。然后,我们利用这种映射来定量测量沿DNA链长距离核苷酸之间的相关性。因此,我们在核苷酸序列中发现了一种显著的长程幂律相关性,这意味着DNA具有一种新的尺度不变特性。我们在含内含子的基因和非转录调控DNA序列中发现了这种长程相关性,但在互补DNA序列或无内含子基因中未发现。