Marx K A, Hess S T, Blake R D
Department of Chemistry, University of Massachusetts Lowell 01854.
J Biomol Struct Dyn. 1993 Aug;11(1):57-66. doi: 10.1080/07391102.1993.10508709.
D. discoideum, the slime mold, is one of the most AT rich eukaryotic genomes known. In this paper we examine this organism's database for overlapping N-tuples of high frequency and find A and T tracts possess among the highest frequencies in flanking sequences but not in coding sequences. We examined both overlapping and non-overlapping frequencies of the A, T, G and C homopolymer tracts of 2 < N < 6. Overlapping (dG).(dC) and (dA).(dT) tracts occur at greater frequencies than expected, based on random occurrence. Long (dA).(dT) tracts of N > 10 occur at well above expected frequencies in flanking and intron regions, while (dG).(dC) tracts above N = 5 are rarely found. Some of the implications of these findings for tract origins in slip-strand replication and for chromatin structure are discussed.
盘基网柄菌,一种黏菌,是已知的AT含量最高的真核生物基因组之一。在本文中,我们研究了该生物体数据库中高频重叠N元组,发现A和T序列在侧翼序列中的频率最高,但在编码序列中并非如此。我们研究了2 < N < 6时A、T、G和C同聚物序列的重叠和非重叠频率。基于随机出现的情况,重叠的(dG).(dC)和(dA).(dT)序列出现的频率高于预期。在侧翼和内含子区域,N > 10的长(dA).(dT)序列出现的频率远高于预期,而N = 5以上的(dG).(dC)序列则很少见。本文讨论了这些发现对滑链复制中序列起源和染色质结构的一些影响。