Conde J
Mol Genet Genomics. 2003 Dec;270(4):287-95. doi: 10.1007/s00438-003-0871-x. Epub 2003 Nov 5.
Single-stranded chains of biological DNA show widespread parity of complementary nucleotides, i.e., A=T and G=C, which has been referred to as A-T, G-C symmetry. This parity, which this paper calls mirror symmetry, must be distinguished from twofold symmetry, in which complementary nucleotide parity occurs between two segments of the same length, equidistant from a symmetry center, along a single-stranded DNA chain. I have analysed the sequence of Chromosome I of Saccharomyces cerevisiae for the occurrence of complementary nucleotide symmetry. Open reading frame (ORF) sequences made up 63% of the total chromosome length, and most of them were asymmetric for both A-T and G-C. The sign of A-T asymmetry was correlated with transcriptional orientation (A>T for sense and A<T for antisense ORFs), whereas the sign of G-C asymmetry was not. However, long single-stranded segments of Chromosome I were A-T mirror symmetric because they contained similar frequencies of ORFs in both transcriptional orientations. The same results were obtained with the AA-TT pair of complementary dinucleotides. Profiling of AA-TT symmetry along Chromosome I showed that the chromosome is organized as a succession of five domains that are twofold symmetric for AA-TT, placed between two subtelomeric regions without clear symmetry properties. This pattern was destroyed when ORF sequences were randomly repositioned along the chromosome. Based on these findings, an architectural model is proposed for Chromosome I in which the twofold symmetric domains, 30 to 50 kb long, correspond to chromosome loops.
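The two notions of symmetry in the abstract can be illustrated with a minimal Python sketch. It is not the paper's method: the function names (`count_overlapping`, `skew`, `twofold_deviation`), the normalised skew measure (x - y)/(x + y), and the toy sequence are all assumptions introduced here for illustration. Mirror symmetry is read as a near-zero skew within one segment, and twofold symmetry as the count of a motif in the left segment matching the count of its complement in the right segment.

```python
def count_overlapping(seq: str, motif: str) -> int:
    """Count occurrences of motif in seq, allowing overlaps ('AAA' holds two 'AA')."""
    last_start = len(seq) - len(motif) + 1
    return sum(1 for i in range(last_start) if seq.startswith(motif, i))


def skew(seq: str, x: str = "A", y: str = "T") -> float:
    """Normalised asymmetry (x - y) / (x + y); 0.0 means mirror symmetry for the pair.

    The normalisation is an assumption of this sketch, not taken from the paper.
    """
    nx, ny = count_overlapping(seq, x), count_overlapping(seq, y)
    total = nx + ny
    return (nx - ny) / total if total else 0.0


def twofold_deviation(seq: str, center: int, offset: int, length: int,
                      x: str = "A", y: str = "T") -> tuple[int, int]:
    """Compare two equal-length segments equidistant from `center`.

    Returns (x_left - y_right, y_left - x_right); (0, 0) means exact
    complementary parity between the two segments (hypothetical measure).
    """
    start, end = center - offset - length, center + offset + length
    if start < 0 or end > len(seq):
        raise ValueError("segments extend past the ends of the sequence")
    left = seq[start:center - offset]
    right = seq[center + offset:end]
    return (count_overlapping(left, x) - count_overlapping(right, y),
            count_overlapping(left, y) - count_overlapping(right, x))


# Toy sequence: the right half is the reverse complement of the left half.
toy = "AAAAGC" + "GCTTTT"
left_skew = skew(toy[:6])            # left half alone is A-rich
whole_skew = skew(toy)               # the whole chain balances A against T
at_dev = twofold_deviation(toy, center=6, offset=0, length=6)
aatt_dev = twofold_deviation(toy, center=6, offset=0, length=6, x="AA", y="TT")
```

In this toy case each half is strongly asymmetric on its own, while the whole chain is mirror symmetric and the two halves are twofold symmetric for both A-T and AA-TT. On real chromosome data one would expect approximate rather than exact parity, so a tolerance on the deviation would be needed.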