Ozolin' O N, Deev A A, Masulis I S, Chasov V V, Kostianitsyna E G, Purtov Iu A, Arkhipov I V, Brok-Volchanskiĭ A S
Institute of Cell Biophysics, Russian Academy of Sciences, Pushchino, Moscow Region, 142290 Russia.
Biofizika. 2002 Sep-Oct;47(5):809-19.
A number of additional structural elements were identified by statistic analysis of nucleotide sequences in promoters recognized by Escherichia coli RNA polymerase. Together with canonical hexanucleotides, these elements characterize different levels in the structural organization of promoter DNA. Sequence motifs exhibiting the highest statistical significance, which dominate in the contact regions with RNA polymerase alpha and sigma subunits, are considered as targets for specific interaction with RNA polymerase. A typical feature of these elements is the presence of easily deformable dinucleotides (TG, CA and TA) or tracts containing only A/T base pairs. Thus, we noticed that the frequency of occurrence of TA in the promoter DNA is essentially higher than the average value for the genome. Besides the regions of specific interaction with RNA polymerase, these dinucleotides are often located in the number of other sites periodically distributed along the promoter DNA. This preferred disposition suggests that deformable elements participate in the adaptive conformational transitions of the promoter DNA favoring optimal configuration of the transcription complex. Probably, the most important feature of promoter DNA revealed by statistic analysis is the presence of A/T-tracts regularly distributed in the wide range from -160 up to +75 relative to the transcription start point. Both of these spatially distributed elements (TA dinucleotides and A/T-tracts) are linked with canonical regions and, therefore, may contribute to the conformational or dynamic features of the transcription machinery. Having high statistic significance, these elements might be considered as additional factors discriminating the promoter DNA on the background of other nucleotide sequences in the genome.
通过对大肠杆菌RNA聚合酶识别的启动子中核苷酸序列进行统计分析,鉴定出了许多额外的结构元件。这些元件与典型的六核苷酸一起,表征了启动子DNA结构组织的不同层次。在与RNA聚合酶α亚基和σ亚基的接触区域中占主导地位、具有最高统计显著性的序列基序,被视为与RNA聚合酶特异性相互作用的靶点。这些元件的一个典型特征是存在易于变形的二核苷酸(TG、CA和TA)或仅含A/T碱基对的片段。因此,我们注意到启动子DNA中TA的出现频率明显高于基因组的平均值。除了与RNA聚合酶特异性相互作用的区域外,这些二核苷酸还经常位于沿启动子DNA周期性分布的其他一些位点。这种优先分布表明,可变形元件参与了启动子DNA的适应性构象转变,有利于转录复合物的最佳构象。统计分析揭示的启动子DNA最重要的特征可能是,相对于转录起始点,在从-160到+75的广泛范围内,存在规则分布的A/T片段。这两个空间分布的元件(TA二核苷酸和A/T片段)都与典型区域相连,因此可能有助于转录机制的构象或动态特征。由于具有高统计显著性,这些元件可被视为在基因组中其他核苷酸序列背景下区分启动子DNA的额外因素。