Engelhardt Institute of Molecular Biology, Russian Academy of Sciences, Vavilova, 32, Moscow 119991, Russia.
Department of Information and Internet Technologies of the Institute of Digital Medicine, Sechenov University, 8-2 Trubetskaya str., Moscow 119991, Russia.
Int J Mol Sci. 2023 May 3;24(9):8199. doi: 10.3390/ijms24098199.
RNA polymerase II (POL II) is responsible for the transcription of messenger RNAs (mRNAs) and long non-coding RNAs (lncRNAs). Previously, we showed that the structural features of DNA in the POL II core promoters of mRNA precursors are evolutionarily invariant. In this work, we analyzed the POL II core promoters of lncRNA precursors in and genomes. Structural analysis of the nucleotide sequences at positions -50 to +30 bp relative to the TSS revealed an extremely heterogeneous 3D structure that includes two singular regions: the hexanucleotide "INR" around the TSS and the octanucleotide "TATA-box" at about -28 bp upstream. Thus, the 3D structure of lncRNA core promoters resembles the architecture of mRNA core promoters. Textual analysis, however, revealed differences between the two: the informational entropy at each position of the nucleotide text of lncRNA core promoters (with the exception of the singular regions) is significantly higher than that of mRNA core promoters. Another distinguishing feature of lncRNA promoters is the extremely rare occurrence in the TATA box of octanucleotides with the consensus sequence. These textual differences can significantly affect the efficiency of lncRNA transcription.
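The per-position informational entropy mentioned in the abstract can be illustrated with a minimal sketch. It assumes the standard Shannon entropy, H = sum over bases of p * log2(1/p), computed column by column over promoter sequences aligned on the TSS; the abstract does not state the exact formula or software used, so the function name `positional_entropy` and the toy sequences below are hypothetical and are not data from the study.

```python
import math
from collections import Counter


def positional_entropy(sequences):
    """Return the Shannon entropy (in bits) of each column of aligned sequences.

    Assumption: all sequences have equal length and are aligned on the same
    reference point (for promoters, the TSS).
    """
    if not sequences:
        raise ValueError("need at least one sequence")
    length = len(sequences[0])
    if any(len(seq) != length for seq in sequences):
        raise ValueError("all sequences must have the same length")

    entropies = []
    # zip(*sequences) yields one tuple of bases per alignment column.
    for column in zip(*sequences):
        counts = Counter(base.upper() for base in column)
        total = sum(counts.values())
        # H = sum p * log2(1/p); a column with a single base gives 0 bits.
        h = sum((n / total) * math.log2(total / n) for n in counts.values())
        entropies.append(h)
    return entropies


# Toy, made-up sequences for illustration only (not real promoter data).
toy_alignment = ["TATA", "TATA", "TACA", "TAGA"]
toy_entropy = positional_entropy(toy_alignment)
print(toy_entropy)
```

For DNA the value at each position ranges from 0 bits (one base in every sequence) to 2 bits (all four bases equally frequent), so a higher profile outside the INR and TATA-box regions corresponds to a less constrained nucleotide text.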