Panova Vera V, Dolinnaya Nina G, Novoselov Kirill A, Savitskaya Viktoriia Yu, Chernykh Ivan S, Kubareva Elena A, Alexeevski Andrei V, Zvereva Maria I
Department of Bioengineering and Bioinformatics, Lomonosov Moscow State University, Leninskie Gory 1, Moscow 119234, Russia.
Department of Chemistry, Lomonosov Moscow State University, Leninskie Gory 1, Moscow 119991, Russia.
Life (Basel). 2023 Jun 29;13(7):1478. doi: 10.3390/life13071478.
Somatic mutations in the promoter region of the human telomerase reverse transcriptase () gene have been identified in many types of cancer. The promoter is known to be enriched with sequences that enable the formation of G-quadruplex (G4) structures, whose presence is associated with elevated mutagenicity and genome instability. Here, we used a bioinformatics tool (QGRS mapper) to search for G4-forming sequences (G4 motifs) in the 1000 bp promoter regions of 141 mammalian species belonging to 20 orders, 5 of which, including primates and predators, contain more than 10 species. Groups of conserved G4 motifs and single-nucleotide variants within these groups were discovered using a block alignment approach (based on the Nucleotide PanGenome explorer). It has been shown that: (i) G4 motifs are predominantly located in the region proximal to the transcription start site (up to 400 bp) and are over-represented on the non-coding strand of the promoters, (ii) 11 to 22% of the G4 motifs found are evolutionarily conserved across the related organisms, and (iii) a statistically significant higher frequency of nucleotide substitutions in the conserved G4 motifs compared to the surrounding regions was confirmed only for the order . These data support the assumption that G4s can interfere with the DNA repair process and affect the evolutionary adaptation of organisms and species.
在多种癌症中已鉴定出人类端粒酶逆转录酶(hTERT)基因启动子区域的体细胞突变。已知hTERT启动子富含能够形成G-四链体(G4)结构的序列,其存在与致突变性升高和基因组不稳定有关。在这里,我们使用一种生物信息学工具(QGRS mapper)在属于20个目、其中5个目(包括灵长目和食肉目,每个目包含超过10个物种)的141种哺乳动物的1000 bp hTERT启动子区域中搜索G4形成序列(G4基序)。使用一种基于核苷酸全基因组浏览器的比对方法发现了保守的G4基序组以及这些组内的单核苷酸变体。结果表明:(i)G4基序主要位于转录起始位点近端区域(达400 bp),并且在hTERT启动子的非编码链上过度富集;(ii)所发现的G4基序中有11%至22%在相关生物体中是进化保守的;(iii)仅在灵长目动物中证实,与周围区域相比,保守G4基序中的核苷酸替换频率在统计学上显著更高。这些数据支持了G4可能干扰DNA修复过程并影响生物体和物种进化适应的假设。