Du Zhuo, Zhao Yiqiang, Li Ning
State Key Laboratory for Agrobiotechnology, China Agricultural University, Beijing, 10094, People's Republic of China.
Genome Res. 2008 Feb;18(2):233-41. doi: 10.1101/gr.6905408. Epub 2007 Dec 20.
G-quadruplex or G4 DNA, a four-stranded DNA structure formed in G-rich sequences, has been hypothesized to be a structural motif involved in gene regulation. In this study, we examined the regulatory role of potential G4 DNA motifs (PG4Ms) located in the putative transcriptional regulatory region (TRR, -500 to +500) of genes across the human genome. We found that PG4Ms in the 500-bp region downstream of the annotated transcription start site (TSS; PG4M(D500)) are associated with gene expression. Generally, PG4M(D500)-positive genes are expressed at higher levels than PG4M(D500)-negative genes, and an increased number of PG4M(D500) provides a cumulative effect. This observation was validated by controlling for attributes, including gene family, function, and promoter similarity. We also observed an asymmetric pattern of PG4M(D500) distribution between strands, whereby the frequency of PG4M(D500) in the coding strand is generally higher than that in the template strand. Further analysis showed that the presence of PG4M(D500) and its strand asymmetry are associated with significant enrichment of RNAP II at the putative TRR. On the basis of these results, we propose a model of G4 DNA-mediated stimulation of transcription with the hypothesis that PG4M(D500) contributes to gene transcription by maintaining the DNA in an open conformation, while the asymmetric distribution of PG4M(D500) considerably reduces the probability of blocking the progression of the RNA polymerase complex on the template strand. Our findings provide a comprehensive view of the regulatory function of G4 DNA in gene transcription.
G-四链体或G4 DNA是在富含鸟嘌呤(G)的序列中形成的一种四链DNA结构,据推测它是一种参与基因调控的结构基序。在本研究中,我们检测了位于人类基因组中各基因假定转录调控区域(TRR,-500至+500)的潜在G4 DNA基序(PG4M)的调控作用。我们发现,注释转录起始位点(TSS)下游500 bp区域内的PG4M(PG4M(D500))与基因表达相关。一般来说,PG4M(D500)阳性基因的表达水平高于PG4M(D500)阴性基因,并且PG4M(D500)数量的增加具有累积效应。通过控制基因家族、功能和启动子相似性等属性,这一观察结果得到了验证。我们还观察到PG4M(D500)在两条链之间的分布存在不对称模式,即编码链中PG4M(D500)的频率通常高于模板链。进一步分析表明,PG4M(D500)的存在及其链不对称性与假定TRR处RNA聚合酶II的显著富集相关。基于这些结果,我们提出了一个G4 DNA介导的转录刺激模型,假设PG4M(D500)通过使DNA保持开放构象来促进基因转录,而PG4M(D500)的不对称分布大大降低了模板链上RNA聚合酶复合物前进受阻的概率。我们的研究结果全面展示了G4 DNA在基因转录中的调控功能。