Du Zhuo, Kong Ping, Gao Yu, Li Ning
State Key Laboratory for Agrobiotechnology, China Agricultural University, Beijing, China.
Biochem Biophys Res Commun. 2007 Mar 23;354(4):1067-70. doi: 10.1016/j.bbrc.2007.01.093. Epub 2007 Jan 25.
G-quadruplex or G4 DNA is a stable, four-stranded DNA structure formed from guanine-rich regions. Based on the hypothesis that G4 DNA participated in the regulation of transcription, we analyzed G4 DNA in 5kb 5' flanking regions of 2892 chicken RefSeq genes with annotated transcription start sites (TSS). In total, 4769 distinct putative G4 DNA motifs (G4M) were identified in 1880 (65%) genes. The pattern of distribution of the G4M showed a gradient along the 5' flanking regions; from -5 to -4kb, to -1kb to the TSS, the frequency (number of G4M per kilobase) increased significantly from 0.192 to 0.768, and 62.56% of the G4M in the 1kb upstream regions were located in the region -400 to the TSS, where a core promoter is always present. Thus, 38.24% of the analyzed genes contained at least one G4M in the 400bp upstream region. Our findings support the hypothesis that G4M are involved in gene transcription.
G-四链体或G4 DNA是一种由富含鸟嘌呤的区域形成的稳定的四链DNA结构。基于G4 DNA参与转录调控的假设,我们分析了2892个具有注释转录起始位点(TSS)的鸡RefSeq基因5kb 5'侧翼区域中的G4 DNA。总共在1880个(65%)基因中鉴定出4769个不同的假定G4 DNA基序(G4M)。G4M的分布模式沿5'侧翼区域呈现出梯度变化;从-5至-4kb,到-1kb至TSS,频率(每千碱基的G4M数量)从0.192显著增加至0.768,并且1kb上游区域中62.56%的G4M位于-400至TSS区域,该区域总是存在核心启动子。因此,38.24%的分析基因在400bp上游区域包含至少一个G4M。我们的发现支持G4M参与基因转录的假设。