School of Physical and Mathematical Sciences, Nanyang Technological University, Singapore 637371, Singapore.
School of Biological Sciences, Nanyang Technological University, Singapore 637551, Singapore.
Nucleic Acids Res. 2020 Apr 6;48(6):3315-3327. doi: 10.1093/nar/gkaa008.
G-rich DNA sequences with tracts of three or more continuous guanines (G≥3) are known to have high propensity to adopt stable G-quadruplex (G4) structures. Bioinformatic analyses suggest high prevalence of G-rich sequences with short G-tracts (G≤2) in the human genome. However, due to limited structural studies, the folding principles of such sequences remain largely unexplored and hence poorly understood. Here, we present the solution NMR structure of a sequence named AT26 consisting of irregularly spaced G2 tracts and two isolated single guanines. The structure is a four-layered G4 featuring two bi-layered blocks, locked between themselves in an unprecedented fashion making it a stable scaffold. In addition to edgewise and propeller-type loops, AT26 also harbors two V-shaped loops: a 2-nt V-shaped loop spanning two G-tetrad layers and a 0-nt V-shaped loop spanning three G-tetrad layers, which are named as VS- and VR-loop respectively, based on their distinct structural features. The intra-lock motif can be a basis for extending the G-tetrad core and a very stable intra-locked G4 can be formed by a sequence with G-tracts of various lengths including several G2 tracts. Findings from this study will aid in understanding the folding of G4 topologies from sequences containing irregularly spaced multiple short G-tracts.
富含 G 的 DNA 序列具有三个或更多连续鸟嘌呤(G≥3)的片段,已知具有形成稳定 G-四链体(G4)结构的高倾向。生物信息学分析表明,人类基因组中存在大量富含短 G 片段(G≤2)的 G 序列。然而,由于结构研究有限,这些序列的折叠原则在很大程度上仍未得到探索,因此了解甚少。在这里,我们展示了一个名为 AT26 的序列的溶液 NMR 结构,该序列由不规则间隔的 G2 片段和两个孤立的单个鸟嘌呤组成。该结构是一个四层 G4,具有两个双层块,以一种前所未有的方式相互锁定,使其成为一个稳定的支架。除了边缘和推进器型环外,AT26 还含有两个 V 型环:一个跨越两个 G-四联体层的 2-nt V 型环和一个跨越三个 G-四联体层的 0-nt V 型环,分别称为 VS-和 VR-环,基于其独特的结构特征。内锁基序可以作为扩展 G-四联体核心的基础,并且可以通过具有各种长度的 G 片段的序列形成非常稳定的内锁 G4,包括几个 G2 片段。这项研究的结果将有助于理解含有不规则间隔多个短 G 片段的 G4 拓扑结构的折叠。