Chou S H, Zhu L, Reid B R
Biochemistry Department, University of Washington, Seattle, 98195, USA.
J Mol Biol. 1996 Jun 14;259(3):445-57. doi: 10.1006/jmbi.1996.0331.
Tandem repeats of the human centromere DNA pentamer sequence TGRAA form stable "self-complementary" [TGRAATGRAA]2 duplexes (R = G or A) containing the GA-bracketed unpaired purine stack motif. In contrast, their phase-shifted variants NAATGNAATG (N = A, G, C, T) were found to exist in solution as an equilibrium mixture of a duplex containing the GA-bracketed unpaired stack motif and a hairpin containing a single-residue loop closed by a sheared G x A pair. For GNA triplets, the stability of the hairpin form relative to the duplex form was found to follow the order GCA > GAA/GTA >> GGA, with the CAATGCAATG sequence mostly in the hairpin form and the GAATGGAATG sequence mostly in the [GAATGGAATG]2 duplex form. The chemical shifts of the H1' and H4' protons of the central N residue in GNA triplets were found to differ markedly between the duplex and hairpin forms and are diagnostic indicators of which conformation the oligonucleotide adopts. Comparison between the structures of the G x A-closed C loop motif and the G x A-bracketed unpaired G-stack [GGA]2 motif reveals remarkably similar stacking by the loop C residue and the intercalated G residue on the adjacent sheared G x A pair. The anomalous upfield chemical shifts of the H1' and H4' protons in [GGA]2 motifs and the H4' proton in GCA loops, and the different sugar conformations in these two motifs, can be explained by interstrand versus intrastrand stacking of the central (G or C) deoxyribose with the adenine base. Based on these studies, a DNA sequence GTGGAATGGAATGGAAC was designed and shown to form a duplex containing three [GGA]2 motifs, while its 9G→9C analog GTGGAATGCAATGGAAC was found to adopt a stable hairpin containing a [GGA]2 motif in the stem and a G x A-closed single C-loop.