Wen Jin-Der, Gray Donald M
Department of Molecular and Cell Biology, The University of Texas at Dallas, Box 830688, Richardson, Texas 75083-0688, USA.
Biochemistry. 2004 Mar 9;43(9):2622-34. doi: 10.1021/bi030177g.
The gene 5 protein (g5p) encoded by filamentous Ff phages is an ssDNA-binding protein, which binds to and sequesters the nascent ssDNA phage genome in the process of phage morphogenesis. The g5p also binds with high affinity to DNA and RNA sequences that form G-quadruplex structures. However, sequences that would form G-quadruplexes are absent in single copies of the phage genome. Using SELEX (systematic evolution of ligands by exponential enrichment), we have now identified a family of DNA hairpin structures to which g5p binds with high affinity. After eight rounds of selection from a library of 58-mers, 26 of 35 sequences of this family contained two regions of complete or partial complementarity. This family of DNA hairpins is represented by the sequence: 5'-d(CGGGATCCAACGTTTTCACCAGATCTACCTCCTCGGGATCCCAAGAGGCAGAATTCGC)-3' (named U-4), where complementary regions are italicized or underlined. Diethyl pyrocarbonate modification, UV-melting profiles, and BamH I digestion experiments revealed that the italicized sequences form an intramolecular hairpin, and the underlined sequences form intermolecular base pairs so that a dimer exists at higher oligomer concentrations. Gel shift assays and end boundary experiments demonstrated that g5p assembles on the hairpin of U-4 to give a discrete, intermediate complex prior to saturation of the oligomer at high g5p concentrations. Thus, biologically relevant sequences at which g5p initiates assembly might be typified better by DNA hairpins than by G-quadruplexes. Moreover, the finding that hairpins of U-4 can dimerize emphasizes the unexpected nature of sequence-dependent structures that can be recognized by the g5p ssDNA-binding protein.
丝状Ff噬菌体编码的基因5蛋白(g5p)是一种单链DNA结合蛋白,在噬菌体形态发生过程中,它与新生的单链DNA噬菌体基因组结合并使其隔离。g5p还以高亲和力与形成G-四链体结构的DNA和RNA序列结合。然而,在噬菌体基因组的单拷贝中不存在会形成G-四链体的序列。利用指数富集配体系统进化技术(SELEX),我们现已鉴定出一个g5p以高亲和力结合的DNA发夹结构家族。从一个58聚体文库中经过八轮筛选后,该家族35个序列中的26个包含两个完全或部分互补区域。这个DNA发夹家族由以下序列代表:5'-d(CGGGATCCAACGTTTTCACCAGATCTACCTCCTCGGGATCCCAAGAGGCAGAATTCGC)-3'(命名为U-4),其中互补区域用斜体或下划线表示。焦碳酸二乙酯修饰、紫外熔解曲线和BamH I消化实验表明,斜体序列形成分子内发夹,下划线序列形成分子间碱基对,因此在较高的寡聚物浓度下存在二聚体。凝胶迁移实验和末端边界实验表明,在高g5p浓度下寡聚物饱和之前,g5p在U-4的发夹上组装形成一个离散的中间复合物。因此,g5p启动组装的生物学相关序列可能用DNA发夹比用G-四链体能更好地代表。此外,U-4发夹能二聚化这一发现强调了g5p单链DNA结合蛋白可识别的序列依赖性结构的意外性质。