Braddock Demetrios T, Baber James L, Levens David, Clore G Marius
Laboratory of Chemical Physics, Building 5, National Institute of Diabetes and Digestive and Kidney Diseases, National Institutes of Health, Bethesda, MD 20892-0510, USA.
EMBO J. 2002 Jul 1;21(13):3476-85. doi: 10.1093/emboj/cdf352.
To elucidate the basis of sequence-specific single-stranded (ss) DNA recognition by K homology (KH) domains, we have solved the solution structure of a complex between the KH3 domain of the transcriptional regulator heterogeneous nuclear ribonucleoprotein K (hnRNP K) and a 10mer ssDNA. We show that hnRNP K KH3 specifically recognizes a tetrad of sequence 5'd-TCCC. The complex is stabilized by a dense network of methyl-oxygen hydrogen bonds involving the methyl groups of three isoleucine residues and the O2 and N3 atoms of the two central cytosine bases. Comparison with the recently solved structure of a specific protein-ssDNA complex involving the KH3 and KH4 domains of the far upstream element (FUSE) binding protein FBP suggests that the amino acid located five residues N-terminal of the invariant GXXG motif, which is characteristic of all KH domains, plays a crucial role in discrimination of the first two bases of the tetrad.
为阐明K-同源(KH)结构域对序列特异性单链(ss)DNA的识别基础,我们解析了转录调节因子不均一核核糖核蛋白K(hnRNP K)的KH3结构域与一条10聚体ssDNA形成的复合物的溶液结构。我们发现hnRNP K KH3特异性识别序列5'd-TCCC的四联体。该复合物通过一个密集的甲基-氧氢键网络得以稳定,该网络涉及三个异亮氨酸残基的甲基以及两个中央胞嘧啶碱基的O2和N3原子。与最近解析的涉及远上游元件(FUSE)结合蛋白FBP的KH3和KH4结构域的特异性蛋白质-ssDNA复合物的结构相比,位于所有KH结构域所特有的保守GXXG基序N端五个残基处的氨基酸,在识别四联体的前两个碱基中起着关键作用。