Pawlak Sebastian D, Radlinska Monika, Chmiel Agnieszka A, Bujnicki Janusz M, Skowronek Krzysztof J
Laboratory of Bioinformatics and Protein Engineering, International Institute of Molecular and Cell Biology, ul. ks. Trojdena 4, 02-109 Warsaw, Poland.
Nucleic Acids Res. 2005 Jan 31;33(2):661-71. doi: 10.1093/nar/gki213. Print 2005.
Thus far, identification of functionally important residues in Type II restriction endonucleases (REases) has been difficult using conventional methods. Even though known REase structures share a fold and marginally recognizable active site, the overall sequence similarities are statistically insignificant, unless compared among proteins that recognize identical or very similar sequences. Bsp6I is a Type II REase, which recognizes the palindromic DNA sequence 5'GCNGC and cleaves between the cytosine and the unspecified nucleotide in both strands, generating a double-strand break with 5'-protruding single nucleotides. There are no solved structures of REases that recognize similar DNA targets or generate cleavage products with similar characteristics. In straightforward comparisons, the Bsp6I sequence shows no significant similarity to REases with known structures. However, using a fold-recognition approach, we have identified a remote relationship between Bsp6I and the structure of PvuII. Starting from the sequence-structure alignment between Bsp6I and PvuII, we constructed a homology model of Bsp6I and used it to predict functionally significant regions in Bsp6I. The homology model was supported by site-directed mutagenesis of residues predicted to be important for dimerization, DNA binding and catalysis. Completing the picture of sequence-structure-function relationships in protein superfamilies becomes an essential task in the age of structural genomics and our study may serve as a paradigm for future analyses of superfamilies comprising strongly diverged members with little or no sequence similarity.
到目前为止,使用传统方法很难鉴定II型限制性核酸内切酶(REases)中功能重要的残基。尽管已知的REase结构具有相同的折叠方式和勉强可识别的活性位点,但总体序列相似性在统计学上并不显著,除非在识别相同或非常相似序列的蛋白质之间进行比较。Bsp6I是一种II型REase,它识别回文DNA序列5'GCNGC,并在两条链的胞嘧啶和未指定的核苷酸之间切割,产生具有5'突出单核苷酸的双链断裂。目前还没有识别相似DNA靶标或产生具有相似特征切割产物的REase的解析结构。在直接比较中,Bsp6I序列与已知结构的REase没有显著相似性。然而,使用折叠识别方法,我们发现了Bsp6I与PvuII结构之间的远缘关系。从Bsp6I和PvuII的序列-结构比对开始,我们构建了Bsp6I的同源模型,并使用它来预测Bsp6I中功能重要的区域。该同源模型得到了对预测对二聚化、DNA结合和催化重要的残基进行定点诱变的支持。在结构基因组学时代,完善蛋白质超家族中序列-结构-功能关系的图景成为一项重要任务,我们的研究可能为未来分析由很少或没有序列相似性的高度分化成员组成的超家族提供一个范例。