Skowronek Krzysztof J, Kosinski Jan, Bujnicki Janusz M
International Institute of Molecular and Cell Biology, Warsaw, Poland.
Proteins. 2006 Jun 1;63(4):1059-68. doi: 10.1002/prot.20920.
Type II restriction enzymes are commercially important deoxyribonucleases and very attractive targets for protein engineering of new specificities. At the same time they are a very challenging test bed for protein structure prediction methods. Typically, enzymes that recognize different sequences show little or no amino acid sequence similarity to each other and to other proteins. Based on crystallographic analyses that revealed the same PD-(D/E)XK fold for more than a dozen case studies, they were nevertheless considered to be related until the combination of bioinformatics and mutational analyses has demonstrated that some of these proteins belong to other, unrelated folds PLD, HNH, and GIY-YIG. As a part of a large-scale project aiming at identification of a three-dimensional fold for all type II REases with known sequences (currently approximately 1000 proteins), we carried out preliminary structure prediction and selected candidates for experimental validation. Here, we present the analysis of HpaI REase, an ORFan with no detectable homologs, for which we detected a structural template by protein fold recognition, constructed a model using the FRankenstein monster approach and identified a number of residues important for the DNA binding and catalysis. These predictions were confirmed by site-directed mutagenesis and in vitro analysis of the mutant proteins. The experimentally validated model of HpaI will serve as a low-resolution structural platform for evolutionary considerations in the subgroup of blunt-cutting REases with different specificities. The research protocol developed in the course of this work represents a streamlined version of the previously used techniques and can be used in a high-throughput fashion to build and validate models for other enzymes, especially ORFans that exhibit no sequence similarity to any other protein in the database.
II型限制酶是具有商业重要性的脱氧核糖核酸酶,也是用于构建新特异性的蛋白质工程的极具吸引力的靶点。同时,它们也是蛋白质结构预测方法极具挑战性的试验平台。通常,识别不同序列的酶彼此之间以及与其他蛋白质几乎没有或没有氨基酸序列相似性。基于晶体学分析,在十几个案例研究中揭示了相同的PD-(D/E)XK折叠,在生物信息学和突变分析相结合证明其中一些蛋白质属于其他不相关的折叠(PLD、HNH和GIY-YIG)之前,它们仍被认为是相关的。作为一个旨在为所有已知序列的II型限制酶(目前约有1000种蛋白质)确定三维折叠的大规模项目的一部分,我们进行了初步结构预测并选择了候选物进行实验验证。在这里,我们展示了对HpaI限制酶的分析,它是一种没有可检测到的同源物的孤儿蛋白,我们通过蛋白质折叠识别检测到了一个结构模板,使用“科学怪人”方法构建了一个模型,并确定了一些对DNA结合和催化很重要的残基。这些预测通过定点诱变和突变蛋白的体外分析得到了证实。经过实验验证的HpaI模型将作为一个低分辨率结构平台,用于对具有不同特异性的平端切割限制酶亚组进行进化考量。在这项工作过程中开发的研究方案是先前使用技术的简化版本,可用于高通量方式构建和验证其他酶的模型,特别是那些与数据库中任何其他蛋白质都没有序列相似性的孤儿蛋白。