Jamroz Michal, Kolinski Andrzej
Laboratory of Theory of Biopolymers, Faculty of Chemistry, University of Warsaw, Warsaw, Poland.
BMC Struct Biol. 2010 Feb 11;10:5. doi: 10.1186/1472-6807-10-5.
Template-target sequence alignment and loop modeling are key components of protein comparative modeling. Short loops can be predicted with high accuracy using structural fragments from other, not necessairly homologous proteins, or by various minimization methods. For longer loops multiscale approaches employing coarse-grained de novo modeling techniques should be more effective.
For a representative set of protein structures of various structural classes test predictions of loop regions have been performed using MODELLER, ROSETTA, and a CABS coarse-grained de novo modeling tool. Loops of various length, from 4 to 25 residues, were modeled assuming an ideal target-template alignment of the remaining portions of the protein. It has been shown that classical modeling with MODELLER is usually better for short loops, while coarse-grained de novo modeling is more effective for longer loops. Even very long missing fragments in protein structures could be effectively modeled. Resolution of such models is usually on the level 2-6 A, which could be sufficient for guiding protein engineering. Further improvement of modeling accuracy could be achieved by the combination of different methods. In particular, we used 10 top ranked models from sets of 500 models generated by MODELLER as multiple templates for CABS modeling. On average, the resulting molecular models were better than the models from individual methods.
Accuracy of protein modeling, as demonstrated for the problem of loop modeling, could be improved by the combinations of different modeling techniques.
模板-靶序列比对和环建模是蛋白质比较建模的关键组成部分。短环可以使用来自其他不一定同源的蛋白质的结构片段,或通过各种最小化方法进行高精度预测。对于较长的环,采用粗粒度从头建模技术的多尺度方法应该更有效。
对于一组具有代表性的不同结构类别的蛋白质结构,使用MODELLER、ROSETTA和一种CABS粗粒度从头建模工具对环区域进行了测试预测。假设蛋白质其余部分的靶标-模板比对理想,对4至25个残基的不同长度的环进行了建模。结果表明,使用MODELLER进行经典建模通常对短环更好,而粗粒度从头建模对长环更有效。即使是蛋白质结构中非常长的缺失片段也可以有效地建模。此类模型的分辨率通常在2-6埃的水平,这对于指导蛋白质工程可能已经足够。通过组合不同方法可以进一步提高建模精度。特别是,我们使用了由MODELLER生成的500个模型集中排名前10的模型作为CABS建模的多个模板。平均而言,得到的分子模型比单个方法得到的模型更好。
如环建模问题所示,通过组合不同的建模技术可以提高蛋白质建模的准确性。