Cheng Jianlin, Eickholt Jesse, Wang Zheng, Deng Xin
Department of Computer Science, University of Missouri, Columbia, MO 65211, USA.
J Bioinform Comput Biol. 2012 Jun;10(3):1242003. doi: 10.1142/S0219720012420036.
After decades of research, protein structure prediction remains a very challenging problem. To address the different levels of complexity in structural modeling, two types of modeling techniques, template-based modeling and template-free modeling, have been developed. Template-based modeling can often generate a moderate- to high-resolution model when a similar, homologous template structure is found for a query protein, but it fails if no template or only incorrect templates are found. Template-free modeling, such as fragment-based assembly, may generate models of moderate resolution for small proteins of low topological complexity. The two techniques have seldom been integrated to improve protein modeling. Here we develop a recursive protein modeling approach that selectively and collaboratively applies template-based and template-free modeling methods to the template-covered (i.e. certain) and template-free (i.e. uncertain) regions of a protein. A preliminary implementation of the approach was tested on a number of hard modeling cases during the 9th Critical Assessment of Techniques for Protein Structure Prediction (CASP9) and improved the quality of modeling in most of these cases. Recursive modeling can significantly reduce the complexity of protein structure modeling and integrate template-based and template-free modeling to improve the quality and efficiency of protein structure prediction.
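The abstract describes the divide-and-conquer idea only at a high level, so the following Python sketch is one possible reading of it, not the authors' implementation. It assumes a hypothetical callback `coverage_fn(start, end)` that returns one boolean per residue indicating whether a template covers it. Covered segments are assigned to template-based modeling, and each uncovered segment is searched again on its own until no template is found or a depth limit (`max_depth`, also an assumption) is reached, at which point it is assigned to template-free modeling. The names `split_by_coverage`, `plan_modeling` and `toy_coverage` are illustrative only.

```python
from typing import Callable, List, Tuple

Segment = Tuple[int, int, bool]   # (start, end_exclusive, covered_by_template)
Plan = List[Tuple[int, int, str]]  # (start, end_exclusive, modeling method)


def split_by_coverage(covered: List[bool]) -> List[Segment]:
    """Group consecutive residues that share the same coverage flag."""
    segments: List[Segment] = []
    start = 0
    for i in range(1, len(covered) + 1):
        # Close the current segment at the end of the list or when the flag flips.
        if i == len(covered) or covered[i] != covered[start]:
            segments.append((start, i, covered[start]))
            start = i
    return segments


def plan_modeling(
    start: int,
    end: int,
    coverage_fn: Callable[[int, int], List[bool]],
    depth: int = 0,
    max_depth: int = 2,
) -> Plan:
    """Assign a modeling method to each region of residues [start, end).

    coverage_fn is a hypothetical stand-in for a template search: it returns
    one bool per residue in [start, end), True where a template covers it.
    """
    plan: Plan = []
    for s, e, is_covered in split_by_coverage(coverage_fn(start, end)):
        abs_s, abs_e = start + s, start + e
        if is_covered:
            plan.append((abs_s, abs_e, "template-based"))
        elif depth < max_depth and (abs_s, abs_e) != (start, end):
            # Uncertain region: search again on the sub-sequence alone.
            plan.extend(plan_modeling(abs_s, abs_e, coverage_fn, depth + 1, max_depth))
        else:
            # No template found for this whole region, or depth limit reached.
            plan.append((abs_s, abs_e, "template-free"))
    return plan


def toy_coverage(start: int, end: int) -> List[bool]:
    """Made-up search results for a 10-residue example (illustration only)."""
    hits = {(0, 10): range(0, 4), (4, 10): range(4, 7)}
    found = hits.get((start, end), range(0))
    return [i in found for i in range(start, end)]


plan = plan_modeling(0, 10, toy_coverage)
# plan == [(0, 4, 'template-based'), (4, 7, 'template-based'), (7, 10, 'template-free')]
```

In this toy run, the first search covers residues 0 to 3, a second search restricted to residues 4 to 9 finds a template for residues 4 to 6, and the remaining tail has no template and falls back to template-free modeling. The sketch only produces a plan; how the resulting partial models are assembled into one structure is outside what the abstract specifies.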