Abbass Jad, Nebel Jean-Christophe
Faculty of Science, Engineering and Computing, Kingston; University, London, KT1 2EE, United Kingdom.
Protein Pept Lett. 2017;24(3):215-222. doi: 10.2174/0929866523666161216124019.
Protein structure prediction is considered a main challenge in computational biology. The biannual international competition, Critical Assessment of protein Structure Prediction (CASP), has shown in its eleventh experiment that free modelling target predictions are still beyond reliable accuracy, therefore, much effort should be made to improve ab initio methods. Arguably, Rosetta is considered as the most competitive method when it comes to targets with no homologues. Relying on fragments of length 9 and 3 from known structures, Rosetta creates putative structures by assembling candidate fragments. Generally, the structure with the lowest energy score, also known as first model, is chosen to be the "predicted one". A thorough study has been conducted on the role and diversity of 3-mers involved in Rosetta's model "refinement" phase. Usage of the standard number of 3-mers - i.e. 200 - has been shown to degrade alpha and alpha-beta protein conformations initially achieved by assembling 9-mers. Therefore, a new prediction pipeline is proposed for Rosetta where the "refinement" phase is customised according to a target's structural class prediction. Over 8% improvement in terms of first model structure accuracy is reported for alpha and alpha-beta classes when decreasing the number of 3- mers.
蛋白质结构预测被认为是计算生物学中的一项主要挑战。两年一度的国际蛋白质结构预测关键评估(CASP)竞赛在其第十一次实验中表明,自由建模目标预测的准确性仍不可靠,因此,应付出更多努力来改进从头算方法。可以说,在处理没有同源物的目标时,Rosetta被认为是最具竞争力的方法。Rosetta依靠已知结构中长度为9和3的片段,通过组装候选片段来创建假定结构。通常,能量得分最低的结构,也就是所谓的第一个模型,被选为“预测结构”。针对Rosetta模型“优化”阶段中涉及的三联体的作用和多样性进行了深入研究。已证明使用标准数量的三联体(即200个)会降低最初通过组装九联体获得的α和α-β蛋白构象。因此,为Rosetta提出了一种新的预测流程,其中“优化”阶段根据目标的结构类别预测进行定制。当减少三联体数量时,α和α-β类别的第一个模型结构准确性提高了8%以上。