Department of Biochemistry and Institute for Protein Design, University of Washington, Seattle, Washington.
Howard Hughes Medical Institute, University of Washington, Seattle, Washington.
Proteins. 2019 Dec;87(12):1276-1282. doi: 10.1002/prot.25784. Epub 2019 Aug 5.
Because proteins generally fold to their lowest free energy states, energy-guided refinement in principle should be able to systematically improve the quality of protein structure models generated using homologous structure or co-evolution derived information. However, because of the high dimensionality of the search space, there are far more ways to degrade the quality of a near native model than to improve it, and hence, refinement methods are very sensitive to energy function errors. In the 13th Critial Assessment of techniques for protein Structure Prediction (CASP13), we sought to carry out a thorough search for low energy states in the neighborhood of a starting model using restraints to avoid straying too far. The approach was reasonably successful in improving both regions largely incorrect in the starting models as well as core regions that started out closer to the correct structure. Models with GDT-HA over 70 were obtained for five targets and for one of those, an accuracy of 0.5 å backbone root-mean-square deviation (RMSD) was achieved. An important current challenge is to improve performance in refining oligomers and larger proteins, for which the search problem remains extremely difficult.
由于蛋白质通常会折叠到其最低自由能状态,因此原则上,能量引导的精修应该能够系统地提高使用同源结构或共进化衍生信息生成的蛋白质结构模型的质量。然而,由于搜索空间的高维度,降低近天然模型质量的方法远远多于提高质量的方法,因此,精修方法对能量函数误差非常敏感。在第十三届蛋白质结构预测技术关键评估 (CASP13) 中,我们试图使用约束来避免偏离太远,在起始模型的附近彻底搜索低能量状态。这种方法在改进起始模型中大部分错误的区域以及更接近正确结构的核心区域方面取得了相当大的成功。有五个目标的 GDT-HA 超过 70,其中一个达到了 0.5 å 后骨架均方根偏差 (RMSD) 的准确性。目前的一个重要挑战是提高寡聚体和更大蛋白质的精修性能,对于这些蛋白质,搜索问题仍然极其困难。