Shrestha Rojan, Berenger Francois, Zhang Kam Y J
Zhang Initiative Research Unit, Advanced Science Institute, RIKEN, Hirosawa, Wako, Saitama, Japan.
Acta Crystallogr D Biol Crystallogr. 2011 Sep;67(Pt 9):804-12. doi: 10.1107/S090744491102779X. Epub 2011 Aug 9.
Ab initio phasing is one of the remaining challenges in protein crystallography. Recent progress in computational structure prediction has enabled the generation of de novo models with high enough accuracy to solve the phase problem ab initio. This `ab initio phasing with de novo models' method first generates a huge number of de novo models and then selects some lowest energy models to solve the phase problem using molecular replacement. The amount of CPU time required is huge even for small proteins and this has limited the utility of this method. Here, an approach is described that significantly reduces the computing time required to perform ab initio phasing with de novo models. Instead of performing molecular replacement after the completion of all models, molecular replacement is initiated during the course of each simulation. The approach principally focuses on avoiding the refinement of the best and the worst models and terminating the entire simulation early once suitable models for phasing have been obtained. In a benchmark data set of 20 proteins, this method is over two orders of magnitude faster than the conventional approach. It was observed that in most cases molecular-replacement solutions were determined soon after the coarse-grained models were turned into full-atom representations. It was also found that all-atom refinement was hardly able to change the models sufficiently to enable successful molecular replacement if the coarse-grained models were not very close to the native structure. Therefore, it remains critical to generate good-quality coarse-grained models to enable subsequent all-atom refinement for successful ab initio phasing by molecular replacement.
从头计算相位是蛋白质晶体学中尚存的挑战之一。计算结构预测方面的最新进展使得能够生成具有足够高精度的从头模型,从而从头解决相位问题。这种“使用从头模型进行从头计算相位”的方法首先生成大量的从头模型,然后选择一些能量最低的模型,通过分子置换来解决相位问题。即使对于小蛋白质,所需的CPU时间也非常多,这限制了该方法的实用性。在此,描述了一种方法,该方法可显著减少使用从头模型进行从头计算相位所需的计算时间。不是在所有模型完成后进行分子置换,而是在每次模拟过程中启动分子置换。该方法主要侧重于避免对最佳和最差模型进行优化,并在获得适合相位计算的模型后尽早终止整个模拟。在一个包含20种蛋白质的基准数据集中,该方法比传统方法快两个数量级以上。据观察,在大多数情况下,粗粒度模型转变为全原子表示后不久就能确定分子置换解决方案。还发现,如果粗粒度模型与天然结构不太接近,全原子优化几乎无法充分改变模型以实现成功的分子置换。因此,生成高质量的粗粒度模型对于通过分子置换进行成功的从头计算相位从而实现后续的全原子优化仍然至关重要。