Aldeghi Matteo, Gapsys Vytautas, de Groot Bert L
Computational Biomolecular Dynamics Group, Max Planck Institute for Biophysical Chemistry, 37077 Göttingen, Germany.
ACS Cent Sci. 2018 Dec 26;4(12):1708-1718. doi: 10.1021/acscentsci.8b00717. Epub 2018 Dec 13.
The design of proteins with novel ligand-binding functions holds great potential for application in biomedicine and biotechnology. However, our ability to engineer ligand-binding proteins is still limited, and current approaches rely primarily on experimentation. Computation could reduce the cost of the development process and would allow rigorous testing of our understanding of the principles governing molecular recognition. While computational methods have proven successful in the early stages of the discovery process, optimization approaches that can quantitatively predict ligand affinity changes upon protein mutation are still lacking. Here, we assess the ability of free energy calculations based on first-principles statistical mechanics, as well as the latest Rosetta protocols, to quantitatively predict such affinity changes on a challenging set of 134 mutations. After evaluating different protocols with computational efficiency in mind, we investigate the performance of different force fields. We show that both the free energy calculations and Rosetta are able to quantitatively predict changes in ligand binding affinity upon protein mutations, yet the best predictions are the result of combining the estimates of both methods. These closely match the experimentally determined ΔΔ values, with a root-mean-square error of 1.2 kcal/mol for the full benchmark set and of 0.8 kcal/mol for a subset of protein systems providing the most reproducible results. The currently achievable accuracy offers the prospect of being able to employ computation for the optimization of ligand-binding proteins as well as the prediction of drug resistance.
设计具有新型配体结合功能的蛋白质在生物医学和生物技术领域具有巨大的应用潜力。然而,我们设计配体结合蛋白的能力仍然有限,目前的方法主要依赖于实验。计算可以降低开发过程的成本,并能对我们对分子识别原理的理解进行严格测试。虽然计算方法在发现过程的早期阶段已被证明是成功的,但仍缺乏能够定量预测蛋白质突变后配体亲和力变化的优化方法。在此,我们评估基于第一性原理统计力学的自由能计算以及最新的Rosetta协议在一组具有挑战性的134个突变上定量预测此类亲和力变化的能力。在考虑计算效率评估不同协议后,我们研究了不同力场的性能。我们表明,自由能计算和Rosetta都能够定量预测蛋白质突变后配体结合亲和力的变化,但最佳预测是两种方法估计值相结合的结果。这些结果与实验测定的ΔΔ值非常接近,整个基准集的均方根误差为1.2千卡/摩尔,对于提供最可重复结果的一部分蛋白质系统,均方根误差为0.8千卡/摩尔。目前可达到的精度为利用计算优化配体结合蛋白以及预测耐药性提供了前景。