Lopes Anne, Alexandrov Alexey, Bathelt Christine, Archontis Georgios, Simonson Thomas
Laboratoire de Biochimie (UMR CNRS 7654), Department of Biology, Ecole Polytechnique, 91128, Palaiseau, France.
Proteins. 2007 Jun 1;67(4):853-67. doi: 10.1002/prot.21379.
Structure prediction and computational protein design should benefit from accurate solvent models. We have applied implicit solvent models to two problems that are central to this area. First, we performed sidechain placement for 29 proteins, using a solvent model that combines a screened Coulomb term with an Accessible Surface Area term (CASA model). With optimized parameters, the prediction quality is comparable with earlier work that omitted electrostatics and solvation altogether. Second, we computed the stability changes associated with point mutations involving ionized sidechains. For over 1000 mutations, including many fully or partly buried positions, we compared CASA and two generalized Born models (GB) with a more accurate model, which solves the Poisson equation of continuum electrostatics numerically. CASA predicts the correct sign and order of magnitude of the stability change for 81% of the mutations, compared to 97% with the best GB. We also considered 140 mutations for which experimental data are available. Comparing to experiment requires additional assumptions about the unfolded protein structure, protein relaxation in response to the mutations, and contributions from the hydrophobic effect. With a simple, commonly-used unfolded state model, the mean unsigned error is 2.1 kcal/mol with both CASA and the best GB. Overall, the electrostatic model is not important for sidechain placement; CASA and GB are equivalent for surface mutations, while GB is far superior for fully or partly buried positions. Thus, for problems like protein design that involve all these aspects, the most recent GB models represent an important step forward. Along with the recent discovery of efficient, pairwise implementations of GB, this will open new possibilities for the computational engineering of proteins.
结构预测和计算蛋白质设计应受益于精确的溶剂模型。我们已将隐式溶剂模型应用于该领域的两个核心问题。首先,我们使用一种将屏蔽库仑项与可及表面积项相结合的溶剂模型(CASA模型)对29种蛋白质进行侧链定位。通过优化参数,预测质量与早期完全忽略静电和溶剂化作用的工作相当。其次,我们计算了与涉及离子化侧链的点突变相关的稳定性变化。对于1000多个突变,包括许多完全或部分埋藏的位置,我们将CASA模型和两种广义玻恩模型(GB)与一种更精确的模型进行了比较,该精确模型通过数值方法求解连续介质静电学的泊松方程。CASA模型对81%的突变预测出了稳定性变化的正确符号和量级,而最佳GB模型的这一比例为97%。我们还考虑了140个有实验数据的突变。与实验进行比较需要对未折叠蛋白质结构、蛋白质对突变的弛豫以及疏水效应的贡献做出额外假设。使用一个简单的、常用的未折叠状态模型,CASA模型和最佳GB模型的平均绝对误差均为2.1千卡/摩尔。总体而言,静电模型对于侧链定位并不重要;CASA模型和GB模型在表面突变方面等效,而GB模型在完全或部分埋藏位置方面远优于CASA模型。因此,对于涉及所有这些方面的蛋白质设计等问题,最新的GB模型代表了重要的进步。随着最近发现GB模型的高效成对实现方式,这将为蛋白质的计算工程开辟新的可能性。