Conway Patrick, DiMaio Frank
Department of Biochemistry, University of Washington, Seattle, Washington.
Institute for Protein Design, University of Washington, Seattle, Washington.
Protein Sci. 2016 Aug;25(8):1525-34. doi: 10.1002/pro.2956. Epub 2016 Jun 16.
Forcefields used in biomolecular simulations are comprised of energetic terms that are physical in nature, based on parameter fitting to quantum mechanical simulation or experimental data, or statistical, drawing off high-resolution structural data to describe distributions of molecular features. Combining the two in a single forcefield is challenging, since physical terms describe some, but not all, of the observed statistics, leading to double counting. In this manuscript, we develop a general scheme for correcting statistical potentials used in combination with physical terms. We apply these corrections to the sidechain torsional potential used in the Rosetta all-atom forcefield. We show the approach identifies instances of double-counted interactions, including electrostatic interactions between sidechain and nearby backbone, and steric interactions between neighboring Cβ atoms within secondary structural elements. Moreover, this scheme allows for the inclusion of intraresidue physical terms, previously turned off to avoid overlap with the statistical potential. Combined, these corrections lead to a forcefield with improved performance on several structure prediction tasks, including rotamer prediction and native structure discrimination.
生物分子模拟中使用的力场由本质上具有物理性质的能量项组成,这些能量项基于对量子力学模拟或实验数据的参数拟合,或者基于统计学,利用高分辨率结构数据来描述分子特征的分布。将两者结合在一个单一的力场中具有挑战性,因为物理项描述了部分而非全部观察到的统计数据,从而导致重复计算。在本论文中,我们开发了一种用于校正与物理项结合使用的统计势的通用方案。我们将这些校正应用于Rosetta全原子力场中使用的侧链扭转势。我们表明,该方法能够识别重复计算相互作用的实例,包括侧链与附近主链之间的静电相互作用,以及二级结构元件内相邻Cβ原子之间的空间相互作用。此外,该方案允许纳入残基内物理项,这些项之前被关闭以避免与统计势重叠。综合起来,这些校正导致一个在包括旋转异构体预测和天然结构区分在内的多个结构预测任务上具有改进性能的力场。