Department of Chemistry and Chemical Biology, Harvard University, Cambridge, Massachusetts, United States of America.
Harvard University Program in Biophysics, Harvard University, Cambridge, Massachusetts, United States of America.
PLoS Comput Biol. 2020 Nov 16;16(11):e1008323. doi: 10.1371/journal.pcbi.1008323. eCollection 2020 Nov.
Atomistic simulations can provide valuable, experimentally-verifiable insights into protein folding mechanisms, but existing ab initio simulation methods are restricted to only the smallest proteins due to severe computational speed limits. The folding of larger proteins has been studied using native-centric potential functions, but such models omit the potentially crucial role of non-native interactions. Here, we present an algorithm, entitled DBFOLD, which can predict folding pathways for a wide range of proteins while accounting for the effects of non-native contacts. In addition, DBFOLD can predict the relative rates of different transitions within a protein's folding pathway. To accomplish this, rather than directly simulating folding, our method combines equilibrium Monte-Carlo simulations, which deploy enhanced sampling, with unfolding simulations at high temperatures. We show that under certain conditions, trajectories from these two types of simulations can be jointly analyzed to compute unknown folding rates from detailed balance. This requires inferring free energies from the equilibrium simulations, and extrapolating transition rates from the unfolding simulations to lower, physiologically-reasonable temperatures at which the native state is marginally stable. As a proof of principle, we show that our method can accurately predict folding pathways and Monte-Carlo rates for the well-characterized Streptococcal protein G. We then show that our method significantly reduces the amount of computation time required to compute the folding pathways of large, misfolding-prone proteins that lie beyond the reach of existing direct simulation. Our algorithm, which is available online, can generate detailed atomistic models of protein folding mechanisms while shedding light on the role of non-native intermediates which may crucially affect organismal fitness and are frequently implicated in disease.
原子模拟可以为蛋白质折叠机制提供有价值的、可通过实验验证的见解,但现有的从头计算模拟方法由于严重的计算速度限制,仅适用于最小的蛋白质。较大蛋白质的折叠已使用基于天然的势能函数进行了研究,但此类模型忽略了非天然相互作用的潜在关键作用。在这里,我们提出了一种名为 DBFOLD 的算法,该算法可以预测广泛的蛋白质的折叠途径,同时考虑非天然接触的影响。此外,DBFOLD 可以预测蛋白质折叠途径中不同转变的相对速率。为了实现这一点,我们的方法不是直接模拟折叠,而是结合了平衡蒙特卡罗模拟(部署增强采样)和高温下的展开模拟。我们表明,在某些条件下,可以共同分析这两种类型的模拟轨迹,从详细平衡计算未知的折叠速率。这需要从平衡模拟推断自由能,并从展开模拟推断过渡速率到较低的、生理上合理的温度,在该温度下天然状态处于边缘稳定状态。作为原理证明,我们表明我们的方法可以准确预测具有良好特征的链球菌蛋白 G 的折叠途径和蒙特卡罗速率。然后,我们表明我们的方法大大减少了计算超出现有直接模拟范围的大型、易错误折叠的蛋白质折叠途径所需的计算时间。我们的算法,可在线获取,能够生成蛋白质折叠机制的详细原子模型,同时阐明可能对生物体适应性产生关键影响且经常与疾病相关的非天然中间体的作用。