Faculty of Chemistry, Department of Computational Biological Chemistry, University of Vienna, Währingerstrasse 17, A-1090 Vienna, Austria.
Vienna Doctoral School in Chemistry (DoSChem), University of Vienna, Währingerstrasse 42, A-1090 Vienna, Austria.
J Phys Chem B. 2022 Apr 21;126(15):2798-2811. doi: 10.1021/acs.jpcb.2c00696. Epub 2022 Apr 11.
A key step during indirect alchemical free energy simulations using quantum mechanical/molecular mechanical (QM/MM) hybrid potential energy functions is the calculation of the free energy difference Δ between the low level (e.g., pure MM) and the high level of theory (QM/MM). A reliable approach uses nonequilibrium work (NEW) switching simulations in combination with Jarzynski's equation; however, it is computationally expensive. In this study, we investigate whether it is more efficient to use more shorter switches or fewer but longer switches. We compare results obtained with various protocols to reference free energy differences calculated with Crooks' equation. The central finding is that fewer longer switches give better converged results. As few as 200 sufficiently long switches lead to Δ values in good agreement with the reference results. This optimized protocol reduces the computational cost by a factor of 40 compared to earlier work. We also describe two tools/ways of analyzing the raw data to detect sources of poor convergence. Specifically, we find it helpful to analyze the raw data (work values from the NEW switching simulations) in a quasi-time series-like manner. Principal component analysis helps to detect cases where one or more conformational degrees of freedom are different at the low and high level of theory.
在使用量子力学/分子力学(QM/MM)杂化势能函数进行间接的量子热力学自由能模拟过程中,关键的一步是计算低水平(例如纯 MM)和高水平理论(QM/MM)之间的自由能差 Δ。一种可靠的方法是结合 Jarzynski 方程使用非平衡工作(NEW)切换模拟;然而,它的计算成本很高。在这项研究中,我们研究了使用更多短的开关或更少但更长的开关是否更有效。我们将各种方案的结果与 Crooks 方程计算的参考自由能差异进行了比较。主要发现是使用更少但更长的开关可以得到更好收敛的结果。使用多达 200 个足够长的开关就可以得到与参考结果非常吻合的 Δ 值。与早期的工作相比,这种优化的方案将计算成本降低了 40 倍。我们还描述了两种分析原始数据的工具/方法,以检测收敛不良的原因。具体来说,我们发现以类似准时间序列的方式分析原始数据(来自 NEW 切换模拟的工作值)很有帮助。主成分分析有助于检测在低水平和高水平理论下一个或多个构象自由度不同的情况。