University of Zurich, Department of Biochemistry, Winterthurerstrasse 190, CH-8057 Zurich, Switzerland.
J Chem Phys. 2019 Mar 14;150(10):104105. doi: 10.1063/1.5063556.
Classical atomistic simulations of biomolecules play an increasingly important role in molecular life science. The structure of current computing architectures favors methods that run multiple trajectories at once without requiring extensive communication between them. Many advanced sampling strategies in the field fit this mold. These approaches often rely on an adaptive logic and create ensembles of comparatively short trajectories whose starting points are not distributed according to the correct Boltzmann weights. This type of bias is notoriously difficult to remove, and Markov state models (MSMs) are one of the few strategies available for recovering the correct kinetics and thermodynamics from these ensembles of trajectories. In this contribution, we analyze the performance of MSMs in the thermodynamic reweighting task for a hierarchical set of systems. We show that MSMs can be rigorous tools to recover the correct equilibrium distribution for systems of sufficiently low dimensionality. This is conditional upon not tampering with local flux imbalances found in the data. For a real-world application, we find that a pure likelihood-based inference of the transition matrix produces the best results. The removal of the bias is incomplete, however, and for this system, all tested MSMs are outperformed by an alternative albeit less general approach rooted in the ideas of statistical resampling. We conclude by formulating some recommendations for how to address the reweighting issue in practice.
经典的生物分子原子模拟在分子生命科学中扮演着越来越重要的角色。当前计算体系结构的结构有利于同时运行多个轨迹的方法,而不需要它们之间进行大量的通信。该领域的许多高级采样策略都符合这一模式。这些方法通常依赖于自适应逻辑,并创建相对较短轨迹的集合,其起点不是根据正确的玻尔兹曼权重分布的。这种类型的偏差很难消除,而马尔可夫状态模型(MSM)是从这些轨迹集合中恢复正确的动力学和热力学的少数几种策略之一。在本研究中,我们分析了 MSM 在一组层次系统的热力学重新加权任务中的性能。我们表明,MSM 可以成为恢复具有足够低维数的系统的正确平衡分布的严格工具。这取决于不干扰数据中发现的局部通量不平衡。对于实际应用,我们发现基于纯似然性的转移矩阵推断会产生最佳结果。然而,偏差的消除并不完全,对于这个系统,所有测试的 MSM 都不如基于统计重采样思想的替代方法表现好。最后,我们提出了一些关于如何在实践中解决重新加权问题的建议。