Noé Frank
DFG Research Center Matheon, FU Berlin, Arnimallee 6, 14159 Berlin, Germany.
J Chem Phys. 2008 Jun 28;128(24):244103. doi: 10.1063/1.2916718.
Molecular dynamics (MD) simulations can be used to estimate transition rates between conformational substates of the simulated molecule. Such an estimation is associated with statistical uncertainty, which depends on the number of observed transitions. In turn, it induces uncertainties in any property computed from the simulation, such as free energy differences or the time scales involved in the system's kinetics. Assessing these uncertainties is essential for testing the reliability of a given observation and also to plan further simulations in such a way that the most serious uncertainties will be reduced with minimal effort. Here, a rigorous statistical method is proposed to approximate the complete statistical distribution of any observable of an MD simulation provided that one can identify conformational substates such that the transition process between them may be modeled with a memoryless jump process, i.e., Markov or Master equation dynamics. The method is based on sampling the statistical distribution of Markov transition matrices that is induced by the observed transition events. It allows physically meaningful constraints to be included, such as sampling only matrices that fulfill detailed balance, or matrices that produce a predefined equilibrium distribution of states. The method is illustrated on mus MD simulations of a hexapeptide for which the distributions and uncertainties of the free energy differences between conformations, the transition matrix elements, and the transition matrix eigenvalues are estimated. It is found that both constraints, detailed balance and predefined equilibrium distribution, can significantly reduce the uncertainty of some observables.
分子动力学(MD)模拟可用于估计模拟分子构象亚态之间的跃迁速率。这种估计与统计不确定性相关,统计不确定性取决于观测到的跃迁次数。反过来,它会在从模拟中计算出的任何属性中引入不确定性,例如自由能差或系统动力学所涉及的时间尺度。评估这些不确定性对于测试给定观测结果的可靠性至关重要,同时对于以最小的努力减少最严重的不确定性的方式规划进一步的模拟也很重要。在此,提出了一种严格的统计方法,以近似MD模拟中任何可观测值的完整统计分布,前提是能够识别构象亚态,使得它们之间的跃迁过程可以用无记忆跳跃过程来建模,即马尔可夫或主方程动力学。该方法基于对由观测到的跃迁事件诱导的马尔可夫跃迁矩阵的统计分布进行采样。它允许纳入具有物理意义的约束,例如仅对满足细致平衡的矩阵进行采样,或对产生预定义状态平衡分布的矩阵进行采样。该方法在一个六肽的MD模拟中得到了说明,其中估计了构象之间自由能差的分布和不确定性、跃迁矩阵元素以及跃迁矩阵特征值。结果发现,细致平衡和预定义平衡分布这两个约束都可以显著降低某些可观测值的不确定性。