Sindhikara Daniel J, Kim Seonah, Voter Arthur F, Roitberg Adrian E
Quantum Theory Project and Departments of Physics and Chemistry, University of Florida, Gainesville, Florida 32611, Department of Chemistry and Biochemistry, University of California, Los Angeles, California 90095, and Theoretical Division, Los Alamos National Laboratory, Los Alamos, New Mexico 87545.
J Chem Theory Comput. 2009 Jun 9;5(6):1624-31. doi: 10.1021/ct800573m. Epub 2009 Apr 28.
Molecular dynamics simulations starting from different initial conditions are commonly used to mimic the behavior of an experimental ensemble. We show in this article that when a Langevin thermostat is used to maintain constant temperature during such simulations, extreme care must be taken when choosing the random number seeds to prevent statistical correlation among the MD trajectories. While recent studies have shown that stochastically thermostatted trajectories evolving within a single potential basin with identical random number seeds tend to synchronize, we show that there is a synchronization effect even for complex, biologically relevant systems. We demonstrate this effect in simulations of alanine trimer and pentamer and in a simulation of a temperature-jump experiment for peptide folding of a 14-residue peptide. Even in replica-exchange simulations, in which the trajectories are at different temperatures, we find partial synchronization occurring when the same random number seed is employed. We explain this by extending the recent derivation of the synchronization effect for two trajectories in a harmonic well to the case in which the trajectories are at two different temperatures. Our results suggest several ways in which mishandling selection of a pseudorandom number generator initial seed can lead to corruption of simulation data. Simulators can fall into this trap in simple situations such as neglecting to specifically indicate different random seeds in either parallel or sequential restart simulations, utilizing a simulation package with a weak pseudorandom number generator, or using an advanced simulation algorithm that has not been programmed to distribute initial seeds.
从不同初始条件开始的分子动力学模拟通常用于模拟实验系综的行为。我们在本文中表明,当在这种模拟中使用朗之万恒温器来维持恒定温度时,在选择随机数种子时必须格外小心,以防止分子动力学轨迹之间的统计相关性。虽然最近的研究表明,在具有相同随机数种子的单个势阱内演化的随机恒温轨迹倾向于同步,但我们表明,即使对于复杂的、与生物学相关的系统,也存在同步效应。我们在丙氨酸三聚体和五聚体的模拟以及14个残基肽的肽折叠温度跳跃实验的模拟中证明了这种效应。即使在副本交换模拟中,其中轨迹处于不同温度,我们也发现当使用相同的随机数种子时会出现部分同步。我们通过将最近对谐波阱中两条轨迹同步效应的推导扩展到轨迹处于两个不同温度的情况来解释这一点。我们的结果表明,在几种情况下,错误处理伪随机数生成器初始种子的选择可能会导致模拟数据的损坏。模拟器可能会在简单情况下陷入这个陷阱,例如在并行或顺序重启模拟中忽略专门指定不同的随机种子、使用具有弱伪随机数生成器的模拟包或使用尚未编程以分配初始种子的高级模拟算法。