Calderon Christopher P, Arora Karunesh
Department of Statistics and Department of Computational and Applied Mathematics, Rice University, Houston, TX 77005-1892, USA.
J Chem Theory Comput. 2009 Jan 1;5(1):47-58. doi: 10.1021/ct800282a.
Low-dimensional stochastic models can summarize dynamical information and make long time predictions associated with observables of complex atomistic systems. Maximum likelihood based techniques for estimating low-dimensional surrogate diffusion models from relatively short time series are presented. It is found that a heterogeneous population of slowly evolving conformational degrees of freedom modulates the dynamics. This underlying heterogeneity results in a collection of estimated low-dimensional diffusion models. Numerical techniques for exploiting this finding to approximate skewed histograms associated with the simulation are presented. In addition, statistical tests are also used to assess the validity of the models and determine physically relevant sampling information, e.g. the maximum sampling frequency at which one can discretely sample from an atomistic time series and have a surrogate diffusion model pass goodness-of-fit tests. The information extracted from such analyses can possibly be used to assist umbrella sampling computations as well as help in approximating effective diffusion coefficients. The techniques are demonstrated on simulations of Adenylate Kinase.
低维随机模型可以总结动力学信息,并对与复杂原子系统可观测量相关的长时间行为进行预测。本文提出了基于最大似然法从相对较短的时间序列估计低维替代扩散模型的技术。研究发现,缓慢演化的构象自由度的异质群体调节着动力学。这种潜在的异质性导致了一系列估计的低维扩散模型。本文介绍了利用这一发现来近似与模拟相关的偏态直方图的数值技术。此外,还使用统计检验来评估模型的有效性,并确定物理上相关的采样信息,例如,从原子时间序列中离散采样并使替代扩散模型通过拟合优度检验的最大采样频率。从这些分析中提取的信息可能有助于辅助伞形采样计算,并有助于近似有效扩散系数。这些技术在腺苷酸激酶的模拟中得到了验证。