Department of Chemistry, Chemical Physics Theory Group, University of Toronto, 80 Saint George Street, Toronto, Ontario M5S 3H6, Canada.
J Chem Phys. 2010 Apr 21;132(15):154110. doi: 10.1063/1.3366523.
In this paper a method of obtaining smooth analytical estimates of probability densities, radial distribution functions, and potentials of mean force from sampled data in a statistically controlled fashion is presented. The approach is general and can be applied to any density of a single random variable. The method outlined here avoids the use of histograms, which require the specification of a physical parameter (bin size) and tend to give noisy results. The technique is an extension of the Berg-Harris method [B. A. Berg and R. C. Harris, Comput. Phys. Commun. 179, 443 (2008)], which is typically inaccurate for radial distribution functions and potentials of mean force due to a nonuniform Jacobian factor. In addition, the standard method often requires a large number of Fourier modes to represent radial distribution functions, which tends to lead to oscillatory fits. It is shown that the issues of poor sampling due to a Jacobian factor can be resolved using a biased resampling scheme, while the requirement of a large number of Fourier modes is mitigated through an automated piecewise construction approach. The method is demonstrated by analyzing the radial distribution functions in an energy-discretized water model. In addition, the fitting procedure is illustrated on three more applications for which the original Berg-Harris method is not suitable, namely, a random variable with a discontinuous probability density, a density with long tails, and the distribution of the first arrival times of a diffusing particle to a sphere, which has both long tails and short-time structure. In all cases, the resampled, piecewise analytical fit outperforms the histogram and the original Berg-Harris method.
本文提出了一种从统计控制的采样数据中获得平滑解析概率密度、径向分布函数和平均力势估计的方法。该方法具有通用性,可应用于任何单一随机变量的密度。这里概述的方法避免了使用直方图,直方图需要指定物理参数(箱大小),并且往往会产生嘈杂的结果。该技术是 Berg-Harris 方法的扩展[B. A. Berg 和 R. C. Harris, Comput. Phys. Commun. 179, 443 (2008)],由于雅可比因子不均匀,该方法通常不适用于径向分布函数和平均力势。此外,标准方法通常需要大量傅里叶模式来表示径向分布函数,这往往会导致波动拟合。结果表明,由于雅可比因子导致的采样不足问题可以通过有偏重采样方案来解决,而大量傅里叶模式的要求则通过自动分段构造方法得到缓解。该方法通过分析能量离散化水模型中的径向分布函数得到了验证。此外,还通过三个不适合原始 Berg-Harris 方法的应用实例说明了拟合过程,即具有不连续概率密度的随机变量、长尾密度和扩散粒子到达球体的首次到达时间分布,该分布既有长尾又有短时间结构。在所有情况下,重采样的分段解析拟合都优于直方图和原始 Berg-Harris 方法。