Systems Biology Centre, University of Warwick, Coventry, United Kingdom.
Biophys J. 2012 Feb 22;102(4):878-86. doi: 10.1016/j.bpj.2011.12.053. Epub 2012 Feb 21.
Nested sampling is a Bayesian sampling technique developed to explore probability distributions localized in an exponentially small area of the parameter space. The algorithm provides both posterior samples and an estimate of the evidence (marginal likelihood) of the model. The nested sampling algorithm also provides an efficient way to calculate free energies and the expectation value of thermodynamic observables at any temperature, through a simple post processing of the output. Previous applications of the algorithm have yielded large efficiency gains over other sampling techniques, including parallel tempering. In this article, we describe a parallel implementation of the nested sampling algorithm and its application to the problem of protein folding in a Gō-like force field of empirical potentials that were designed to stabilize secondary structure elements in room-temperature simulations. We demonstrate the method by conducting folding simulations on a number of small proteins that are commonly used for testing protein-folding procedures. A topological analysis of the posterior samples is performed to produce energy landscape charts, which give a high-level description of the potential energy surface for the protein folding simulations. These charts provide qualitative insights into both the folding process and the nature of the model and force field used.
嵌套抽样是一种贝叶斯抽样技术,旨在探索参数空间中呈指数级小区域的概率分布。该算法提供了模型的后验样本和证据(边际似然)的估计。通过对输出进行简单的后处理,嵌套抽样算法还提供了一种在任何温度下计算自由能和热力学可观测量的期望值的有效方法。该算法的先前应用已经比其他抽样技术(包括并行温度)带来了很大的效率提升。在本文中,我们描述了嵌套抽样算法的并行实现及其在 Gō 类经验势力场中蛋白质折叠问题的应用,该势场旨在稳定室温模拟中的二级结构元件。我们通过对一些常用于测试蛋白质折叠程序的小蛋白质进行折叠模拟来演示该方法。对后验样本进行拓扑分析以生成能量景观图,为蛋白质折叠模拟的势能表面提供了高级描述。这些图表为折叠过程和使用的模型和力场的性质提供了定性的见解。