Biophysics Program, University of Maryland, College Park, MD 20742.
Institute for Physical Science and Technology, University of Maryland, College Park, MD 20742.
Proc Natl Acad Sci U S A. 2022 Aug 9;119(32):e2203656119. doi: 10.1073/pnas.2203656119. Epub 2022 Aug 4.
Using simulations or experiments performed at some set of temperatures to learn about the physics or chemistry at some other arbitrary temperature is a problem of immense practical and theoretical relevance. Here we develop a framework based on statistical mechanics and generative artificial intelligence that allows solving this problem. Specifically, we work with denoising diffusion probabilistic models and show how these models in combination with replica exchange molecular dynamics achieve superior sampling of the biomolecular energy landscape at temperatures that were never simulated without assuming any particular slow degrees of freedom. The key idea is to treat the temperature as a fluctuating random variable and not a control parameter as is usually done. This allows us to directly sample from the joint probability distribution in configuration and temperature space. The results here are demonstrated for a chirally symmetric peptide and single-strand RNA undergoing conformational transitions in all-atom water. We demonstrate how we can discover transition states and metastable states that were previously unseen at the temperature of interest and even bypass the need to perform further simulations for a wide range of temperatures. At the same time, any unphysical states are easily identifiable through very low Boltzmann weights. The procedure while shown here for a class of molecular simulations should be more generally applicable to mixing information across simulations and experiments with varying control parameters.
使用在一组特定温度下进行的模拟或实验来了解其他任意温度下的物理或化学性质,是一个具有巨大实际和理论意义的问题。在这里,我们开发了一个基于统计力学和生成式人工智能的框架,该框架可以解决这个问题。具体来说,我们使用去噪扩散概率模型,并展示了这些模型如何与 replica 交换分子动力学相结合,在从未模拟过的温度下实现生物分子能量景观的优越采样,而无需假设任何特定的慢自由度。关键思想是将温度视为波动的随机变量,而不是通常所做的控制参数。这使我们能够直接从构型和温度空间的联合概率分布中进行采样。这里的结果是针对经历构象转变的手性对称肽和单链 RNA 在全原子水中的情况进行演示的。我们展示了如何发现以前在感兴趣温度下看不到的过渡态和亚稳态,甚至可以绕过在广泛温度范围内进一步进行模拟的需要。同时,任何非物理状态都可以通过非常低的玻尔兹曼权重轻松识别。虽然这里为一类分子模拟展示了该过程,但它应该更普遍地适用于跨具有不同控制参数的模拟和实验混合信息的情况。