Rothchild Daniel, Rosen Andrew S, Taw Eric, Robinson Connie, Gonzalez Joseph E, Krishnapriyan Aditi S
Department of Electrical Engineering and Computer Science, University of California Berkeley USA
Department of Materials Science and Engineering, University of California Berkeley USA.
Chem Sci. 2024 Jul 22;15(33):13506-13522. doi: 10.1039/d3sc05877h. eCollection 2024 Aug 22.
We present an investigation of diffusion models for molecular generation, with the aim of better understanding how their predictions compare to the results of physics-based calculations. The investigation into these models is driven by their potential to significantly accelerate electronic structure calculations using machine learning, without requiring expensive first-principles datasets for training interatomic potentials. We find that the inference process of a popular diffusion model for molecular generation is divided into an exploration phase, where the model chooses the atomic species, and a relaxation phase, where it adjusts the atomic coordinates to find a low-energy geometry. As training proceeds, we show that the model initially learns about the first-order structure of the potential energy surface, and then later learns about higher-order structure. We also find that the relaxation phase of the diffusion model can be re-purposed to sample the Boltzmann distribution over conformations and to carry out structure relaxations. For structure relaxations, the model finds geometries with ∼10× lower energy than those produced by a classical force field for small organic molecules. Initializing a density functional theory (DFT) relaxation at the diffusion-produced structures yields a >2× speedup to the DFT relaxation when compared to initializing at structures relaxed with a classical force field.
我们展示了一项关于分子生成扩散模型的研究,目的是更好地理解其预测结果与基于物理的计算结果相比如何。对这些模型的研究是由它们在使用机器学习显著加速电子结构计算方面的潜力所驱动的,而无需用于训练原子间势的昂贵的第一性原理数据集。我们发现,一种用于分子生成的流行扩散模型的推理过程分为一个探索阶段,在此阶段模型选择原子种类,以及一个弛豫阶段,在此阶段它调整原子坐标以找到低能量几何结构。随着训练的进行,我们表明该模型最初学习势能面的一阶结构,然后学习高阶结构。我们还发现,扩散模型的弛豫阶段可以重新用于对构象上的玻尔兹曼分布进行采样并进行结构弛豫。对于结构弛豫,该模型找到的几何结构的能量比经典力场为小有机分子产生的能量低约10倍。与在经典力场弛豫的结构上初始化相比,在扩散产生的结构上初始化密度泛函理论(DFT)弛豫可使DFT弛豫加速超过2倍。