Isamura Bienfait K, Popelier Paul L A
Department of Chemistry, The University of Manchester, Manchester, M13 9PL, UK.
Phys Chem Chem Phys. 2024 Sep 18;26(36):23677-23691. doi: 10.1039/d4cp01862a.
The polarisable machine-learned force field FFLUX requires pre-trained anisotropic Gaussian process regression (GPR) models of atomic energies and multipole moments to propagate unbiased molecular dynamics simulations. The outcome of FFLUX simulations is highly dependent on the predictive accuracy of the underlying models whose training entails determining the optimal set of model hyperparameters. Unfortunately, traditional direct learning (DL) procedures do not scale well on this task, especially when the hyperparameter search is initiated from a (set of) random guess solution(s). Additionally, the complexity of the hyperparameter space (HS) increases with the number of geometrical input features, at least for anisotropic kernels, making the optimization of hyperparameters even more challenging. In this study, we propose a transfer learning (TL) protocol that accelerates the training process of anisotropic GPR models by facilitating access to promising regions of the HS. The protocol is based on a seeding-relaxation mechanism in which an excellent guess solution is identified by rapidly building one or several small source models over a subset of the target training set before readjusting the previous guess over the entire set. We demonstrate the performance of this protocol by building and assessing the performance of DL and TL models of atomic energies and charges in various conformations of benzene, ethanol, formic acid dimer and the drug fomepizole. Our experiments suggest that TL models can be built one order of magnitude faster while preserving the quality of their DL analogs. Most importantly, when deployed in FFLUX simulations, TL models compete with or even outperform their DL analogs when it comes to performing FFLUX geometry optimization and computing harmonic vibrational modes.
可极化的机器学习力场FFLUX需要预先训练的原子能量和多极矩的各向异性高斯过程回归(GPR)模型,以进行无偏分子动力学模拟。FFLUX模拟的结果高度依赖于基础模型的预测准确性,而这些模型的训练需要确定模型超参数的最佳集合。不幸的是,传统的直接学习(DL)程序在这项任务上扩展性不佳,特别是当超参数搜索从一个(组)随机猜测解开始时。此外,超参数空间(HS)的复杂性会随着几何输入特征数量的增加而增加,至少对于各向异性核来说是这样,这使得超参数的优化更具挑战性。在本研究中,我们提出了一种迁移学习(TL)协议,通过便于访问HS中有希望的区域来加速各向异性GPR模型的训练过程。该协议基于一种播种-松弛机制,其中通过在目标训练集的一个子集上快速构建一个或几个小的源模型来识别一个出色的猜测解,然后在整个集合上重新调整先前的猜测。我们通过构建和评估苯、乙醇、甲酸二聚体和药物甲吡唑各种构象下的原子能量和电荷的DL和TL模型的性能来证明该协议的性能。我们的实验表明,TL模型可以快一个数量级构建,同时保持其DL类似物的质量。最重要的是,当部署在FFLUX模拟中时,在执行FFLUX几何优化和计算谐波振动模式方面,TL模型与它们的DL类似物竞争甚至表现更优。