Jin Yezhi, Perez-Lemus Gustavo R, Zubieta Rico Pablo F, de Pablo Juan J
Pritzker School of Molecular Engineering, University of Chicago, Chicago, Illinois 60637-1476, United States.
J Phys Chem A. 2024 Aug 29;128(34):7257-7268. doi: 10.1021/acs.jpca.4c01546. Epub 2024 Aug 16.
Machine learned force fields offer the potential for faster execution times while retaining the accuracy of traditional DFT calculations, making them promising candidates for molecular simulations in cases where reliable classical force fields are not available. Some of the challenges associated with machine learned force fields include simulation stability over extended periods of time and ensuring that the statistical and dynamical properties of the underlying simulated systems are correctly captured. In this work, we propose a systematic training pipeline for such force fields that leads to improved model quality, compared to that achieved by traditional data generation and training approaches. That pipeline relies on the use of enhanced sampling techniques, and it is demonstrated here in the context of a liquid crystal, which exemplifies many of the challenges that are encountered in fluids and materials with complex free energy landscapes. Our results indicate that, whereas the majority of traditional machine learned force field training approaches lead to molecular dynamics simulations that are only stable over hundred-picosecond trajectories, our approach allows for stable simulations over tens of nanoseconds for organic molecular systems comprising thousands of atoms.
机器学习力场在保持传统密度泛函理论(DFT)计算精度的同时,具有更快的执行速度,这使其在没有可靠经典力场的情况下成为分子模拟的有前景的候选方法。与机器学习力场相关的一些挑战包括长时间模拟的稳定性,以及确保正确捕捉基础模拟系统的统计和动力学性质。在这项工作中,我们为这类力场提出了一种系统的训练流程,与传统数据生成和训练方法相比,该流程能提高模型质量。该流程依赖于增强采样技术,本文在液晶的背景下进行了演示,液晶体现了具有复杂自由能景观的流体和材料中遇到的许多挑战。我们的结果表明,大多数传统机器学习力场训练方法导致的分子动力学模拟仅在几百皮秒的轨迹上稳定,而我们的方法允许对包含数千个原子的有机分子系统进行长达数十纳秒的稳定模拟。