通过物理约束数据增强改进反应性机器学习势能的键解离

Improving Bond Dissociations of Reactive Machine Learning Potentials through Physics-Constrained Data Augmentation.

作者信息

F Dos Santos Luan G, Nebgen Benjamin T, Allen Alice E A, Hamilton Brenden W, Matin Sakib, Smith Justin S, Messerly Richard A

机构信息

Department of Chemistry and Biochemistry, Texas Tech University, Lubbock, Texas 79409, United States.

Theoretical Division, Los Alamos National Laboratory, Los Alamos, New Mexico 87545, United States.

出版信息

J Chem Inf Model. 2025 Feb 10;65(3):1198-1210. doi: 10.1021/acs.jcim.4c01847. Epub 2025 Jan 28.

DOI:10.1021/acs.jcim.4c01847

PMID:39874212

Abstract

In the field of computational chemistry, predicting bond dissociation energies (BDEs) presents well-known challenges, particularly due to the multireference character of reactive systems. Many chemical reactions involve configurations where single-reference methods fall short, as the electronic structure can significantly change during bond breaking. As generating training data for partially broken bonds is a challenging task, even state-of-the-art reactive machine learning interatomic potentials (MLIPs) often fail to predict reliable BDEs and smooth dissociation curves. By contrast, simple and inexpensive physics-based models, such as the well-established Morse potential, do not suffer from any such limitations. This work leverages the Morse potential to improve reactive MLIPs by augmenting the training data set with inexpensive Morse data along the dissociation pathways. This physics-constrained data augmentation (PCDA) approach results in MLIPs with smooth bond dissociation curves as well as near coupled-cluster level BDEs, all without requiring any expensive multireference quantum mechanical calculations. A case study for methane combustion demonstrates how the PCDA approach can improve an existing reactive MLIP, namely, ANI-1xnr. Not only are the BDEs and bond dissociation curves for all radicals and molecules significantly improved compared to ANI-1xnr but the PCDA-trained MLIP retains the reliability of ANI-1xnr when performing reactive molecular dynamics simulations.

摘要

在计算化学领域，预测键解离能（BDEs）存在诸多众所周知的挑战，尤其是由于反应体系具有多参考特征。许多化学反应涉及单参考方法失效的构型，因为在键断裂过程中电子结构会发生显著变化。由于生成部分断裂键的训练数据是一项具有挑战性的任务，即使是最先进的反应性机器学习原子间势（MLIPs）也常常无法预测可靠的BDEs和平滑的解离曲线。相比之下，简单且成本低廉的基于物理的模型，如成熟的莫尔斯势，不存在此类限制。这项工作利用莫尔斯势，通过沿解离路径用低成本的莫尔斯数据扩充训练数据集来改进反应性MLIPs。这种物理约束数据增强（PCDA）方法能得到具有平滑键解离曲线以及接近耦合簇水平BDEs的MLIPs，且无需任何昂贵的多参考量子力学计算。甲烷燃烧的案例研究展示了PCDA方法如何改进现有的反应性MLIP，即ANI - 1xnr。与ANI - 1xnr相比，所有自由基和分子的BDEs及键解离曲线都得到了显著改善，而且经PCDA训练的MLIP在进行反应性分子动力学模拟时保留了ANI - 1xnr的可靠性。

相似文献

Improving Bond Dissociations of Reactive Machine Learning Potentials through Physics-Constrained Data Augmentation.通过物理约束数据增强改进反应性机器学习势能的键解离

J Chem Inf Model. 2025 Feb 10;65(3):1198-1210. doi: 10.1021/acs.jcim.4c01847. Epub 2025 Jan 28.

Exploring the frontiers of condensed-phase chemistry with a general reactive machine learning potential.利用通用反应性机器学习势能探索凝聚相化学的前沿领域。

Nat Chem. 2024 May;16(5):727-734. doi: 10.1038/s41557-023-01427-3. Epub 2024 Mar 7.

Including Physics-Informed Atomization Constraints in Neural Networks for Reactive Chemistry.在用于反应化学的神经网络中纳入物理知识雾化约束条件。

J Chem Inf Model. 2025 May 12;65(9):4367-4380. doi: 10.1021/acs.jcim.5c00341. Epub 2025 Apr 29.

Development of Multimodal Machine Learning Potentials: Toward a Physics-Aware Artificial Intelligence.多模态机器学习潜力的发展：迈向具有物理意识的人工智能。

Acc Chem Res. 2021 Apr 6;54(7):1575-1585. doi: 10.1021/acs.accounts.0c00868. Epub 2021 Mar 13.

ArcaNN: automated enhanced sampling generation of training sets for chemically reactive machine learning interatomic potentials.ArcaNN：用于化学反应性机器学习原子间势的训练集自动增强采样生成

Digit Discov. 2024 Oct 30;4(1):54-72. doi: 10.1039/d4dd00209a. eCollection 2025 Jan 15.

Rapid quantum mechanical models for the computational estimation of C-H bond dissociation energies as a measure of metabolic stability.用于计算C-H键解离能以衡量代谢稳定性的快速量子力学模型。

Mol Pharm. 2004 Mar-Apr;1(2):128-35. doi: 10.1021/mp049977r.

A big data approach to the ultra-fast prediction of DFT-calculated bond energies.一种大数据方法，可实现对 DFT 计算键能的超快速预测。

J Cheminform. 2013 Jul 12;5:34. doi: 10.1186/1758-2946-5-34. eCollection 2013.

Efficient exploration of reaction pathways using reaction databases and active learning.利用反应数据库和主动学习高效探索反应途径。

J Chem Phys. 2025 Mar 21;162(11). doi: 10.1063/5.0235715.

Data-Efficient Multifidelity Training for High-Fidelity Machine Learning Interatomic Potentials.用于高保真机器学习原子间势的数据高效多保真训练

J Am Chem Soc. 2025 Jan 8;147(1):1042-1054. doi: 10.1021/jacs.4c14455. Epub 2024 Dec 17.

Atomistic modeling of the mechanical properties: the rise of machine learning interatomic potentials.原子级建模的力学性能：机器学习原子间势的兴起。

Mater Horiz. 2023 Jun 6;10(6):1956-1968. doi: 10.1039/d3mh00125c.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验

通过物理约束数据增强改进反应性机器学习势能的键解离

Improving Bond Dissociations of Reactive Machine Learning Potentials through Physics-Constrained Data Augmentation.

作者信息

机构信息

出版信息

相似文献

文献检索

文件翻译

深度研究

Suppr 超能文献

相似文献