用于势能面的机器学习：一个广泛的数据库和方法评估

Machine learning for potential energy surfaces: An extensive database and assessment of methods.

作者信息

Schmitz Gunnar, Godtliebsen Ian Heide, Christiansen Ove

机构信息

Department of Chemistry, Aarhus Universitet, DK-8000 Aarhus, Denmark.

出版信息

J Chem Phys. 2019 Jun 28;150(24):244113. doi: 10.1063/1.5100141.

DOI:10.1063/1.5100141

PMID:31255074

Abstract

On the basis of a new extensive database constructed for the purpose, we assess various Machine Learning (ML) algorithms to predict energies in the framework of potential energy surface (PES) construction and discuss black box character, robustness, and efficiency. The database for training ML algorithms in energy predictions based on the molecular structure contains SCF, RI-MP2, RI-MP2-F12, and CCSD(F12)(T) data for around 10.5 × 10 configurations of 15 small molecules. The electronic energies as function of molecular structure are computed from both static and iteratively refined grids in the context of automized PES construction for anharmonic vibrational computations within the n-mode expansion. We explore the performance of a range of algorithms including Gaussian Process Regression (GPR), Kernel Ridge Regression, Support Vector Regression, and Neural Networks (NNs). We also explore methods related to GPR such as sparse Gaussian Process Regression, Gaussian process Markov Chains, and Sparse Gaussian Process Markov Chains. For NNs, we report some explorations of architecture, activation functions, and numerical settings. Different delta-learning strategies are considered, and the use of delta learning targeting CCSD(F12)(T) predictions using, for example, RI-MP2 combined with machine learned CCSD(F12)(T)-RI-MP2 differences is found to be an attractive option.

摘要

基于为此目的构建的一个新的广泛数据库，我们评估了各种机器学习（ML）算法，以在势能面（PES）构建框架中预测能量，并讨论了黑箱特性、稳健性和效率。用于基于分子结构进行能量预测的ML算法训练的数据库包含15个小分子约10.5×10种构型的SCF、RI-MP2、RI-MP2-F12和CCSD(F12)(T)数据。在n模式展开中用于非谐振动计算的自动化PES构建背景下，从静态和迭代细化网格计算作为分子结构函数的电子能量。我们探索了一系列算法的性能，包括高斯过程回归（GPR）、核岭回归、支持向量回归和神经网络（NNs）。我们还探索了与GPR相关的方法，如稀疏高斯过程回归、高斯过程马尔可夫链和稀疏高斯过程马尔可夫链。对于神经网络，我们报告了一些关于架构、激活函数和数值设置的探索。考虑了不同的增量学习策略，发现使用例如RI-MP2结合机器学习的CCSD(F12)(T)-RI-MP2差异来针对CCSD(F12)(T)预测进行增量学习是一个有吸引力的选择。

相似文献

Machine learning for potential energy surfaces: An extensive database and assessment of methods.

J Chem Phys. 2019 Jun 28;150(24):244113. doi: 10.1063/1.5100141.

A Gaussian process regression adaptive density guided approach for potential energy surface construction.

J Chem Phys. 2020 Aug 14;153(6):064105. doi: 10.1063/5.0015344.

Accuracy of Frequencies Obtained with the Aid of Explicitly Correlated Wave Function Based Methods.

J Chem Theory Comput. 2017 Aug 8;13(8):3602-3613. doi: 10.1021/acs.jctc.7b00476. Epub 2017 Jul 24.

Extrapolating MP2 and CCSD explicitly correlated correlation energies to the complete basis set limit with first and second row correlation consistent basis sets.

J Chem Phys. 2009 Nov 21;131(19):194105. doi: 10.1063/1.3265857.

Homogeneous and heterogeneous noncovalent dimers of formaldehyde and thioformaldehyde: structures, energetics, and vibrational frequencies.

J Phys Chem A. 2014 May 8;118(18):3376-85. doi: 10.1021/jp502588h. Epub 2014 Apr 28.

Characterization of the potential energy surfaces of two small but challenging noncovalent dimers: (P2 )2 and (PCCP)2.

J Comput Chem. 2014 Mar 5;35(6):479-87. doi: 10.1002/jcc.23522. Epub 2014 Jan 9.

Machine Learning Models of Vibrating HCO: Comparing Reproducing Kernels, FCHL, and PhysNet.

J Phys Chem A. 2020 Oct 22;124(42):8853-8865. doi: 10.1021/acs.jpca.0c05979. Epub 2020 Oct 13.

Anharmonic zero point vibrational energies: tipping the scales in accurate thermochemistry calculations?

J Chem Phys. 2013 Jan 28;138(4):044311. doi: 10.1063/1.4777568.

Effects of Heterogeneity in Small π-Type Dimers: Homogeneous and Mixed Dimers of Diacetylene and Cyanogen.

J Chem Theory Comput. 2012 Nov 13;8(11):4279-84. doi: 10.1021/ct300644a. Epub 2012 Sep 6.

Big Changes for Small Noncovalent Dimers: Revisiting the Potential Energy Surfaces of (P2)2 and (PCCP)2 with CCSD(T) Optimizations and Vibrational Frequencies.

J Chem Theory Comput. 2016 Apr 12;12(4):1534-41. doi: 10.1021/acs.jctc.5b01105. Epub 2016 Mar 30.

引用本文的文献

From Organic Fragments to Photoswitchable Catalysts: The OFF-ON Structural Repository for Transferable Kernel-Based Potentials.

J Chem Inf Model. 2024 Feb 26;64(4):1201-1212. doi: 10.1021/acs.jcim.3c01953. Epub 2024 Feb 6.

Artificial Neural Network-Derived Unified Six-Dimensional Potential Energy Surface for Tetra Atomic Isomers of the Biogenic [H, C, N, O] System.

J Chem Theory Comput. 2023 Feb 28;19(4):1186-1196. doi: 10.1021/acs.jctc.2c00915. Epub 2023 Feb 3.

OntoPESScan: An Ontology for Potential Energy Surface Scans.

ACS Omega. 2023 Jan 3;8(2):2462-2475. doi: 10.1021/acsomega.2c06948. eCollection 2023 Jan 17.

Molecular Conformer Search with Low-Energy Latent Space.

J Chem Theory Comput. 2022 Jul 12;18(7):4574-4585. doi: 10.1021/acs.jctc.2c00290. Epub 2022 Jun 13.

Choosing the right molecular machine learning potential.

Chem Sci. 2021 Sep 15;12(43):14396-14413. doi: 10.1039/d1sc03564a. eCollection 2021 Nov 10.

Theoretical studies on triplet-state driven dissociation of formaldehyde by quasi-classical molecular dynamics simulation on machine-learning potential energy surface.

J Chem Phys. 2021 Dec 7;155(21):214105. doi: 10.1063/5.0067176.

Gaussian Process Regression for Materials and Molecules.

Chem Rev. 2021 Aug 25;121(16):10073-10141. doi: 10.1021/acs.chemrev.1c00022. Epub 2021 Aug 16.

Efficient Amino Acid Conformer Search with Bayesian Optimization.

J Chem Theory Comput. 2021 Mar 9;17(3):1955-1966. doi: 10.1021/acs.jctc.0c00648. Epub 2021 Feb 12.

On the synergy of matrix-isolation infrared spectroscopy and vibrational configuration interaction computations.

Theor Chem Acc. 2020;139(12):174. doi: 10.1007/s00214-020-02682-0. Epub 2020 Nov 9.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

用于势能面的机器学习：一个广泛的数据库和方法评估

Machine learning for potential energy surfaces: An extensive database and assessment of methods.

作者信息

机构信息

出版信息

相似文献

引用本文的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

相似文献

引用本文的文献