用于分子激发能的多保真机器学习

Multifidelity Machine Learning for Molecular Excitation Energies.

作者信息

Vinod Vivin, Maity Sayan, Zaspel Peter, Kleinekathöfer Ulrich

机构信息

School of Mathematics and Natural Science, University of Wuppertal, Wuppertal 42119, Germany.

School of Computer Science and Engineering, Constructor University, Campus Ring 1, Bremen 28759, Germany.

出版信息

J Chem Theory Comput. 2023 Nov 14;19(21):7658-7670. doi: 10.1021/acs.jctc.3c00882. Epub 2023 Oct 20.

DOI:10.1021/acs.jctc.3c00882

PMID:37862054

Abstract

The accurate but fast calculation of molecular excited states is still a very challenging topic. For many applications, detailed knowledge of the energy funnel in larger molecular aggregates is of key importance, requiring highly accurate excitation energies. To this end, machine learning techniques can be a very useful tool, though the cost of generating highly accurate training data sets still remains a severe challenge. To overcome this hurdle, this work proposes the use of multifidelity machine learning where very little training data from high accuracies is combined with cheaper and less accurate data to achieve the accuracy of the costlier level. In the present study, the approach is employed to predict vertical excitation energies to the first excited state for three molecules of increasing size, namely, benzene, naphthalene, and anthracene. The energies are trained and tested for conformations stemming from classical molecular dynamics and density functional based tight-binding simulations. It can be shown that the multifidelity machine learning model can achieve the same accuracy as a machine learning model built only on high-cost training data while expending a much lower computational effort to generate the data. The numerical gain observed in these benchmark test calculations was over a factor of 30 but certainly can be much higher for high-accuracy data.

摘要

准确而快速地计算分子激发态仍然是一个极具挑战性的课题。对于许多应用而言，深入了解更大分子聚集体中的能量漏斗至关重要，这需要高精度的激发能。为此，机器学习技术可能是一个非常有用的工具，不过生成高精度训练数据集的成本仍然是一个严峻的挑战。为克服这一障碍，本工作提出使用多保真度机器学习，即将来自高精度的极少训练数据与成本较低且精度较低的数据相结合，以达到更高成本水平的精度。在本研究中，该方法被用于预测三种尺寸不断增大的分子（即苯、萘和蒽）到第一激发态的垂直激发能。对源自经典分子动力学和基于密度泛函的紧束缚模拟的构象的能量进行了训练和测试。结果表明，多保真度机器学习模型能够达到仅基于高成本训练数据构建的机器学习模型相同的精度，同时在生成数据时耗费的计算量要低得多。在这些基准测试计算中观察到的数值增益超过30倍，但对于高精度数据肯定可以更高。

相似文献

Multifidelity Machine Learning for Molecular Excitation Energies.

J Chem Theory Comput. 2023 Nov 14;19(21):7658-7670. doi: 10.1021/acs.jctc.3c00882. Epub 2023 Oct 20.

Quasi-Classical Trajectory Calculation of Rate Constants Using an Ab Initio Trained Machine Learning Model (aML-MD) with Multifidelity Data.

J Phys Chem A. 2024 May 2;128(17):3449-3457. doi: 10.1021/acs.jpca.4c00750. Epub 2024 Apr 20.

Multifidelity Statistical Machine Learning for Molecular Crystal Structure Prediction.

J Phys Chem A. 2020 Oct 1;124(39):8065-8078. doi: 10.1021/acs.jpca.0c05006. Epub 2020 Sep 17.

Construction of Highly Accurate Machine Learning Potential Energy Surfaces for Excited-State Dynamics Simulations Based on Low-Level Data Sets.

J Phys Chem A. 2024 Jul 18;128(28):5516-5524. doi: 10.1021/acs.jpca.4c02028. Epub 2024 Jul 2.

Multifidelity Information Fusion with Machine Learning: A Case Study of Dopant Formation Energies in Hafnia.

ACS Appl Mater Interfaces. 2019 Jul 17;11(28):24906-24918. doi: 10.1021/acsami.9b02174. Epub 2019 Apr 16.

Fast Near Potential Energy Surfaces Using Machine Learning.

J Phys Chem A. 2022 Jun 30;126(25):4013-4024. doi: 10.1021/acs.jpca.2c02243. Epub 2022 Jun 17.

Coupled-cluster calculations of the lowest 0-0 bands of the electronic excitation spectrum of naphthalene.

Phys Chem Chem Phys. 2014 Jun 7;16(21):9859-65. doi: 10.1039/c3cp54421d. Epub 2014 Jan 9.

A Look Inside the Black Box of Machine Learning Photodynamics Simulations.

Acc Chem Res. 2022 Jul 19;55(14):1972-1984. doi: 10.1021/acs.accounts.2c00288. Epub 2022 Jul 7.

MF-PCBA: Multifidelity High-Throughput Screening Benchmarks for Drug Discovery and Machine Learning.

J Chem Inf Model. 2023 May 8;63(9):2667-2678. doi: 10.1021/acs.jcim.2c01569. Epub 2023 Apr 14.

Comparison of multifidelity machine learning models for potential energy surfaces.

J Chem Phys. 2023 Jul 28;159(4). doi: 10.1063/5.0158919.

引用本文的文献

Quantum chemical properties of chlorinated polycyclic aromatic hydrocarbons for delta machine learning.

Sci Data. 2025 Jun 21;12(1):1059. doi: 10.1038/s41597-025-05383-0.

Predicting Molecular Energies of Small Organic Molecules With Multi-Fidelity Methods.

J Comput Chem. 2025 Mar 5;46(6):e70056. doi: 10.1002/jcc.70056.

QeMFi: A Multifidelity Dataset of Quantum Chemical Properties of Diverse Molecules.

Sci Data. 2025 Feb 3;12(1):202. doi: 10.1038/s41597-024-04247-3.

Protein Effects on the Excitation Energies and Exciton Dynamics of the CP24 Antenna Complex.

J Phys Chem B. 2024 May 30;128(21):5201-5217. doi: 10.1021/acs.jpcb.4c01637. Epub 2024 May 16.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

用于分子激发能的多保真机器学习

Multifidelity Machine Learning for Molecular Excitation Energies.

作者信息

Vinod Vivin, Maity Sayan, Zaspel Peter, Kleinekathöfer Ulrich

机构信息

School of Mathematics and Natural Science, University of Wuppertal, Wuppertal 42119, Germany.

School of Computer Science and Engineering, Constructor University, Campus Ring 1, Bremen 28759, Germany.

出版信息

J Chem Theory Comput. 2023 Nov 14;19(21):7658-7670. doi: 10.1021/acs.jctc.3c00882. Epub 2023 Oct 20.

DOI:10.1021/acs.jctc.3c00882

PMID:37862054

Abstract

摘要

用于分子激发能的多保真机器学习

Multifidelity Machine Learning for Molecular Excitation Energies.

作者信息

机构信息

出版信息

相似文献

引用本文的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

用于分子激发能的多保真机器学习

Multifidelity Machine Learning for Molecular Excitation Energies.

作者信息

机构信息

出版信息

相似文献

引用本文的文献