通过提高数据效率对最高占据分子轨道-最低未占据分子轨道能隙进行的选定机器学习。

Selected machine learning of HOMO-LUMO gaps with improved data-efficiency.

作者信息

Mazouin Bernard, Schöpfer Alexandre Alain, von Lilienfeld O Anatole

机构信息

University of Vienna, Faculty of Physics and Vienna Doctoral School in Physics Kolingasse 14-16 1090 Vienna Austria.

Department of Chemistry, University of Basel Klingelbergstrasse 70 4056 Basel Switzerland

出版信息

Mater Adv. 2022 Sep 20;3(22):8306-8316. doi: 10.1039/d2ma00742h. eCollection 2022 Nov 14.

DOI:10.1039/d2ma00742h

PMID:36561279

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC9662596/

Abstract

Despite their relevance for organic electronics, quantum machine learning (QML) models of molecular electronic properties, such as HOMO-LUMO-gaps, often struggle to achieve satisfying data-efficiency as measured by decreasing prediction errors for increasing training set sizes. We demonstrate that partitioning training sets into different chemical classes prior to training results in independently trained QML models with overall reduced training data needs. For organic molecules drawn from previously published QM7 and QM9-data-sets we have identified and exploited three relevant classes corresponding to compounds containing either aromatic rings and carbonyl groups, or single unsaturated bonds, or saturated bonds The selected QML models of band-gaps (considered at GW and hybrid DFT levels of theory) reach mean absolute prediction errors of ∼0.1 eV for up to an order of magnitude fewer training molecules than for QML models trained on randomly selected molecules. Comparison to Δ-QML models of band-gaps indicates that selected QML exhibit superior data-efficiency. Our findings suggest that selected QML, based on simple classifications prior to training, could help to successfully tackle challenging quantum property screening tasks of large libraries with high fidelity and low computational burden.

摘要

尽管分子电子性质的量子机器学习（QML）模型（如最高占据分子轨道-最低未占据分子轨道能隙）与有机电子学相关，但这些模型往往难以实现令人满意的数据效率，这可通过随着训练集规模增加预测误差减小来衡量。我们证明，在训练前将训练集划分为不同化学类别，会得到独立训练的QML模型，且总体训练数据需求降低。对于从先前发表的QM7和QM9数据集提取的有机分子，我们识别并利用了三个相关类别，分别对应含有芳环和羰基、或单不饱和键、或饱和键的化合物。所选的带隙QML模型（在GW和杂化密度泛函理论水平下考虑）对于训练分子数量比在随机选择分子上训练的QML模型少一个数量级的情况，达到了约0.1 eV的平均绝对预测误差。与带隙的Δ-QML模型比较表明，所选的QML表现出卓越的数据效率。我们的研究结果表明，基于训练前简单分类的所选QML，有助于成功应对大型库中具有挑战性的量子性质筛选任务，且具有高保真度和低计算负担。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9fa5/9662596/302ad829db62/d2ma00742h-f3.jpg

相似文献

Selected machine learning of HOMO-LUMO gaps with improved data-efficiency.

Mater Adv. 2022 Sep 20;3(22):8306-8316. doi: 10.1039/d2ma00742h. eCollection 2022 Nov 14.

Alchemical and structural distribution based representation for universal quantum machine learning.

J Chem Phys. 2018 Jun 28;148(24):241717. doi: 10.1063/1.5020710.

An orbital-based representation for accurate quantum machine learning.

J Chem Phys. 2022 Mar 21;156(11):114101. doi: 10.1063/5.0083301.

Prediction Errors of Molecular Machine Learning Models Lower than Hybrid DFT Error.

J Chem Theory Comput. 2017 Nov 14;13(11):5255-5264. doi: 10.1021/acs.jctc.7b00577. Epub 2017 Oct 10.

Quantum Machine Learning in Materials Prediction: A Case Study on ABO Perovskite Structures.

J Phys Chem Lett. 2023 Aug 10;14(31):6940-6947. doi: 10.1021/acs.jpclett.3c01703. Epub 2023 Jul 27.

Quantum machine learning with differential privacy.

Sci Rep. 2023 Feb 11;13(1):2453. doi: 10.1038/s41598-022-24082-z.

Accurate GW frontier orbital energies of 134 kilo molecules.

Sci Data. 2023 Sep 5;10(1):581. doi: 10.1038/s41597-023-02486-4.

Toward DMC Accuracy Across Chemical Space with Scalable Δ-QML.

J Chem Theory Comput. 2023 Mar 28;19(6):1711-1721. doi: 10.1021/acs.jctc.2c01058. Epub 2023 Mar 1.

Kernel based quantum machine learning at record rate: Many-body distribution functionals as compact representations.

J Chem Phys. 2023 Jul 21;159(3). doi: 10.1063/5.0152215.

Transferable Multilevel Attention Neural Network for Accurate Prediction of Quantum Chemistry Properties via Multitask Learning.

J Chem Inf Model. 2021 Mar 22;61(3):1066-1082. doi: 10.1021/acs.jcim.0c01224. Epub 2021 Feb 25.

引用本文的文献

Graph Convolutional Neural Network-Enabled Frontier Molecular Orbital Prediction: A Case Study with Neurotransmitters and Antidepressants.

J Chem Inf Model. 2025 Jul 28;65(14):7447-7462. doi: 10.1021/acs.jcim.5c00724. Epub 2025 Jul 17.

Two-dimensional coronene fractals: modified reverse degree indices, comparative analysis of information entropy and predictive modeling of spectral properties.

Front Chem. 2025 May 6;13:1588942. doi: 10.3389/fchem.2025.1588942. eCollection 2025.

Effect of Molecular Structure on the B3LYP-Computed HOMO-LUMO Gap: A Structure -Property Relationship Using Atomic Signatures.

ACS Omega. 2025 Jan 15;10(3):2799-2808. doi: 10.1021/acsomega.4c08626. eCollection 2025 Jan 28.

3DReact: Geometric Deep Learning for Chemical Reactions.

J Chem Inf Model. 2024 Aug 12;64(15):5771-5785. doi: 10.1021/acs.jcim.4c00104. Epub 2024 Jul 15.

Electronic Excited States from Physically Constrained Machine Learning.

ACS Cent Sci. 2024 Feb 29;10(3):637-648. doi: 10.1021/acscentsci.3c01480. eCollection 2024 Mar 27.

Fast and accurate excited states predictions: machine learning and diabatization.

Phys Chem Chem Phys. 2024 Jan 31;26(5):4306-4319. doi: 10.1039/d3cp05685f.

SPAM(a,b): Encoding the Density Information from Guess Hamiltonian in Quantum Machine Learning Representations.

J Chem Theory Comput. 2024 Feb 13;20(3):1108-1117. doi: 10.1021/acs.jctc.3c01040. Epub 2024 Jan 16.

Deep learning workflow for the inverse design of molecules with specific optoelectronic properties.

Sci Rep. 2023 Nov 16;13(1):20031. doi: 10.1038/s41598-023-45385-9.

本文引用的文献

Exploring chemical compound space with quantum-based machine learning.

Nat Rev Chem. 2020 Jul;4(7):347-358. doi: 10.1038/s41570-020-0189-9. Epub 2020 Jun 12.

Equivariant representations for molecular Hamiltonians and N-center atomic-scale properties.

J Chem Phys. 2022 Jan 7;156(1):014115. doi: 10.1063/5.0072784.

Ab Initio Machine Learning in Chemical Compound Space.

Chem Rev. 2021 Aug 25;121(16):10001-10036. doi: 10.1021/acs.chemrev.0c01303. Epub 2021 Aug 13.

Machine learning of free energies in chemical compound space using ensemble representations: Reaching experimental uncertainty for solvation.

J Chem Phys. 2021 Apr 7;154(13):134113. doi: 10.1063/5.0041548.

Transferable Multilevel Attention Neural Network for Accurate Prediction of Quantum Chemistry Properties via Multitask Learning.

J Chem Inf Model. 2021 Mar 22;61(3):1066-1082. doi: 10.1021/acs.jcim.0c01224. Epub 2021 Feb 25.

Retrospective on a decade of machine learning for chemical discovery.

Nat Commun. 2020 Sep 29;11(1):4895. doi: 10.1038/s41467-020-18556-9.

Quantum machine learning using atom-in-molecule-based fragments selected on the fly.

Nat Chem. 2020 Oct;12(10):945-951. doi: 10.1038/s41557-020-0527-z. Epub 2020 Sep 14.

Combining SchNet and SHARC: The SchNarc Machine Learning Approach for Excited-State Dynamics.

J Phys Chem Lett. 2020 May 21;11(10):3828-3834. doi: 10.1021/acs.jpclett.0c00527. Epub 2020 May 1.

FCHL revisited: Faster and more accurate quantum machine learning.

J Chem Phys. 2020 Jan 31;152(4):044107. doi: 10.1063/1.5126701.

Unifying machine learning and quantum chemistry with a deep neural network for molecular wavefunctions.

Nat Commun. 2019 Nov 15;10(1):5024. doi: 10.1038/s41467-019-12875-2.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

通过提高数据效率对最高占据分子轨道-最低未占据分子轨道能隙进行的选定机器学习。

Selected machine learning of HOMO-LUMO gaps with improved data-efficiency.

作者信息

Mazouin Bernard, Schöpfer Alexandre Alain, von Lilienfeld O Anatole

机构信息

University of Vienna, Faculty of Physics and Vienna Doctoral School in Physics Kolingasse 14-16 1090 Vienna Austria.

Department of Chemistry, University of Basel Klingelbergstrasse 70 4056 Basel Switzerland

出版信息

Mater Adv. 2022 Sep 20;3(22):8306-8316. doi: 10.1039/d2ma00742h. eCollection 2022 Nov 14.

DOI:10.1039/d2ma00742h

PMID:36561279

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC9662596/

Abstract

摘要

通过提高数据效率对最高占据分子轨道-最低未占据分子轨道能隙进行的选定机器学习。

Selected machine learning of HOMO-LUMO gaps with improved data-efficiency.

作者信息

机构信息

出版信息

相似文献

引用本文的文献

本文引用的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

通过提高数据效率对最高占据分子轨道-最低未占据分子轨道能隙进行的选定机器学习。

Selected machine learning of HOMO-LUMO gaps with improved data-efficiency.

作者信息

机构信息

出版信息

相似文献

引用本文的文献

本文引用的文献