Computation of CCSD(T)-Quality NMR Chemical Shifts via Δ-Machine Learning from DFT.

Author information

Mulliken Center for Theoretical Chemistry, Clausius Institute for Physical and Theoretical Chemistry, University of Bonn, Beringstr. 4, 53115 Bonn, Germany.

Publication information

J Chem Theory Comput. 2023 Jun 27;19(12):3601-3615. doi: 10.1021/acs.jctc.3c00165. Epub 2023 Jun 1.

DOI:10.1021/acs.jctc.3c00165

PMID:37262324

Abstract

NMR spectroscopy undoubtedly plays a central role in determining molecular structures across different chemical disciplines, and the accurate computational prediction of NMR parameters is highly desirable. In this work, a new Δ-machine learning approach is presented to correct DFT-computed NMR chemical shifts using input features from the calculation and in addition highly accurate reference data at the CCSD(T)/pcSseg-2 level of theory with a basis set extrapolation scheme. The model is trained on a data set containing 1000 optimized and geometrically distorted structures of small organic molecules comprising most elements of the first three periods and containing data for 7090 ¹H and 4230 ¹³C NMR chemical shifts. Applied to the PBE0/pcSseg-2 method, the mean absolute deviation (MAD) on the internal NMR shift test set is reduced by 81% for ¹H and 92% for ¹³C at virtually no additional computational cost. For 12 different DFT functional and basis set combinations, the MAD of the ML-corrected NMR shifts ranges from 0.021 to 0.039 ppm (¹H) and from 0.38 to 1.07 ppm (¹³C). Importantly, the new method consistently outperforms the simple and widely used linear regression correction technique. This behavior is reproduced on three different external benchmark sets, confirming the generality and robustness of the correction scheme, which can easily be applied in DFT-based spectral simulations.
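
The difference between the linear regression baseline and a Δ-learning correction can be illustrated with a minimal Python sketch. It is not the authors' model: the abstract does not specify the input features or the network architecture, so the sketch assumes a plain least-squares model on three hypothetical features, and all shifts are synthetic numbers generated for the demonstration. The function names (`mad`, `fit_linear_correction`, `fit_delta_correction`) are illustrative only.

```python
import numpy as np

def mad(pred, ref):
    """Mean absolute deviation between predicted and reference shifts (ppm)."""
    return float(np.mean(np.abs(np.asarray(pred) - np.asarray(ref))))

def fit_linear_correction(dft, ref):
    """Baseline: fit ref ~ a * dft + b by least squares."""
    a, b = np.polyfit(dft, ref, deg=1)
    return lambda x: a * np.asarray(x) + b

def fit_delta_correction(features, dft, ref):
    """Delta-learning: model the residual (ref - dft) from per-nucleus features.

    ASSUMPTION: a linear least-squares model stands in for the ML model;
    the DFT shift itself is used as one of the input features.
    """
    X = np.column_stack([features, dft, np.ones(len(dft))])
    w, *_ = np.linalg.lstsq(X, ref - dft, rcond=None)

    def correct(feat, x):
        Xn = np.column_stack([feat, x, np.ones(len(x))])
        return np.asarray(x) + Xn @ w  # corrected shift = DFT shift + predicted delta

    return correct

# Synthetic data (NOT from the paper): the "DFT" error depends on a scaling
# factor, an offset, and three hypothetical local descriptors.
rng = np.random.default_rng(0)
n = 400
ref = rng.uniform(0.0, 200.0, n)            # stand-in for high-level reference shifts
features = rng.normal(size=(n, 3))          # hypothetical descriptors
dft = (1.03 * ref - 2.0
       + features @ np.array([1.5, -0.8, 0.4])
       + rng.normal(scale=0.2, size=n))     # stand-in for DFT shifts

train, test = slice(0, 300), slice(300, n)
linear = fit_linear_correction(dft[train], ref[train])
delta = fit_delta_correction(features[train], dft[train], ref[train])

mad_raw = mad(dft[test], ref[test])
mad_lin = mad(linear(dft[test]), ref[test])
mad_delta = mad(delta(features[test], dft[test]), ref[test])
print(f"MAD raw: {mad_raw:.3f}  linear: {mad_lin:.3f}  delta: {mad_delta:.3f}")
```

In this toy setting the linear fit can only remove the systematic scaling and offset, while the Δ-model also uses the per-nucleus descriptors, which is the general reason a feature-based correction can outperform a purely linear one.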

Similar articles

DFT computational schemes for ¹H and ¹³C NMR chemical shifts of natural products, exemplified by strychnine.

Magn Reson Chem. 2020 Jan;58(1):56-64. doi: 10.1002/mrc.4922. Epub 2019 Jul 31.

General Protocol for the Accurate Prediction of Molecular ¹³C/¹H NMR Chemical Shifts via Machine Learning Augmented DFT.

J Chem Inf Model. 2020 Aug 24;60(8):3746-3754. doi: 10.1021/acs.jcim.0c00388. Epub 2020 Jul 20.

Calculation of ¹⁵N and ³¹P NMR Chemical Shifts of Azoles, Phospholes, and Phosphazoles: A Gateway to Higher Accuracy at Less Computational Cost.

J Phys Chem A. 2018 Aug 23;122(33):6746-6759. doi: 10.1021/acs.jpca.8b05161. Epub 2018 Aug 9.

Do Double-Hybrid Exchange-Correlation Functionals Provide Accurate Chemical Shifts? A Benchmark Assessment for Proton NMR.

J Chem Theory Comput. 2021 Nov 9;17(11):6876-6885. doi: 10.1021/acs.jctc.1c00604. Epub 2021 Oct 12.

Machine learning-based correction for spin-orbit coupling effects in NMR chemical shift calculations.

Phys Chem Chem Phys. 2024 Feb 7;26(6):4870-4884. doi: 10.1039/d3cp05556f.

On the Efficiency of the Density Functional Theory (DFT)-Based Computational Protocol for ¹H and ¹³C Nuclear Magnetic Resonance (NMR) Chemical Shifts of Natural Products: Studying the Accuracy of the pecS-n (n = 1, 2) Basis Sets.

Int J Mol Sci. 2023 Sep 27;24(19):14623. doi: 10.3390/ijms241914623.

MIM-ML: A Novel Quantum Chemical Fragment-Based Random Forest Model for Accurate Prediction of NMR Chemical Shifts of Nucleic Acids.

J Chem Theory Comput. 2023 Oct 10;19(19):6632-6642. doi: 10.1021/acs.jctc.3c00563. Epub 2023 Sep 13.

On the accuracy of the GIAO-DFT calculation of 15N NMR chemical shifts of the nitrogen-containing heterocycles--a gateway to better agreement with experiment at lower computational cost.

Magn Reson Chem. 2014 May;52(5):222-30. doi: 10.1002/mrc.4055. Epub 2014 Feb 27.

Towards the versatile DFT and MP2 computational schemes for 31P NMR chemical shifts taking into account relativistic corrections.

Magn Reson Chem. 2014 Nov;52(11):699-710. doi: 10.1002/mrc.4122. Epub 2014 Aug 22.

Cited by

Quantum chemical properties of chlorinated polycyclic aromatic hydrocarbons for delta machine learning.

Sci Data. 2025 Jun 21;12(1):1059. doi: 10.1038/s41597-025-05383-0.

The interplay of density functional selection and crystal structure for accurate NMR chemical shift predictions.

Faraday Discuss. 2025 Jan 8;255(0):119-142. doi: 10.1039/d4fd00072b.

Bent naphthodithiophenes: synthesis and characterization of isomeric fluorophores.

RSC Adv. 2024 Aug 12;14(35):25120-25129. doi: 10.1039/d4ra04850d.

Highly Accurate Prediction of NMR Chemical Shifts from Low-Level Quantum Mechanics Calculations Using Machine Learning.

J Chem Theory Comput. 2024 Mar 12;20(5):2152-2166. doi: 10.1021/acs.jctc.3c01256. Epub 2024 Feb 8.

Frontiers of molecular crystal structure prediction for pharmaceuticals and functional organic materials.

Chem Sci. 2023 Nov 3;14(46):13290-13312. doi: 10.1039/d3sc03903j. eCollection 2023 Nov 29.
