Highly Accurate Prediction of NMR Chemical Shifts from Low-Level Quantum Mechanics Calculations Using Machine Learning.

Author information

Li Jie, Liang Jiashu, Wang Zhe, Ptaszek Aleksandra L, Liu Xiao, Ganoe Brad, Head-Gordon Martin, Head-Gordon Teresa

Affiliations

Pitzer Center for Theoretical Chemistry, Department of Chemistry, University of California, Berkeley, California 94720, United States.

Christian Doppler Laboratory for High-Content Structural Biology and Biotechnology, Department of Structural and Computational Biology, Max Perutz Laboratories, University of Vienna, Campus Vienna Biocenter 5, Vienna 1030, Austria.

Publication information

J Chem Theory Comput. 2024 Mar 12;20(5):2152-2166. doi: 10.1021/acs.jctc.3c01256. Epub 2024 Feb 8.

DOI:10.1021/acs.jctc.3c01256

PMID:38331423

Full text link: https://pmc.ncbi.nlm.nih.gov/articles/PMC11702896/

Abstract

Theoretical predictions of NMR chemical shifts from first principles can greatly facilitate experimental interpretation and structure identification of molecules in gas, solution, and solid-state phases. However, accurate prediction of chemical shifts using the gold-standard coupled cluster with singles, doubles, and perturbative triple excitations [CCSD(T)] method with a complete basis set (CBS) can be prohibitively expensive. By contrast, machine learning (ML) methods offer inexpensive alternatives for chemical shift prediction but are hampered by poor generalization to molecules outside the original training set. Here, we propose several new ideas for ML-based chemical shift prediction for H, C, N, and O. First, we introduce a novel feature representation, based on the atomic chemical shielding tensors computed within a molecular environment using an inexpensive quantum mechanics (QM) method, and train the model to predict NMR chemical shieldings at the level of a high-level composite theory that approaches the accuracy of CCSD(T)/CBS. In addition, we train the ML model through a new progressive active learning workflow that reduces the total number of expensive high-level composite calculations required while allowing the model to continuously improve on unseen data. Furthermore, the algorithm provides an error estimate, signaling potential unreliability in a prediction if the estimated error is large. Finally, we introduce a novel approach to preserving the rotational invariance of the features using tensor environment vectors (TEVs), which yields an ML model with higher accuracy than a similar model using data augmentation. We illustrate the predictive capacity of the resulting inexpensive shift machine learning (iShiftML) models across several benchmarks, including unseen molecules in the NS372 data set, gas-phase experimental chemical shifts for small organic molecules, and much larger and more complex natural products in which we can accurately differentiate between subtle diastereomers based on chemical shift assignments.
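
The rotational-invariance point in the abstract can be illustrated with a small sketch. The code below is not the TEV construction from the paper, which the abstract does not specify; it only shows, under stated assumptions, why features built from rotation invariants of a shielding tensor need no data augmentation. The tensor values, the two chosen invariants (isotropic shielding and squared Frobenius norm), and all function names are hypothetical.

```python
import math

def matmul(a, b):
    """Multiply two 3x3 matrices given as nested lists."""
    return [[sum(a[i][k] * b[k][j] for k in range(3)) for j in range(3)]
            for i in range(3)]

def transpose(a):
    """Transpose a 3x3 matrix."""
    return [[a[j][i] for j in range(3)] for i in range(3)]

def rotation_z(theta):
    """Rotation matrix about the z axis by angle theta (radians)."""
    c, s = math.cos(theta), math.sin(theta)
    return [[c, -s, 0.0], [s, c, 0.0], [0.0, 0.0, 1.0]]

def rotation_x(theta):
    """Rotation matrix about the x axis by angle theta (radians)."""
    c, s = math.cos(theta), math.sin(theta)
    return [[1.0, 0.0, 0.0], [0.0, c, -s], [0.0, s, c]]

def rotate_tensor(sigma, rot):
    """Rotate a rank-2 tensor with the molecule: sigma' = R sigma R^T."""
    return matmul(matmul(rot, sigma), transpose(rot))

def invariant_features(sigma):
    """Two rotation invariants of a 3x3 tensor (an illustrative choice only):
    the isotropic value trace/3 and the squared Frobenius norm."""
    iso = (sigma[0][0] + sigma[1][1] + sigma[2][2]) / 3.0
    frob_sq = sum(sigma[i][j] ** 2 for i in range(3) for j in range(3))
    return (iso, frob_sq)

# Hypothetical low-level QM shielding tensor in ppm (made-up numbers).
SIGMA_LOW = [[30.1, 1.2, -0.4],
             [0.9, 28.7, 2.1],
             [-0.3, 1.8, 33.5]]

# An arbitrary rigid rotation of the whole molecule.
ROT = matmul(rotation_z(0.7), rotation_x(-1.3))
SIGMA_ROTATED = rotate_tensor(SIGMA_LOW, ROT)

if __name__ == "__main__":
    print("raw xx component:", SIGMA_LOW[0][0], "->", SIGMA_ROTATED[0][0])
    print("invariants before:", invariant_features(SIGMA_LOW))
    print("invariants after: ", invariant_features(SIGMA_ROTATED))
```

The raw tensor components change when the molecule is rotated, so a model fed those components directly would have to learn the invariance from augmented copies of the data. The invariant features agree up to floating-point rounding, so the invariance holds by construction.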
