Suppr超能文献

一种从常规核磁共振谱进行自动结构解析的框架。

A framework for automated structure elucidation from routine NMR spectra.

作者信息

Huang Zhaorui, Chen Michael S, Woroch Cristian P, Markland Thomas E, Kanan Matthew W

机构信息

Department of Chemistry, Stanford University Stanford CA 94305 USA

出版信息

Chem Sci. 2021 Nov 9;12(46):15329-15338. doi: 10.1039/d1sc04105c. eCollection 2021 Dec 1.

Abstract

Methods to automate structure elucidation that can be applied broadly across chemical structure space have the potential to greatly accelerate chemical discovery. NMR spectroscopy is the most widely used and arguably the most powerful method for elucidating structures of organic molecules. Here we introduce a machine learning (ML) framework that provides a quantitative probabilistic ranking of the most likely structural connectivity of an unknown compound when given routine, experimental one dimensional H and/or C NMR spectra. In particular, our ML-based algorithm takes input NMR spectra and (i) predicts the presence of specific substructures out of hundreds of substructures it has learned to identify; (ii) annotates the spectrum to label peaks with predicted substructures; and (iii) uses the substructures to construct candidate constitutional isomers and assign to them a probabilistic ranking. Using experimental spectra and molecular formulae for molecules containing up to 10 non-hydrogen atoms, the correct constitutional isomer was the highest-ranking prediction made by our model in 67.4% of the cases and one of the top-ten predictions in 95.8% of the cases. This advance will aid in solving the structure of unknown compounds, and thus further the development of automated structure elucidation tools that could enable the creation of fully autonomous reaction discovery platforms.

摘要

能够广泛应用于化学结构空间的自动化结构解析方法,有潜力极大地加速化学发现。核磁共振光谱法是用于解析有机分子结构的最广泛使用且 arguably 最强大的方法。在此,我们介绍一种机器学习(ML)框架,当给出常规的一维氢和/或碳核磁共振光谱时,该框架能对未知化合物最可能的结构连接性提供定量概率排名。特别地,我们基于机器学习的算法接收输入的核磁共振光谱,并且(i)从其已学会识别的数百种子结构中预测特定子结构的存在;(ii)对光谱进行注释,用预测的子结构标记峰;以及(iii)使用这些子结构构建候选构造异构体并为它们分配概率排名。对于含有多达10个非氢原子的分子,使用实验光谱和分子式,在67.4%的情况下,正确的构造异构体是我们模型做出的排名最高的预测,在95.8%的情况下是前十的预测之一。这一进展将有助于解析未知化合物的结构,从而推动自动化结构解析工具的进一步发展,这些工具能够实现完全自主的反应发现平台的创建。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b17e/8635205/4800970064b6/d1sc04105c-f1.jpg

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验