• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

条件分子生成网络实现基于碳核磁共振光谱和先验知识的自动结构解析。

Conditional Molecular Generation Net Enables Automated Structure Elucidation Based on C NMR Spectra and Prior Knowledge.

作者信息

Yao Lin, Yang Minjian, Song Jianfei, Yang Zhuo, Sun Hanyu, Shi Hui, Liu Xue, Ji Xiangyang, Deng Yafeng, Wang Xiaojian

机构信息

CarbonSilicon AI Technology Co., Ltd., Beijing 100080, China.

State Key Laboratory of Bioactive Substances and Functions of Natural Medicines, Institute of Materia Medica, Peking Union Medical College and Chinese Academy of Medical Sciences, Beijing 100050, China.

出版信息

Anal Chem. 2023 Mar 28;95(12):5393-5401. doi: 10.1021/acs.analchem.2c05817. Epub 2023 Mar 16.

DOI:10.1021/acs.analchem.2c05817
PMID:36926883
Abstract

Structure elucidation of unknown compounds based on nuclear magnetic resonance (NMR) remains a challenging problem in both synthetic organic and natural product chemistry. Library matching has been an efficient method to assist structure elucidation. However, it is limited by the coverage of libraries. In addition, prior knowledge such as molecular fragments is neglected. To solve the problem, we propose a conditional molecular generation net (CMGNet) to allow input of multiple sources of information. CMGNet not only uses C NMR spectrum data as input but molecular formulas and fragments of molecules are also employed as input conditions. Our model applies large-scale pretraining for molecular understanding and fine-tuning on two NMR spectral data sets of different granularity levels to accommodate structure elucidation tasks. CMGNet generates structures based on C NMR data, molecular formula, and fragment information, with a recovery rate of 94.17% in the top 10 recommendations. In addition, the generative model performed well in the generation of various classes of compounds and in the structural revision task. CMGNet has a deep understanding of molecular connectivities from C NMR, molecular formula, and fragments, paving the way for a new paradigm of deep learning-assisted inverse problem-solving.

摘要

基于核磁共振(NMR)对未知化合物进行结构解析,在合成有机化学和天然产物化学领域仍然是一个具有挑战性的问题。库匹配一直是辅助结构解析的有效方法。然而,它受到库覆盖范围的限制。此外,诸如分子片段等先验知识被忽视。为了解决这个问题,我们提出了一种条件分子生成网络(CMGNet),以允许输入多种信息源。CMGNet不仅将碳核磁共振谱数据作为输入,还将分子式和分子片段用作输入条件。我们的模型对分子理解进行大规模预训练,并在两个不同粒度级别的核磁共振光谱数据集上进行微调,以适应结构解析任务。CMGNet基于碳核磁共振数据、分子式和片段信息生成结构,在前10个推荐结果中的回收率为94.17%。此外,生成模型在各类化合物的生成以及结构修正任务中表现良好。CMGNet从碳核磁共振、分子式和片段中对分子连接性有深入理解,为深度学习辅助解决逆问题的新范式铺平了道路。

相似文献

1
Conditional Molecular Generation Net Enables Automated Structure Elucidation Based on C NMR Spectra and Prior Knowledge.条件分子生成网络实现基于碳核磁共振光谱和先验知识的自动结构解析。
Anal Chem. 2023 Mar 28;95(12):5393-5401. doi: 10.1021/acs.analchem.2c05817. Epub 2023 Mar 16.
2
Cross-Modal Retrieval Between C NMR Spectra and Structures Based on Focused Libraries.基于聚焦库的 C NMR 光谱和结构的跨模态检索。
Anal Chem. 2024 Apr 16;96(15):5763-5770. doi: 10.1021/acs.analchem.3c04294. Epub 2024 Apr 2.
3
Spec2D: a structure elucidation system based on 1H NMR and H-H COSY spectra in organic chemistry.Spec2D:一种基于有机化学中¹H NMR和H-H COSY光谱的结构解析系统。
J Chem Inf Model. 2006 Mar-Apr;46(2):775-87. doi: 10.1021/ci0502810.
4
Structure Elucidator: a versatile expert system for molecular structure elucidation from 1D and 2D NMR data and molecular fragments.结构解析器:一款用于从一维和二维核磁共振数据及分子片段解析分子结构的多功能专家系统。
J Chem Inf Comput Sci. 2004 May-Jun;44(3):771-92. doi: 10.1021/ci0341060.
5
Building blocks for automated elucidation of metabolites: natural product-likeness for candidate ranking.用于代谢物自动推导的构建模块:候选物排序的天然产物似性。
BMC Bioinformatics. 2014 Jul 5;15:234. doi: 10.1186/1471-2105-15-234.
6
A framework for automated structure elucidation from routine NMR spectra.一种从常规核磁共振谱进行自动结构解析的框架。
Chem Sci. 2021 Nov 9;12(46):15329-15338. doi: 10.1039/d1sc04105c. eCollection 2021 Dec 1.
7
Enhancing Chemical Reaction Monitoring with a Deep Learning Model for NMR Spectra Image Matching to Target Compounds.利用深度学习模型增强化学反应该监测,实现 NMR 谱图像匹配目标化合物。
J Chem Inf Model. 2024 Jul 22;64(14):5624-5633. doi: 10.1021/acs.jcim.4c00522. Epub 2024 Jul 9.
8
Application of INADEQUATE NMR techniques for directly tracing out the carbon skeleton of a natural product.应用不充分的 NMR 技术直接追踪天然产物的碳骨架。
Phytochem Anal. 2021 Jan;32(1):7-23. doi: 10.1002/pca.2976. Epub 2020 Jul 15.
9
NMR Calculations with Quantum Methods: Development of New Tools for Structural Elucidation and Beyond.NMR 计算的量子方法:结构解析及超越的新工具的发展。
Acc Chem Res. 2020 Sep 15;53(9):1922-1932. doi: 10.1021/acs.accounts.0c00365. Epub 2020 Aug 14.
10
Translational Metabolomics of Head Injury: Exploring Dysfunctional Cerebral Metabolism with Ex Vivo NMR Spectroscopy-Based Metabolite Quantification头部损伤的转化代谢组学:基于体外核磁共振波谱的代谢物定量分析探索脑代谢功能障碍

引用本文的文献

1
Automated Determination of the Molecular Substructure from Nuclear Magnetic Resonance Spectra Using Neural Networks.使用神经网络从核磁共振光谱自动确定分子亚结构
J Chem Inf Model. 2025 Aug 25;65(16):8435-8447. doi: 10.1021/acs.jcim.5c00499. Epub 2025 Aug 13.
2
NMRExtractor: leveraging large language models to construct an experimental NMR database from open-source scientific publications.NMRExtractor:利用大语言模型从开源科学出版物构建实验性核磁共振数据库。
Chem Sci. 2025 May 28. doi: 10.1039/d4sc08802f.
3
Synthetic Biology in Natural Product Biosynthesis.
天然产物生物合成中的合成生物学
Chem Rev. 2025 Apr 9;125(7):3814-3931. doi: 10.1021/acs.chemrev.4c00567. Epub 2025 Mar 21.
4
Accurate and Efficient Structure Elucidation from Routine One-Dimensional NMR Spectra Using Multitask Machine Learning.使用多任务机器学习从常规一维核磁共振谱中进行准确高效的结构解析
ACS Cent Sci. 2024 Nov 13;10(11):2162-2170. doi: 10.1021/acscentsci.4c01132. eCollection 2024 Nov 27.
5
Deductive machine learning models for product identification.用于产品识别的演绎机器学习模型。
Chem Sci. 2024 Jul 1;15(30):11995-12005. doi: 10.1039/d3sc04909d. eCollection 2024 Jul 31.