• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

用于产品识别的演绎机器学习模型。

Deductive machine learning models for product identification.

作者信息

Jin Tianfan, Zhao Qiyuan, Schofield Andrew B, Savoie Brett M

机构信息

Department of Chemical Engineering, Purdue University West Lafayette USA

出版信息

Chem Sci. 2024 Jul 1;15(30):11995-12005. doi: 10.1039/d3sc04909d. eCollection 2024 Jul 31.

DOI:10.1039/d3sc04909d
PMID:39092129
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11290435/
Abstract

Deductive solution strategies are required in prediction scenarios that are under determined, when contradictory information is available, or more generally wherever one-to-many non-functional mappings occur. In contrast, most contemporary machine learning (ML) in the chemical sciences is inductive learning from example, with a fixed set of features. Chemical workflows are replete with situations requiring deduction, including many aspects of lab automation and spectral interpretation. Here, a general strategy is described for designing and training machine learning models capable of deduction that consists of combining individual inductive models into a larger deductive network. The training and testing of these models is demonstrated on the task of deducing reaction products from a mixture of spectral sources. The resulting models can distinguish between intended and unintended reaction outcomes and identify starting material based on a mixture of spectral sources. The models also perform well on tasks that they were not directly trained on, like performing structural inference using real rather than simulated spectral inputs, predicting minor products from named organic chemistry reactions, identifying reagents and isomers as plausible impurities, and handling missing or conflicting information. A new dataset of 1 124 043 simulated spectra that were generated to train these models is also distributed with this work. These findings demonstrate that deductive bottlenecks for chemical problems are not fundamentally insuperable for ML models.

摘要

在预测场景中,如果信息不完整、存在矛盾信息,或者更普遍地说,在出现一对多非功能映射的任何地方,都需要演绎求解策略。相比之下,化学科学中的大多数当代机器学习(ML)都是从示例中进行归纳学习,具有一组固定的特征。化学工作流程中充满了需要演绎的情况,包括实验室自动化和光谱解释的许多方面。在这里,描述了一种用于设计和训练能够进行演绎的机器学习模型的通用策略,该策略包括将单个归纳模型组合成一个更大的演绎网络。在从光谱源混合物中推断反应产物的任务上展示了这些模型的训练和测试。所得模型可以区分预期和非预期的反应结果,并根据光谱源混合物识别起始原料。这些模型在未直接训练的任务上也表现良好,例如使用真实而非模拟光谱输入进行结构推断、预测命名有机化学反应的次要产物、将试剂和异构体识别为可能的杂质以及处理缺失或冲突的信息。这项工作还分发了一个为训练这些模型而生成的包含1124043个模拟光谱的新数据集。这些发现表明,化学问题的演绎瓶颈对于ML模型来说并非根本无法克服。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5ce6/11290435/1a421f6af34f/d3sc04909d-f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5ce6/11290435/5adb1ff6af6c/d3sc04909d-f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5ce6/11290435/2acab71255bb/d3sc04909d-f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5ce6/11290435/1a421f6af34f/d3sc04909d-f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5ce6/11290435/5adb1ff6af6c/d3sc04909d-f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5ce6/11290435/2acab71255bb/d3sc04909d-f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5ce6/11290435/1a421f6af34f/d3sc04909d-f3.jpg

相似文献

1
Deductive machine learning models for product identification.用于产品识别的演绎机器学习模型。
Chem Sci. 2024 Jul 1;15(30):11995-12005. doi: 10.1039/d3sc04909d. eCollection 2024 Jul 31.
2
Deductive Machine Learning Challenges and Opportunities in Chemical Applications.化学应用中演绎式机器学习的挑战与机遇
Annu Rev Chem Biomol Eng. 2024 Apr 10. doi: 10.1146/annurev-chembioeng-100722-111917.
3
Reproducing Reaction Mechanisms with Machine-Learning Models Trained on a Large-Scale Mechanistic Dataset.使用在大规模机理数据集上训练的机器学习模型再现反应机理
Angew Chem Int Ed Engl. 2024 Oct 21;63(43):e202411296. doi: 10.1002/anie.202411296. Epub 2024 Sep 2.
4
Ensemble machine learning model trained on a new synthesized dataset generalizes well for stress prediction using wearable devices.在新合成数据集上训练的集成机器学习模型,对于使用可穿戴设备进行压力预测具有良好的泛化能力。
J Biomed Inform. 2023 Dec;148:104556. doi: 10.1016/j.jbi.2023.104556. Epub 2023 Dec 2.
5
Performance of a Computational Model of the Mammalian Olfactory System哺乳动物嗅觉系统计算模型的性能
6
Macromolecular crowding: chemistry and physics meet biology (Ascona, Switzerland, 10-14 June 2012).大分子拥挤现象:化学与物理邂逅生物学(瑞士阿斯科纳,2012年6月10日至14日)
Phys Biol. 2013 Aug;10(4):040301. doi: 10.1088/1478-3975/10/4/040301. Epub 2013 Aug 2.
7
Folic acid supplementation and malaria susceptibility and severity among people taking antifolate antimalarial drugs in endemic areas.在流行地区,服用抗叶酸抗疟药物的人群中,叶酸补充剂与疟疾易感性和严重程度的关系。
Cochrane Database Syst Rev. 2022 Feb 1;2(2022):CD014217. doi: 10.1002/14651858.CD014217.
8
Predicting Energetics Materials' Crystalline Density from Chemical Structure by Machine Learning.通过机器学习从化学结构预测能质材料的结晶密度。
J Chem Inf Model. 2021 May 24;61(5):2147-2158. doi: 10.1021/acs.jcim.0c01318. Epub 2021 Apr 26.
9
[Standard technical specifications for methacholine chloride (Methacholine) bronchial challenge test (2023)].[氯化乙酰甲胆碱支气管激发试验标准技术规范(2023年)]
Zhonghua Jie He He Hu Xi Za Zhi. 2024 Feb 12;47(2):101-119. doi: 10.3760/cma.j.cn112147-20231019-00247.
10
The Brain Networks Basis for Deductive and Inductive Reasoning: A Functional Magnetic Resonance Imaging Study.演绎推理和归纳推理的脑网络基础:一项功能磁共振成像研究
Basic Clin Neurosci. 2023 Jul-Aug;14(4):529-542. doi: 10.32598/bcn.2022.3752.3. Epub 2023 Jul 1.

本文引用的文献

1
Assessment of chemistry knowledge in large language models that generate code.对生成代码的大语言模型中的化学知识评估。
Digit Discov. 2023 Jan 26;2(2):368-376. doi: 10.1039/d2dd00087c. eCollection 2023 Apr 11.
2
Generative Models as an Emerging Paradigm in the Chemical Sciences.生成模型在化学科学中的新兴范例。
J Am Chem Soc. 2023 Apr 26;145(16):8736-8750. doi: 10.1021/jacs.2c13467. Epub 2023 Apr 13.
3
Automatic materials characterization from infrared spectra using convolutional neural networks.使用卷积神经网络从红外光谱中进行自动材料表征。
Chem Sci. 2023 Feb 23;14(13):3600-3609. doi: 10.1039/d2sc05892h. eCollection 2023 Mar 29.
4
Transformer Performance for Chemical Reactions: Analysis of Different Predictive and Evaluation Scenarios.Transformer 模型在化学反应预测中的性能表现:不同预测和评估场景的分析。
J Chem Inf Model. 2023 Apr 10;63(7):1914-1924. doi: 10.1021/acs.jcim.2c01407. Epub 2023 Mar 23.
5
Conditional Molecular Generation Net Enables Automated Structure Elucidation Based on C NMR Spectra and Prior Knowledge.条件分子生成网络实现基于碳核磁共振光谱和先验知识的自动结构解析。
Anal Chem. 2023 Mar 28;95(12):5393-5401. doi: 10.1021/acs.analchem.2c05817. Epub 2023 Mar 16.
6
Computer-aided key step generation in alkaloid total synthesis.生物碱全合成中计算机辅助关键步骤的生成
Science. 2023 Feb 3;379(6631):453-457. doi: 10.1126/science.ade8459. Epub 2023 Feb 2.
7
Rapid Approximate Subset-Based Spectra Prediction for Electron Ionization-Mass Spectrometry.基于快速近似子集的电子电离质谱谱预测。
Anal Chem. 2023 Feb 7;95(5):2653-2663. doi: 10.1021/acs.analchem.2c02093. Epub 2023 Jan 25.
8
Machine-Learning-Guided Discovery of Electrochemical Reactions.机器学习指导下的电化学反应发现。
J Am Chem Soc. 2022 Dec 14;144(49):22599-22610. doi: 10.1021/jacs.2c08997. Epub 2022 Dec 2.
9
On scientific understanding with artificial intelligence.论人工智能辅助下的科学理解
Nat Rev Phys. 2022;4(12):761-769. doi: 10.1038/s42254-022-00518-3. Epub 2022 Oct 11.
10
An autonomous portable platform for universal chemical synthesis.一种用于通用化学合成的自主便携式平台。
Nat Chem. 2022 Nov;14(11):1311-1318. doi: 10.1038/s41557-022-01016-w. Epub 2022 Oct 6.