Suppr
超能文献

增强化学合成：用于预测可行反应条件的两阶段深度神经网络。

Enhancing chemical synthesis: a two-stage deep neural network for predicting feasible reaction conditions.

作者信息

Chen Lung-Yi, Li Yi-Pei

机构信息

Department of Chemical Engineering, National Taiwan University, No. 1, Sec. 4, Roosevelt Road, Taipei, 10617, Taiwan.

Taiwan International Graduate Program on Sustainable Chemical Science and Technology (TIGP-SCST), No. 128, Sec. 2, Academia Road, Taipei, 11529, Taiwan.

出版信息

J Cheminform. 2024 Jan 24;16(1):11. doi: 10.1186/s13321-024-00805-4.

DOI:10.1186/s13321-024-00805-4

PMID:38268009

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11301986/

Abstract

In the field of chemical synthesis planning, the accurate recommendation of reaction conditions is essential for achieving successful outcomes. This work introduces an innovative deep learning approach designed to address the complex task of predicting appropriate reagents, solvents, and reaction temperatures for chemical reactions. Our proposed methodology combines a multi-label classification model with a ranking model to offer tailored reaction condition recommendations based on relevance scores derived from anticipated product yields. To tackle the challenge of limited data for unfavorable reaction contexts, we employed the technique of hard negative sampling to generate reaction conditions that might be mistakenly classified as suitable, forcing the model to refine its decision boundaries, especially in challenging cases. Our developed model excels in proposing conditions where an exact match to the recorded solvents and reagents is found within the top-10 predictions 73% of the time. It also predicts temperatures within ± 20 [Formula: see text] of the recorded temperature in 89% of test cases. Notably, the model demonstrates its capacity to recommend multiple viable reaction conditions, with accuracy varying based on the availability of condition records associated with each reaction. What sets this model apart is its ability to suggest alternative reaction conditions beyond the constraints of the dataset. This underscores its potential to inspire innovative approaches in chemical research, presenting a compelling opportunity for advancing chemical synthesis planning and elevating the field of reaction engineering. Scientific contribution: The combination of multi-label classification and ranking models provides tailored recommendations for reaction conditions based on the reaction yields. A novel approach is presented to address the issue of data scarcity in negative reaction conditions through data augmentation.

摘要

在化学合成规划领域，准确推荐反应条件对于取得成功结果至关重要。这项工作引入了一种创新的深度学习方法，旨在解决预测化学反应合适试剂、溶剂和反应温度这一复杂任务。我们提出的方法将多标签分类模型与排序模型相结合，根据预期产物收率得出的相关性分数提供定制的反应条件推荐。为应对不利反应情境下数据有限的挑战，我们采用了硬负采样技术来生成可能被错误分类为合适的反应条件，迫使模型细化其决策边界，尤其是在具有挑战性的情况下。我们开发的模型在提出条件方面表现出色，在前10个预测中有73%的时间能找到与记录的溶剂和试剂完全匹配的情况。在89%的测试案例中，它还能将温度预测在记录温度的±20[公式：见原文]范围内。值得注意的是，该模型展示了推荐多种可行反应条件的能力，其准确性因与每个反应相关的条件记录的可用性而异。该模型的独特之处在于它能够在数据集的限制之外建议替代反应条件。这凸显了其在化学研究中激发创新方法的潜力，为推进化学合成规划和提升反应工程领域提供了一个引人注目的机会。科学贡献：多标签分类和排序模型的结合基于反应产率为反应条件提供定制推荐。提出了一种通过数据增强来解决负面反应条件下数据稀缺问题的新方法。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/37dc/11301986/a43f6d70cfb9/13321_2024_805_Fig1_HTML.jpg

相似文献

Enhancing chemical synthesis: a two-stage deep neural network for predicting feasible reaction conditions.

J Cheminform. 2024 Jan 24;16(1):11. doi: 10.1186/s13321-024-00805-4.

Folic acid supplementation and malaria susceptibility and severity among people taking antifolate antimalarial drugs in endemic areas.

Cochrane Database Syst Rev. 2022 Feb 1;2(2022):CD014217. doi: 10.1002/14651858.CD014217.

Generic Interpretable Reaction Condition Predictions with Open Reaction Condition Datasets and Unsupervised Learning of Reaction Center.

Research (Wash D C). 2023 Oct 16;6:0231. doi: 10.34133/research.0231. eCollection 2023.

Deep convolutional neural network and IoT technology for healthcare.

Digit Health. 2024 Jan 17;10:20552076231220123. doi: 10.1177/20552076231220123. eCollection 2024 Jan-Dec.

Leveraging auxiliary measures: a deep multi-task neural network for predictive modeling in clinical research.

BMC Med Inform Decis Mak. 2018 Dec 12;18(Suppl 4):126. doi: 10.1186/s12911-018-0676-9.

Macromolecular crowding: chemistry and physics meet biology (Ascona, Switzerland, 10-14 June 2012).

Phys Biol. 2013 Aug;10(4):040301. doi: 10.1088/1478-3975/10/4/040301. Epub 2013 Aug 2.

Planning Implications Related to Sterilization-Sensitive Science Investigations Associated with Mars Sample Return (MSR).

Astrobiology. 2022 Jun;22(S1):S112-S164. doi: 10.1089/AST.2021.0113. Epub 2022 May 19.

The future of Cochrane Neonatal.

Early Hum Dev. 2020 Nov;150:105191. doi: 10.1016/j.earlhumdev.2020.105191. Epub 2020 Sep 12.

Molecular Machine Learning for Chemical Catalysis: Prospects and Challenges.

Acc Chem Res. 2023 Feb 7;56(3):402-412. doi: 10.1021/acs.accounts.2c00801. Epub 2023 Jan 30.

Generative Modeling to Predict Multiple Suitable Conditions for Chemical Reactions.

J Chem Inf Model. 2022 Dec 12;62(23):5952-5960. doi: 10.1021/acs.jcim.2c01085. Epub 2022 Nov 22.

引用本文的文献

BatGPT-Chem: A Foundation Large Model for Chemical Engineering.

Research (Wash D C). 2025 Sep 10;8:0827. doi: 10.34133/research.0827. eCollection 2025.

Negative chemical data boosts language models in reaction outcome prediction.

Sci Adv. 2025 Jun 13;11(24):eadt5578. doi: 10.1126/sciadv.adt5578.

Enhancing chemical reaction search through contrastive representation learning and human-in-the-loop.

J Cheminform. 2025 Apr 10;17(1):51. doi: 10.1186/s13321-025-00987-5.

AutoTemplate: enhancing chemical reaction datasets for machine learning applications in organic chemistry.

J Cheminform. 2024 Jun 27;16(1):74. doi: 10.1186/s13321-024-00869-2.

本文引用的文献

Data Sharing in Chemistry: Lessons Learned and a Case for Mandating Structured Reaction Data.

J Chem Inf Model. 2023 Jul 24;63(14):4253-4265. doi: 10.1021/acs.jcim.3c00607. Epub 2023 Jul 5.

Negative Data in Data Sets for Machine Learning Training.

J Org Chem. 2023 May 5;88(9):5239-5241. doi: 10.1021/acs.joc.3c00844. Epub 2023 Apr 26.

Reagent prediction with a molecular transformer improves reaction data quality.

Chem Sci. 2023 Mar 1;14(12):3235-3246. doi: 10.1039/d2sc06798f. eCollection 2023 Mar 22.

Explainable uncertainty quantifications for deep learning-based molecular property prediction.

J Cheminform. 2023 Feb 3;15(1):13. doi: 10.1186/s13321-023-00682-3.

Generative Modeling to Predict Multiple Suitable Conditions for Chemical Reactions.

J Chem Inf Model. 2022 Dec 12;62(23):5952-5960. doi: 10.1021/acs.jcim.2c01085. Epub 2022 Nov 22.

Deep Learning-Based Increment Theory for Formation Enthalpy Predictions.

J Phys Chem A. 2022 Oct 20;126(41):7548-7556. doi: 10.1021/acs.jpca.2c04848. Epub 2022 Oct 11.

Artificial Intelligence, Machine Learning, and Deep Learning in Real-Life Drug Design Cases.

Methods Mol Biol. 2022;2390:383-407. doi: 10.1007/978-1-0716-1787-8_16.

The Open Reaction Database.

J Am Chem Soc. 2021 Nov 17;143(45):18820-18826. doi: 10.1021/jacs.1c09820. Epub 2021 Nov 2.

In silico, in vitro, and in vivo machine learning in synthetic biology and metabolic engineering.

Curr Opin Chem Biol. 2021 Dec;65:85-92. doi: 10.1016/j.cbpa.2021.06.002. Epub 2021 Jul 16.

Recent advances in drug repurposing using machine learning.

Curr Opin Chem Biol. 2021 Dec;65:74-84. doi: 10.1016/j.cbpa.2021.06.001. Epub 2021 Jul 16.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

Suppr超能文献

增强化学合成：用于预测可行反应条件的两阶段深度神经网络。

Enhancing chemical synthesis: a two-stage deep neural network for predicting feasible reaction conditions.

作者信息

机构信息

出版信息

相似文献

引用本文的文献

本文引用的文献

文献AI研究员

用中文搜PubMed

文档翻译