使用机器学习预测有机反应结果

Prediction of Organic Reaction Outcomes Using Machine Learning.

作者信息

Coley Connor W, Barzilay Regina, Jaakkola Tommi S, Green William H, Jensen Klavs F

机构信息

Department of Chemical Engineering and Computer Science and Artificial Intelligence Laboratory, Massachusetts Institute of Technology, 77 Massachusetts Avenue, Cambridge, Massachusetts 02139, United States.

出版信息

ACS Cent Sci. 2017 May 24;3(5):434-443. doi: 10.1021/acscentsci.7b00064. Epub 2017 Apr 18.

DOI:10.1021/acscentsci.7b00064

PMID:28573205

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC5445544/

Abstract

Computer assistance in synthesis design has existed for over 40 years, yet retrosynthesis planning software has struggled to achieve widespread adoption. One critical challenge in developing high-quality pathway suggestions is that proposed reaction steps often fail when attempted in the laboratory, despite initially seeming viable. The true measure of success for any synthesis program is whether the predicted outcome matches what is observed experimentally. We report a model framework for anticipating reaction outcomes that combines the traditional use of reaction templates with the flexibility in pattern recognition afforded by neural networks. Using 15 000 experimental reaction records from granted United States patents, a model is trained to select the major (recorded) product by ranking a self-generated list of candidates where one candidate is known to be the major product. Candidate reactions are represented using a unique edit-based representation that emphasizes the fundamental transformation from reactants to products, rather than the constituent molecules' overall structures. In a 5-fold cross-validation, the trained model assigns the major product rank 1 in 71.8% of cases, rank ≤3 in 86.7% of cases, and rank ≤5 in 90.8% of cases.

摘要

计算机辅助合成设计已经存在了40多年，但逆合成规划软件一直难以得到广泛应用。开发高质量反应路径建议的一个关键挑战是，尽管最初看似可行，但所提出的反应步骤在实验室中尝试时往往会失败。任何合成程序成功的真正衡量标准是预测结果是否与实验观察结果相符。我们报告了一个预测反应结果的模型框架，该框架将反应模板的传统用法与神经网络在模式识别方面的灵活性相结合。使用来自美国授权专利的15000条实验反应记录，训练一个模型，通过对一个自行生成的候选列表进行排序来选择主要（记录的）产物，其中一个候选产物已知是主要产物。候选反应使用一种独特的基于编辑的表示法来表示，这种表示法强调从反应物到产物的基本转化，而不是组成分子的整体结构。在五折交叉验证中，训练后的模型在71.8%的情况下将主要产物排在第1位，在86.7%的情况下排在≤第3位，在90.8%的情况下排在≤第5位。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d8bc/5445544/b77b3b8a2194/oc-2017-00064k_0001.jpg

相似文献

Prediction of Organic Reaction Outcomes Using Machine Learning.使用机器学习预测有机反应结果

ACS Cent Sci. 2017 May 24;3(5):434-443. doi: 10.1021/acscentsci.7b00064. Epub 2017 Apr 18.

Machine Learning in Computer-Aided Synthesis Planning.计算机辅助合成规划中的机器学习

Acc Chem Res. 2018 May 15;51(5):1281-1289. doi: 10.1021/acs.accounts.8b00087. Epub 2018 May 1.

RetroRanker: leveraging reaction changes to improve retrosynthesis prediction through re-ranking.RetroRanker：利用反应变化通过重新排序改进逆合成预测。

J Cheminform. 2023 Jun 8;15(1):58. doi: 10.1186/s13321-023-00727-7.

Influence of Template Size, Canonicalization, and Exclusivity for Retrosynthesis and Reaction Prediction Applications.模板大小、规范化和专属性对逆合成和反应预测应用的影响。

J Chem Inf Model. 2022 Jan 10;62(1):16-26. doi: 10.1021/acs.jcim.1c01192. Epub 2021 Dec 23.

Neural-Symbolic Machine Learning for Retrosynthesis and Reaction Prediction.用于逆合成和反应预测的神经符号机器学习

Chemistry. 2017 May 2;23(25):5966-5971. doi: 10.1002/chem.201605499. Epub 2017 Feb 22.

Retrosynthesis prediction using an end-to-end graph generative architecture for molecular graph editing.基于端到端图生成架构的分子图编辑回溯合成预测。

Nat Commun. 2023 May 25;14(1):3009. doi: 10.1038/s41467-023-38851-5.

Molecular Transformer unifies reaction prediction and retrosynthesis across pharma chemical space.分子变换统一了药物化学空间中的反应预测和反合成。

Chem Commun (Camb). 2019 Oct 8;55(81):12152-12155. doi: 10.1039/c9cc05122h.

Improving the performance of models for one-step retrosynthesis through re-ranking.通过重新排序提高一步逆合成模型的性能。

J Cheminform. 2022 Mar 15;14(1):15. doi: 10.1186/s13321-022-00594-8.

Chem Sci. 2022 Apr 26;13(20):6039-6053. doi: 10.1039/d2sc01588a. eCollection 2022 May 25.

RetroComposer: Composing Templates for Template-Based Retrosynthesis Prediction.RetroComposer：基于模板的反合成预测的模板作曲。

Biomolecules. 2022 Sep 19;12(9):1325. doi: 10.3390/biom12091325.

引用本文的文献

Enhancing deep chemical reaction prediction with advanced chirality and fragment representation.利用先进的手性和片段表示法增强深度化学反应预测。

Chem Commun (Camb). 2025 Sep 11. doi: 10.1039/d5cc02641e.

ReactionT5: a pre-trained transformer model for accurate chemical reaction prediction with limited data.反应T5：一种用于在数据有限的情况下进行准确化学反应预测的预训练变压器模型。

J Cheminform. 2025 Aug 19;17(1):126. doi: 10.1186/s13321-025-01075-4.

Does Hessian Data Improve the Performance of Machine Learning Potentials?黑森数据能否提高机器学习势的性能？

J Chem Theory Comput. 2025 Jul 22;21(14):6698-6710. doi: 10.1021/acs.jctc.5c00402. Epub 2025 Jul 2.

Enhancing Monte Carlo Tree Search for Retrosynthesis.增强蒙特卡洛树搜索用于逆合成分析

J Chem Inf Model. 2025 Jul 14;65(13):6537-6546. doi: 10.1021/acs.jcim.5c00417. Epub 2025 Jun 13.

ASKCOS: Open-Source, Data-Driven Synthesis Planning.ASKCOS：开源、数据驱动的合成规划。

Acc Chem Res. 2025 Jun 3;58(11):1764-1775. doi: 10.1021/acs.accounts.5c00155. Epub 2025 May 21.

"Amide - amine + alcohol = carboxylic acid." chemical reactions as linear algebraic analogies in graph neural networks.“酰胺 - 胺 + 醇 = 羧酸。” 作为图神经网络中线性代数类比的化学反应。

Chem Sci. 2025 Apr 23. doi: 10.1039/d4sc05655h.

Deep Learning Reaction Framework (DLRN) for kinetic modeling of time-resolved data.用于时间分辨数据动力学建模的深度学习反应框架（DLRN）。

Commun Chem. 2025 May 15;8(1):153. doi: 10.1038/s42004-025-01541-y.

Generalizable, fast, and accurate DeepQSPR with fastprop.具有快速传播的可推广、快速且准确的深度定量构效关系模型。

J Cheminform. 2025 May 13;17(1):73. doi: 10.1186/s13321-025-01013-4.

Generating diversity and securing completeness in algorithmic retrosynthesis.在算法逆合成中生成多样性并确保完整性。

J Cheminform. 2025 May 13;17(1):72. doi: 10.1186/s13321-025-00981-x.

A mini review on revolutionizing hydrogenation catalysis: unleashing transformative power of artificial intelligence.关于革新氢化催化的小型综述：释放人工智能的变革力量。

J Mol Model. 2025 Apr 30;31(5):152. doi: 10.1007/s00894-025-06376-x.

本文引用的文献

Neural-Symbolic Machine Learning for Retrosynthesis and Reaction Prediction.用于逆合成和反应预测的神经符号机器学习

Chemistry. 2017 May 2;23(25):5966-5971. doi: 10.1002/chem.201605499. Epub 2017 Feb 22.

Modelling Chemical Reasoning to Predict and Invent Reactions.建立化学推理模型以预测和发明反应。

Chemistry. 2017 May 2;23(25):6118-6128. doi: 10.1002/chem.201604556. Epub 2017 Jan 4.

Neural Networks for the Prediction of Organic Chemistry Reactions.用于预测有机化学反应的神经网络。

ACS Cent Sci. 2016 Oct 26;2(10):725-732. doi: 10.1021/acscentsci.6b00219. Epub 2016 Oct 14.

Molecular graph convolutions: moving beyond fingerprints.分子图卷积：超越指纹图谱

J Comput Aided Mol Des. 2016 Aug;30(8):595-608. doi: 10.1007/s10822-016-9938-8. Epub 2016 Aug 24.

Computer-Assisted Synthetic Planning: The End of the Beginning.计算机辅助综合规划：开端的终结。

Angew Chem Int Ed Engl. 2016 May 10;55(20):5904-37. doi: 10.1002/anie.201506101. Epub 2016 Apr 8.

ReactionMap: an efficient atom-mapping algorithm for chemical reactions.反应映射：化学反应的高效原子映射算法。

J Chem Inf Model. 2013 Nov 25;53(11):2812-9. doi: 10.1021/ci400326p. Epub 2013 Nov 11.

Algorithm for reaction classification.反应分类算法。

J Chem Inf Model. 2013 Nov 25;53(11):2884-95. doi: 10.1021/ci400442f. Epub 2013 Oct 25.

Computer-assisted mechanistic evaluation of organic reactions. 15. Heterocycle synthesis.有机反应的计算机辅助机理评估。15. 杂环合成。

J Org Chem. 1988 May 1;53(11):2504-20. doi: 10.1021/jo00246a020.

ReactionPredictor: prediction of complex chemical reactions at the mechanistic level using machine learning.ReactionPredictor：使用机器学习在机理水平上预测复杂化学反应。

J Chem Inf Model. 2012 Oct 22;52(10):2526-40. doi: 10.1021/ci3003039. Epub 2012 Oct 1.

Mining electronic laboratory notebooks: analysis, retrosynthesis, and reaction based enumeration.挖掘电子实验室笔记本：分析、逆合成和基于反应的枚举。

J Chem Inf Model. 2012 Jul 23;52(7):1745-56. doi: 10.1021/ci300116p. Epub 2012 Jun 15.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验

使用机器学习预测有机反应结果

Prediction of Organic Reaction Outcomes Using Machine Learning.

作者信息

机构信息

出版信息

相似文献

引用本文的文献

本文引用的文献

文献检索

文件翻译

深度研究

Suppr 超能文献

相似文献

引用本文的文献

本文引用的文献