自动化反应数据库与反应网络分析：利用化学信息学提取反应模板

Automated reaction database and reaction network analysis: extraction of reaction templates using cheminformatics.

作者信息

Plehiers Pieter P, Marin Guy B, Stevens Christian V, Van Geem Kevin M

机构信息

Laboratory for Chemical Technology, Department of Materials, Textiles and Chemical Engineering, Ghent University, Technologiepark 914, 9052, Ghent, Belgium.

SynBioC Research Group, Department of Sustainable Organic Chemistry and Technology, Faculty of Bioscience Engineering, Ghent University, Coupure Links 653, 9000, Ghent, Belgium.

出版信息

J Cheminform. 2018 Mar 9;10(1):11. doi: 10.1186/s13321-018-0269-8.

DOI:10.1186/s13321-018-0269-8

PMID:29524042

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC5845084/

Abstract

Both the automated generation of reaction networks and the automated prediction of synthetic trees require, in one way or another, the definition of possible transformations a molecule can undergo. One way of doing this is by using reaction templates. In view of the expanding amount of known reactions, it has become more and more difficult to envision all possible transformations that could occur in a studied system. Nonetheless, most reaction network generation tools rely on user-defined reaction templates. Not only does this limit the amount of chemistry that can be accounted for in the reaction networks, it also confines the wide-spread use of the tools by a broad public. In retrosynthetic analysis, the quality of the analysis depends on what percentage of the known chemistry is accounted for. Using databases to identify templates is therefore crucial in this respect. For this purpose, an algorithm has been developed to extract reaction templates from various types of chemical databases. Some databases such as the Kyoto Encyclopedia for Genes and Genomes and RMG do not report an atom-atom mapping (AAM) for the reactions. This makes the extraction of a template non-straightforward. If no mapping is available, it is calculated by the Reaction Decoder Tool (RDT). With a correct AAM-either calculated by RDT or specified-the algorithm consistently extracts a correct template for a wide variety of reactions, both elementary and non-elementary. The developed algorithm is a first step towards data-driven generation of synthetic trees or reaction networks, and a greater accessibility for non-expert users.

摘要

反应网络的自动生成和合成树的自动预测都需要以某种方式定义分子可能经历的转化。一种实现方法是使用反应模板。鉴于已知反应数量不断增加，设想研究系统中可能发生的所有转化变得越来越困难。尽管如此，大多数反应网络生成工具仍依赖用户定义的反应模板。这不仅限制了反应网络中可以考虑的化学内容量，还限制了广大公众对这些工具的广泛使用。在逆合成分析中，分析的质量取决于已知化学内容的占比。因此，在这方面使用数据库来识别模板至关重要。为此，已开发出一种算法，用于从各种类型的化学数据库中提取反应模板。一些数据库，如京都基因与基因组百科全书和反应分子生成器（RMG），并未报告反应的原子-原子映射（AAM）。这使得模板的提取并非易事。如果没有可用的映射，则由反应解码器工具（RDT）进行计算。有了正确的AAM（无论是由RDT计算得出还是指定的），该算法就能始终为各种反应（包括基元反应和非基元反应）提取正确的模板。所开发的算法是迈向数据驱动的合成树或反应网络生成以及提高非专业用户可及性的第一步。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9378/5845084/cff4873510f6/13321_2018_269_Fig1_HTML.jpg

相似文献

Automated reaction database and reaction network analysis: extraction of reaction templates using cheminformatics.自动化反应数据库与反应网络分析：利用化学信息学提取反应模板

J Cheminform. 2018 Mar 9;10(1):11. doi: 10.1186/s13321-018-0269-8.

AutoTemplate: enhancing chemical reaction datasets for machine learning applications in organic chemistry.自动模板：增强用于有机化学机器学习应用的化学反应数据集。

J Cheminform. 2024 Jun 27;16(1):74. doi: 10.1186/s13321-024-00869-2.

RDChiral: An RDKit Wrapper for Handling Stereochemistry in Retrosynthetic Template Extraction and Application.RDChiral：一个用于在逆合成模板提取和应用中处理立体化学的 RDKit 包装器。

J Chem Inf Model. 2019 Jun 24;59(6):2529-2537. doi: 10.1021/acs.jcim.9b00286. Epub 2019 Jun 13.

Data Augmentation and Pretraining for Template-Based Retrosynthetic Prediction in Computer-Aided Synthesis Planning.基于模板的回溯合成预测的计算机辅助合成规划中的数据增强和预训练。

J Chem Inf Model. 2020 Jul 27;60(7):3398-3407. doi: 10.1021/acs.jcim.0c00403. Epub 2020 Jul 5.

Influence of Template Size, Canonicalization, and Exclusivity for Retrosynthesis and Reaction Prediction Applications.模板大小、规范化和专属性对逆合成和反应预测应用的影响。

J Chem Inf Model. 2022 Jan 10;62(1):16-26. doi: 10.1021/acs.jcim.1c01192. Epub 2021 Dec 23.

Enhancing Retrosynthetic Reaction Prediction with Deep Learning Using Multiscale Reaction Classification.利用多尺度反应分类增强深度学习的逆合成反应预测

J Chem Inf Model. 2019 Feb 25;59(2):673-688. doi: 10.1021/acs.jcim.8b00801. Epub 2019 Feb 1.

Automatic retrosynthetic route planning using template-free models.使用无模板模型的自动逆合成路线规划。

Chem Sci. 2020 Mar 3;11(12):3355-3364. doi: 10.1039/c9sc03666k.

Automatic mapping of atoms across both simple and complex chemical reactions.自动映射简单和复杂化学反应中的原子。

Nat Commun. 2019 Mar 29;10(1):1434. doi: 10.1038/s41467-019-09440-2.

RMechDB: A Public Database of Elementary Radical Reaction Steps.RMechDB：基本自由基反应步骤公共数据库。

J Chem Inf Model. 2023 Feb 27;63(4):1114-1123. doi: 10.1021/acs.jcim.2c01359. Epub 2023 Feb 17.

Reaction Mechanism Generator v3.0: Advances in Automatic Mechanism Generation.反应机制生成器 v3.0：自动机制生成的进展。

J Chem Inf Model. 2021 Jun 28;61(6):2686-2696. doi: 10.1021/acs.jcim.0c01480. Epub 2021 May 28.

引用本文的文献

Discovery, design, and engineering of enzymes based on molecular retrobiosynthesis.基于分子逆向生物合成的酶的发现、设计与工程

mLife. 2025 Mar 28;4(2):107-125. doi: 10.1002/mlf2.70009. eCollection 2025 Apr.

Rxn-INSIGHT: fast chemical reaction analysis using bond-electron matrices.反应洞察：使用键电子矩阵进行快速化学反应分析。

J Cheminform. 2024 Mar 29;16(1):37. doi: 10.1186/s13321-024-00834-z.

EnzymeMap: curation, validation and data-driven prediction of enzymatic reactions.酶图谱：酶促反应的整理、验证及数据驱动预测

Chem Sci. 2023 Nov 22;14(48):14229-14242. doi: 10.1039/d3sc02048g. eCollection 2023 Dec 13.

Redesigning plant specialized metabolism with supervised machine learning using publicly available reactome data.利用公开可用的Reactome数据，通过监督式机器学习重新设计植物特殊代谢。

Comput Struct Biotechnol J. 2023 Jan 18;21:1639-1650. doi: 10.1016/j.csbj.2023.01.013. eCollection 2023.

Graph-Driven Reaction Discovery: Progress, Challenges, and Future Opportunities.基于图驱动的反应发现：进展、挑战与未来机遇

J Phys Chem A. 2022 Oct 13;126(40):7051-7069. doi: 10.1021/acs.jpca.2c06408. Epub 2022 Oct 3.

-Acetyl Migration within the Sialic Acid Side Chain: A Mechanistic Study Using the Nanoreactor.乙酰基在唾液酸侧链内的迁移：使用纳米反应器的机理研究。

Biochemistry. 2022 Sep 20;61(18):2007-2013. doi: 10.1021/acs.biochem.2c00343. Epub 2022 Sep 2.

Automated Exploration of Prebiotic Chemical Reaction Space: Progress and Perspectives.益生元化学反应空间的自动化探索：进展与展望

Life (Basel). 2021 Oct 26;11(11):1140. doi: 10.3390/life11111140.

EHreact: Extended Hasse Diagrams for the Extraction and Scoring of Enzymatic Reaction Templates.EHreact：用于提取和评分酶反应模板的扩展哈塞图。

J Chem Inf Model. 2021 Oct 25;61(10):4949-4961. doi: 10.1021/acs.jcim.1c00921. Epub 2021 Sep 29.

Quantum Chemistry Calculations for Metabolomics.代谢组学的量子化学计算。

Chem Rev. 2021 May 26;121(10):5633-5670. doi: 10.1021/acs.chemrev.0c00901. Epub 2021 May 12.

RetroBioCat as a computer-aided synthesis planning tool for biocatalytic reactions and cascades.RetroBioCat作为一种用于生物催化反应和级联反应的计算机辅助合成规划工具。

Nat Catal. 2021 Feb;4(2):98-104. doi: 10.1038/s41929-020-00556-z. Epub 2021 Jan 4.

本文引用的文献

Prediction of Organic Reaction Outcomes Using Machine Learning.使用机器学习预测有机反应结果

ACS Cent Sci. 2017 May 24;3(5):434-443. doi: 10.1021/acscentsci.7b00064. Epub 2017 Apr 18.

The computer-aided discovery of novel family of the 5-HT serotonin receptor ligands among derivatives of 4-benzyl-1,3,5-triazine.在 4-苄基-1,3,5-三嗪衍生物中，通过计算机辅助发现新型 5-HT 血清素受体配体家族。

Eur J Med Chem. 2017 Jul 28;135:117-124. doi: 10.1016/j.ejmech.2017.04.033. Epub 2017 Apr 13.

Modelling Chemical Reasoning to Predict and Invent Reactions.建立化学推理模型以预测和发明反应。

Chemistry. 2017 May 2;23(25):6118-6128. doi: 10.1002/chem.201604556. Epub 2017 Jan 4.

A Short Review of Chemical Reaction Database Systems, Computer-Aided Synthesis Design, Reaction Prediction and Synthetic Feasibility.化学反应数据库系统、计算机辅助合成设计、反应预测与合成可行性简述

Mol Inform. 2014 Jun;33(6-7):469-76. doi: 10.1002/minf.201400052. Epub 2014 Jun 2.

AMBIT-SMARTS: Efficient Searching of Chemical Structures and Fragments.AMBIT-SMARTS：高效的化学结构和片段搜索。

Mol Inform. 2011 Aug;30(8):707-20. doi: 10.1002/minf.201100028. Epub 2011 Aug 4.

Reaction Decoder Tool (RDT): extracting features from chemical reactions.反应解码器工具（RDT）：从化学反应中提取特征。

Bioinformatics. 2016 Jul 1;32(13):2065-6. doi: 10.1093/bioinformatics/btw096. Epub 2016 Feb 22.

Computer-Assisted Synthetic Planning: The End of the Beginning.计算机辅助综合规划：开端的终结。

Angew Chem Int Ed Engl. 2016 May 10;55(20):5904-37. doi: 10.1002/anie.201506101. Epub 2016 Apr 8.

Kinetics and Products of Vinyl + 1,3-Butadiene, a Potential Route to Benzene.乙烯基与1,3 - 丁二烯的反应动力学及产物：通往苯的一条潜在途径

J Phys Chem A. 2015 Jul 16;119(28):7325-38. doi: 10.1021/jp512705r. Epub 2015 May 12.

Atom mapping with constraint programming.基于约束规划的原子映射

Algorithms Mol Biol. 2014 Nov 29;9(1):23. doi: 10.1186/s13015-014-0023-3. eCollection 2014.

Organic synthesis: The robo-chemist.有机合成：机器人化学家。

Nature. 2014 Aug 7;512(7512):20-2. doi: 10.1038/512020a.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

自动化反应数据库与反应网络分析：利用化学信息学提取反应模板

Automated reaction database and reaction network analysis: extraction of reaction templates using cheminformatics.

作者信息

机构信息

出版信息

相似文献

引用本文的文献

本文引用的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

相似文献

引用本文的文献

本文引用的文献