• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

根对齐的简化分子线性输入规范(SMILES):一种用于化学反应预测的紧凑表示法。

Root-aligned SMILES: a tight representation for chemical reaction prediction.

作者信息

Zhong Zipeng, Song Jie, Feng Zunlei, Liu Tiantao, Jia Lingxiang, Yao Shaolun, Wu Min, Hou Tingjun, Song Mingli

机构信息

College of Computer Science and Technology, Zhejiang University Hangzhou 310027 P. R. China

School of Software Technology, Zhejiang University Ningbo 315048 P. R. China.

出版信息

Chem Sci. 2022 Jul 12;13(31):9023-9034. doi: 10.1039/d2sc02763a. eCollection 2022 Aug 10.

DOI:10.1039/d2sc02763a
PMID:36091202
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC9365080/
Abstract

Chemical reaction prediction, involving forward synthesis and retrosynthesis prediction, is a fundamental problem in organic synthesis. A popular computational paradigm formulates synthesis prediction as a sequence-to-sequence translation problem, where the typical SMILES is adopted for molecule representations. However, the general-purpose SMILES neglects the characteristics of chemical reactions, where the molecular graph topology is largely unaltered from reactants to products, resulting in the suboptimal performance of SMILES if straightforwardly applied. In this article, we propose the root-aligned SMILES (R-SMILES), which specifies a tightly aligned one-to-one mapping between the product and the reactant SMILES for more efficient synthesis prediction. Due to the strict one-to-one mapping and reduced edit distance, the computational model is largely relieved from learning the complex syntax and dedicated to learning the chemical knowledge for reactions. We compare the proposed R-SMILES with various state-of-the-art baselines and show that it significantly outperforms them all, demonstrating the superiority of the proposed method.

摘要

化学反应预测,包括正向合成预测和逆向合成预测,是有机合成中的一个基本问题。一种流行的计算范式将合成预测表述为一个序列到序列的翻译问题,其中分子表示采用典型的SMILES。然而,通用的SMILES忽略了化学反应的特征,即从反应物到产物分子图拓扑结构基本不变,直接应用时会导致SMILES性能次优。在本文中,我们提出了根对齐的SMILES(R-SMILES),它为产物和反应物SMILES指定了紧密对齐的一对一映射,以实现更高效的合成预测。由于严格的一对一映射和减小的编辑距离,计算模型在很大程度上无需学习复杂的语法,而专注于学习反应的化学知识。我们将提出的R-SMILES与各种最先进的基线进行比较,结果表明它显著优于所有基线,证明了所提方法的优越性。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3cd2/9365080/fb47b7fffd6a/d2sc02763a-f5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3cd2/9365080/209c1f75595d/d2sc02763a-f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3cd2/9365080/d6ff44f8498c/d2sc02763a-f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3cd2/9365080/d8b28b592ac3/d2sc02763a-f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3cd2/9365080/416c17b57e5e/d2sc02763a-f4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3cd2/9365080/fb47b7fffd6a/d2sc02763a-f5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3cd2/9365080/209c1f75595d/d2sc02763a-f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3cd2/9365080/d6ff44f8498c/d2sc02763a-f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3cd2/9365080/d8b28b592ac3/d2sc02763a-f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3cd2/9365080/416c17b57e5e/d2sc02763a-f4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3cd2/9365080/fb47b7fffd6a/d2sc02763a-f5.jpg

相似文献

1
Root-aligned SMILES: a tight representation for chemical reaction prediction.根对齐的简化分子线性输入规范(SMILES):一种用于化学反应预测的紧凑表示法。
Chem Sci. 2022 Jul 12;13(31):9023-9034. doi: 10.1039/d2sc02763a. eCollection 2022 Aug 10.
2
Ualign: pushing the limit of template-free retrosynthesis prediction with unsupervised SMILES alignment.Ualign:通过无监督的SMILES比对突破无模板逆合成预测的极限。
J Cheminform. 2024 Jul 15;16(1):80. doi: 10.1186/s13321-024-00877-2.
3
Molecular Transformer unifies reaction prediction and retrosynthesis across pharma chemical space.分子变换统一了药物化学空间中的反应预测和反合成。
Chem Commun (Camb). 2019 Oct 8;55(81):12152-12155. doi: 10.1039/c9cc05122h.
4
Permutation Invariant Graph-to-Sequence Model for Template-Free Retrosynthesis and Reaction Prediction.无模板回溯合成和反应预测的置换不变图到序列模型。
J Chem Inf Model. 2022 Aug 8;62(15):3503-3513. doi: 10.1021/acs.jcim.2c00321. Epub 2022 Jul 26.
5
CTsynther: Contrastive Transformer Model for End-to-End Retrosynthesis Prediction.CTsynther:用于端到端逆合成预测的对比变压器模型。
IEEE/ACM Trans Comput Biol Bioinform. 2024 Nov-Dec;21(6):2235-2245. doi: 10.1109/TCBB.2024.3455381. Epub 2024 Dec 10.
6
Transfer Learning: Making Retrosynthetic Predictions Based on a Small Chemical Reaction Dataset Scale to a New Level.迁移学习:基于小规模化学反应数据集的逆向合成预测扩展到新的水平。
Molecules. 2020 May 19;25(10):2357. doi: 10.3390/molecules25102357.
7
Substructure-based neural machine translation for retrosynthetic prediction.用于逆合成预测的基于子结构的神经机器翻译
J Cheminform. 2021 Jan 11;13(1):4. doi: 10.1186/s13321-020-00482-z.
8
Self-Supervised Contrastive Molecular Representation Learning with a Chemical Synthesis Knowledge Graph.基于化学合成知识图谱的自监督对比分子表示学习。
J Chem Inf Model. 2024 Mar 25;64(6):1945-1954. doi: 10.1021/acs.jcim.4c00157. Epub 2024 Mar 14.
9
Molecular Transformer: A Model for Uncertainty-Calibrated Chemical Reaction Prediction.分子变压器:一种用于不确定性校准化学反应预测的模型。
ACS Cent Sci. 2019 Sep 25;5(9):1572-1583. doi: 10.1021/acscentsci.9b00576. Epub 2019 Aug 30.
10
MultiGran-SMILES: multi-granularity SMILES learning for molecular property prediction.MultiGran-SMILES:用于分子性质预测的多粒度 SMILES 学习。
Bioinformatics. 2022 Sep 30;38(19):4573-4580. doi: 10.1093/bioinformatics/btac550.

引用本文的文献

1
Graph-sequence enhanced transformer for template-free prediction of natural product biosynthesis.用于天然产物生物合成无模板预测的图序列增强变压器
Patterns (N Y). 2025 Apr 30;6(8):101259. doi: 10.1016/j.patter.2025.101259. eCollection 2025 Aug 8.
2
HiCLR: Knowledge-Induced Hierarchical Contrastive Learning with Retrosynthesis Prediction Yields a Reaction Foundation Model.HiCLR:基于逆合成预测的知识诱导分层对比学习产生反应基础模型。
JACS Au. 2025 Jun 25;5(7):3140-3155. doi: 10.1021/jacsau.5c00289. eCollection 2025 Jul 28.
3
RSGPT: a generative transformer model for retrosynthesis planning pre-trained on ten billion datapoints.

本文引用的文献

1
Retrosynthetic reaction pathway prediction through neural machine translation of atomic environments.通过原子环境的神经机器翻译预测反合成反应途径。
Nat Commun. 2022 Mar 4;13(1):1186. doi: 10.1038/s41467-022-28857-w.
2
Deep Retrosynthetic Reaction Prediction using Local Reactivity and Global Attention.利用局部反应性和全局注意力进行深度逆合成反应预测
JACS Au. 2021 Aug 5;1(10):1612-1620. doi: 10.1021/jacsau.1c00246. eCollection 2021 Oct 25.
3
Molecule Edit Graph Attention Network: Modeling Chemical Reactions as Sequences of Graph Edits.
RSGPT:一种基于一百亿数据点进行预训练的用于逆合成规划的生成式变压器模型。
Nat Commun. 2025 Jul 31;16(1):7012. doi: 10.1038/s41467-025-62308-6.
4
: categorical diffusion ensembles for single-step chemical retrosynthesis.用于单步化学逆合成的分类扩散集合
J Cheminform. 2025 Jul 29;17(1):112. doi: 10.1186/s13321-025-01056-7.
5
From reactants to products: computational methods for biosynthetic pathway design.从反应物到产物:生物合成途径设计的计算方法
Synth Syst Biotechnol. 2025 May 15;10(3):1038-1049. doi: 10.1016/j.synbio.2025.05.005. eCollection 2025 Sep.
6
RPSubAlign: a novel sequence-based molecular representation method for retrosynthesis prediction with improved validity and robustness.RPSubAlign:一种基于序列的新型分子表示方法,用于逆合成预测,具有更高的有效性和鲁棒性。
Brief Bioinform. 2025 May 1;26(3). doi: 10.1093/bib/bbaf257.
7
ASKCOS: Open-Source, Data-Driven Synthesis Planning.ASKCOS:开源、数据驱动的合成规划。
Acc Chem Res. 2025 Jun 3;58(11):1764-1775. doi: 10.1021/acs.accounts.5c00155. Epub 2025 May 21.
8
Cross-disciplinary perspectives on the potential for artificial intelligence across chemistry.关于人工智能在化学领域潜力的跨学科观点。
Chem Soc Rev. 2025 Apr 25. doi: 10.1039/d5cs00146c.
9
Directly optimizing for synthesizability in generative molecular design using retrosynthesis models.利用逆合成模型在生成式分子设计中直接优化可合成性。
Chem Sci. 2025 Mar 21;16(16):6943-6956. doi: 10.1039/d5sc01476j. eCollection 2025 Apr 16.
10
Accelerating the inference of string generation-based chemical reaction models for industrial applications.加速基于字符串生成的化学反应模型在工业应用中的推理。
J Cheminform. 2025 Mar 10;17(1):31. doi: 10.1186/s13321-025-00974-w.
分子图注意力网络:将化学反应建模为图编辑序列。
J Chem Inf Model. 2021 Jul 26;61(7):3273-3284. doi: 10.1021/acs.jcim.1c00537. Epub 2021 Jul 12.
4
Automatic retrosynthetic route planning using template-free models.使用无模板模型的自动逆合成路线规划。
Chem Sci. 2020 Mar 3;11(12):3355-3364. doi: 10.1039/c9sc03666k.
5
Extraction of organic chemistry grammar from unsupervised learning of chemical reactions.从化学反应的无监督学习中提取有机化学语法
Sci Adv. 2021 Apr 7;7(15). doi: 10.1126/sciadv.abe4166. Print 2021 Apr.
6
Valid, Plausible, and Diverse Retrosynthesis Using Tied Two-Way Transformers with Latent Variables.使用具有潜在变量的绑定双向转换器进行有效、可信和多样化的逆合成。
J Chem Inf Model. 2021 Jan 25;61(1):123-133. doi: 10.1021/acs.jcim.0c01074. Epub 2021 Jan 7.
7
State-of-the-art augmented NLP transformer models for direct and single-step retrosynthesis.最先进的增强型自然语言处理转换器模型,用于直接和单步逆合成。
Nat Commun. 2020 Nov 4;11(1):5575. doi: 10.1038/s41467-020-19266-y.
8
Bayesian Algorithm for Retrosynthesis.贝叶斯算法在逆合成中的应用。
J Chem Inf Model. 2020 Oct 26;60(10):4474-4486. doi: 10.1021/acs.jcim.0c00320. Epub 2020 Oct 13.
9
Predicting Retrosynthetic Reactions Using Self-Corrected Transformer Neural Networks.使用自校正变换神经网络预测逆向合成反应。
J Chem Inf Model. 2020 Jan 27;60(1):47-55. doi: 10.1021/acs.jcim.9b00949. Epub 2019 Dec 24.
10
Molecular Transformer: A Model for Uncertainty-Calibrated Chemical Reaction Prediction.分子变压器:一种用于不确定性校准化学反应预测的模型。
ACS Cent Sci. 2019 Sep 25;5(9):1572-1583. doi: 10.1021/acscentsci.9b00576. Epub 2019 Aug 30.