• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

利用开放反应条件数据集和反应中心的无监督学习进行通用可解释反应条件预测。

Generic Interpretable Reaction Condition Predictions with Open Reaction Condition Datasets and Unsupervised Learning of Reaction Center.

作者信息

Wang Xiaorui, Hsieh Chang-Yu, Yin Xiaodan, Wang Jike, Li Yuquan, Deng Yafeng, Jiang Dejun, Wu Zhenxing, Du Hongyan, Chen Hongming, Li Yun, Liu Huanxiang, Wang Yuwei, Luo Pei, Hou Tingjun, Yao Xiaojun

机构信息

Dr. Neher's Biophysics Laboratory for Innovative Drug Discovery, State Key Laboratory of Quality Research in Chinese Medicine, Macau Institute for Applied Research in Medicine and Health, Macau University of Science and Technology, Macao, 999078, China.

CarbonSilicon AI Technology Co., Ltd, Hangzhou, Zhejiang310018, China.

出版信息

Research (Wash D C). 2023 Oct 16;6:0231. doi: 10.34133/research.0231. eCollection 2023.

DOI:10.34133/research.0231
PMID:37849643
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC10578430/
Abstract

Effective synthesis planning powered by deep learning (DL) can significantly accelerate the discovery of new drugs and materials. However, most DL-assisted synthesis planning methods offer either none or very limited capability to recommend suitable reaction conditions (RCs) for their reaction predictions. Currently, the prediction of RCs with a DL framework is hindered by several factors, including: (a) lack of a standardized dataset for benchmarking, (b) lack of a general prediction model with powerful representation, and (c) lack of interpretability. To address these issues, we first created 2 standardized RC datasets covering a broad range of reaction classes and then proposed a powerful and interpretable Transformer-based RC predictor named Parrot. Through careful design of the model architecture, pretraining method, and training strategy, Parrot improved the overall top-3 prediction accuracy on catalysis, solvents, and other reagents by as much as 13.44%, compared to the best previous model on a newly curated dataset. Additionally, the mean absolute error of the predicted temperatures was reduced by about 4 °C. Furthermore, Parrot manifests strong generalization capacity with superior cross-chemical-space prediction accuracy. Attention analysis indicates that Parrot effectively captures crucial chemical information and exhibits a high level of interpretability in the prediction of RCs. The proposed model Parrot exemplifies how modern neural network architecture when appropriately pretrained can be versatile in making reliable, generalizable, and interpretable recommendation for RCs even when the underlying training dataset may still be limited in diversity.

摘要

由深度学习(DL)驱动的有效合成规划可以显著加速新药和新材料的发现。然而,大多数深度学习辅助的合成规划方法在为其反应预测推荐合适的反应条件(RCs)方面,要么完全没有能力,要么能力非常有限。目前,使用深度学习框架预测反应条件受到几个因素的阻碍,包括:(a)缺乏用于基准测试的标准化数据集,(b)缺乏具有强大表征能力的通用预测模型,以及(c)缺乏可解释性。为了解决这些问题,我们首先创建了2个涵盖广泛反应类别的标准化反应条件数据集,然后提出了一个名为Parrot的基于Transformer的强大且可解释的反应条件预测器。通过精心设计模型架构、预训练方法和训练策略,与新整理数据集上的最佳先前模型相比,Parrot在催化、溶剂和其他试剂方面的总体前3预测准确率提高了13.44%。此外,预测温度的平均绝对误差降低了约4°C。此外,Parrot表现出强大的泛化能力,具有卓越的跨化学空间预测准确率。注意力分析表明,Parrot有效地捕捉了关键化学信息,并在反应条件预测中表现出高度的可解释性。所提出的模型Parrot例证了,即使基础训练数据集在多样性方面可能仍然有限,现代神经网络架构在经过适当预训练后,如何能够在为反应条件做出可靠、可泛化和可解释的推荐方面具有通用性。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bbdf/10578430/7ed174f542ec/research.0231.fig.008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bbdf/10578430/19d6889e520f/research.0231.fig.001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bbdf/10578430/d339ab9249ba/research.0231.fig.002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bbdf/10578430/fa5e62b09b7b/research.0231.fig.003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bbdf/10578430/4c02c284b898/research.0231.fig.004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bbdf/10578430/bc029c7cd8a2/research.0231.fig.005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bbdf/10578430/4750a78f5e02/research.0231.fig.006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bbdf/10578430/cb63d68cf1e5/research.0231.fig.007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bbdf/10578430/7ed174f542ec/research.0231.fig.008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bbdf/10578430/19d6889e520f/research.0231.fig.001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bbdf/10578430/d339ab9249ba/research.0231.fig.002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bbdf/10578430/fa5e62b09b7b/research.0231.fig.003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bbdf/10578430/4c02c284b898/research.0231.fig.004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bbdf/10578430/bc029c7cd8a2/research.0231.fig.005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bbdf/10578430/4750a78f5e02/research.0231.fig.006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bbdf/10578430/cb63d68cf1e5/research.0231.fig.007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bbdf/10578430/7ed174f542ec/research.0231.fig.008.jpg

相似文献

1
Generic Interpretable Reaction Condition Predictions with Open Reaction Condition Datasets and Unsupervised Learning of Reaction Center.利用开放反应条件数据集和反应中心的无监督学习进行通用可解释反应条件预测。
Research (Wash D C). 2023 Oct 16;6:0231. doi: 10.34133/research.0231. eCollection 2023.
2
Enhancing Generic Reaction Yield Prediction through Reaction Condition-Based Contrastive Learning.通过基于反应条件的对比学习提高通用反应产率预测
Research (Wash D C). 2024 Jan 10;7:0292. doi: 10.34133/research.0292. eCollection 2024.
3
Transfer Learning: Making Retrosynthetic Predictions Based on a Small Chemical Reaction Dataset Scale to a New Level.迁移学习:基于小规模化学反应数据集的逆向合成预测扩展到新的水平。
Molecules. 2020 May 19;25(10):2357. doi: 10.3390/molecules25102357.
4
A general prediction model for compound-protein interactions based on deep learning.一种基于深度学习的化合物-蛋白质相互作用通用预测模型。
Front Pharmacol. 2024 Sep 4;15:1465890. doi: 10.3389/fphar.2024.1465890. eCollection 2024.
5
SynCluster: Reaction Type Clustering and Recommendation Framework for Synthesis Planning.SynCluster:用于合成规划的反应类型聚类与推荐框架
JACS Au. 2023 Nov 17;3(12):3446-3461. doi: 10.1021/jacsau.3c00607. eCollection 2023 Dec 25.
6
Transformer-based multitask learning for reaction prediction under low-resource circumstances.基于Transformer的低资源环境下反应预测的多任务学习
RSC Adv. 2022 Nov 8;12(49):32020-32026. doi: 10.1039/d2ra05349g. eCollection 2022 Nov 3.
7
iNGNN-DTI: prediction of drug-target interaction with interpretable nested graph neural network and pretrained molecule models.INGNN-DTI:利用可解释嵌套图神经网络和预训练分子模型预测药物-靶标相互作用。
Bioinformatics. 2024 Mar 4;40(3). doi: 10.1093/bioinformatics/btae135.
8
Prediction of anticancer drug sensitivity using an interpretable model guided by deep learning.利用深度学习指导的可解释模型预测抗癌药物敏感性。
BMC Bioinformatics. 2024 May 9;25(1):182. doi: 10.1186/s12859-024-05669-x.
9
MG-BERT: leveraging unsupervised atomic representation learning for molecular property prediction.MG-BERT:利用无监督原子表示学习进行分子性质预测。
Brief Bioinform. 2021 Nov 5;22(6). doi: 10.1093/bib/bbab152.
10
Retrosynthesis prediction using an end-to-end graph generative architecture for molecular graph editing.基于端到端图生成架构的分子图编辑回溯合成预测。
Nat Commun. 2023 May 25;14(1):3009. doi: 10.1038/s41467-023-38851-5.

引用本文的文献

1
Molecular Merged Hypergraph Neural Network for Explainable Solvation Gibbs Free Energy Prediction.用于可解释溶剂化吉布斯自由能预测的分子合并超图神经网络
Research (Wash D C). 2025 Aug 15;8:0740. doi: 10.34133/research.0740. eCollection 2025.
2
HiCLR: Knowledge-Induced Hierarchical Contrastive Learning with Retrosynthesis Prediction Yields a Reaction Foundation Model.HiCLR:基于逆合成预测的知识诱导分层对比学习产生反应基础模型。
JACS Au. 2025 Jun 25;5(7):3140-3155. doi: 10.1021/jacsau.5c00289. eCollection 2025 Jul 28.
3
Road of Large Language Model: Source, Challenge, and Future Perspectives.

本文引用的文献

1
Pushing the Boundaries of Molecular Property Prediction for Drug Discovery with Multitask Learning BERT Enhanced by SMILES Enumeration.通过SMILES枚举增强的多任务学习BERT推动药物发现中分子性质预测的边界
Research (Wash D C). 2022 Dec 15;2022:0004. doi: 10.34133/research.0004. eCollection 2022.
2
Augmenting large language models with chemistry tools.用化学工具增强大语言模型。
Nat Mach Intell. 2024;6(5):525-535. doi: 10.1038/s42256-024-00832-8. Epub 2024 May 8.
3
Exploring Optimal Reaction Conditions Guided by Graph Neural Networks and Bayesian Optimization.
大语言模型之路:来源、挑战及未来展望。
Research (Wash D C). 2025 Jul 29;8:0655. doi: 10.34133/research.0655. eCollection 2025.
4
ASKCOS: Open-Source, Data-Driven Synthesis Planning.ASKCOS:开源、数据驱动的合成规划。
Acc Chem Res. 2025 Jun 3;58(11):1764-1775. doi: 10.1021/acs.accounts.5c00155. Epub 2025 May 21.
5
ProstaNet: A Novel Geometric Vector Perceptrons-Graph Neural Network Algorithm for Protein Stability Prediction in Single- and Multiple-Point Mutations with Experimental Validation.ProstaNet:一种用于单点和多点突变蛋白质稳定性预测的新型几何向量感知器-图神经网络算法,并经过实验验证
Research (Wash D C). 2025 Apr 15;8:0674. doi: 10.34133/research.0674. eCollection 2025.
6
Syn-MolOpt: a synthesis planning-driven molecular optimization method using data-derived functional reaction templates.Syn-MolOpt:一种使用数据驱动的功能反应模板的合成规划驱动的分子优化方法。
J Cheminform. 2025 Mar 2;17(1):27. doi: 10.1186/s13321-025-00975-9.
7
Reacon: a template- and cluster-based framework for reaction condition prediction.Reacon:一种基于模板和聚类的反应条件预测框架。
Chem Sci. 2024 Dec 6;16(2):854-866. doi: 10.1039/d4sc05946h. eCollection 2025 Jan 2.
8
Machine learning-guided strategies for reaction conditions design and optimization.用于反应条件设计与优化的机器学习引导策略。
Beilstein J Org Chem. 2024 Oct 4;20:2476-2492. doi: 10.3762/bjoc.20.212. eCollection 2024.
9
Developing BioNavi for Hybrid Retrosynthesis Planning.用于混合逆合成规划的BioNavi开发。
JACS Au. 2024 Jul 3;4(7):2492-2502. doi: 10.1021/jacsau.4c00228. eCollection 2024 Jul 22.
10
Artificial Intelligence Methods and Models for Retro-Biosynthesis: A Scoping Review.人工智能方法和模型在逆向生物合成中的应用:范围综述。
ACS Synth Biol. 2024 Aug 16;13(8):2276-2294. doi: 10.1021/acssynbio.4c00091. Epub 2024 Jul 24.
基于图神经网络和贝叶斯优化指导探索最优反应条件
ACS Omega. 2022 Dec 2;7(49):44939-44950. doi: 10.1021/acsomega.2c05165. eCollection 2022 Dec 13.
4
Chemistry-informed molecular graph as reaction descriptor for machine-learned retrosynthesis planning.基于化学信息的分子图作为机器学习逆向合成规划的反应描述符。
Proc Natl Acad Sci U S A. 2022 Oct 11;119(41):e2212711119. doi: 10.1073/pnas.2212711119. Epub 2022 Oct 3.
5
Predicting reaction conditions from limited data through active transfer learning.通过主动迁移学习从有限数据预测反应条件。
Chem Sci. 2022 May 11;13(22):6655-6668. doi: 10.1039/d1sc06932b. eCollection 2022 Jun 7.
6
Machine Learning May Sometimes Simply Capture Literature Popularity Trends: A Case Study of Heterocyclic Suzuki-Miyaura Coupling.机器学习有时可能只是捕捉文献的流行趋势:杂环铃木-宫浦偶联反应的案例研究。
J Am Chem Soc. 2022 Mar 23;144(11):4819-4827. doi: 10.1021/jacs.1c12005. Epub 2022 Mar 8.
7
Prediction of the Chemical Context for Buchwald-Hartwig Coupling Reactions.预测 Buchwald-Hartwig 偶联反应的化学环境。
Mol Inform. 2022 Aug;41(8):e2100294. doi: 10.1002/minf.202100294. Epub 2022 Feb 22.
8
The Open Reaction Database.开放式反应数据库。
J Am Chem Soc. 2021 Nov 17;143(45):18820-18826. doi: 10.1021/jacs.1c09820. Epub 2021 Nov 2.
9
Evaluating and clustering retrosynthesis pathways with learned strategy.利用学习到的策略评估和聚类逆合成途径。
Chem Sci. 2020 Nov 23;12(4):1469-1478. doi: 10.1039/d0sc05078d.
10
Predicting retrosynthetic pathways using transformer-based models and a hyper-graph exploration strategy.使用基于Transformer的模型和超图探索策略预测逆合成途径。
Chem Sci. 2020 Mar 3;11(12):3316-3325. doi: 10.1039/c9sc05704h.