• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

推进分子图及其描述符以预测化学反应产率。

Advancing molecular graphs with descriptors for the prediction of chemical reaction yields.

机构信息

SoftServe, Inc., Lviv, Ukraine.

Ukrainian Catholic University, Lviv, Ukraine.

出版信息

J Comput Chem. 2023 Jan 15;44(2):76-92. doi: 10.1002/jcc.27016. Epub 2022 Oct 20.

DOI:10.1002/jcc.27016
PMID:36264601
Abstract

Chemical yield is the percentage of the reactants converted to the desired products. Chemists use predictive algorithms to select high-yielding reactions and score synthesis routes, saving time and reagents. This study suggests a novel graph neural network architecture for chemical yield prediction. The network combines structural information about participants of the transformation as well as molecular and reaction-level descriptors. It works with incomplete chemical reactions and generates reactants-product atom mapping. We show that the network benefits from advanced information by comparing it with several machine learning models and molecular representations. Models included logistic regression, support vector machine, CatBoost, and Bidirectional Encoder Representations from Transformers. Molecular representations included extended-connectivity fingerprints, Morgan fingerprints, SMILESVec embeddings, and textual. Classification and regression objectives were assessed for each model and feature set. The goal of each classification model was to separate zero- and non-zero-yielding reactions. The models were trained and evaluated on a proprietary dataset of 10 reaction types. Also, the models were benchmarked on two public single reaction type datasets. The study was supplemented with analysis of data, results, and errors, as well as the impact of steric factors, side reactions, isolation, and purification efficiency. The supplementary code is available at https://github.com/SoftServeInc/yield-paper.

摘要

产率是反应物转化为所需产物的百分比。化学家使用预测算法来选择高产率的反应,并对合成路线进行评分,从而节省时间和试剂。本研究提出了一种新的用于化学产率预测的图神经网络架构。该网络结合了反应物-产物原子映射中关于反应物和产物的结构信息以及分子和反应级描述符。它可以处理不完整的化学反应,并生成反应物-产物原子映射。我们通过将其与几种机器学习模型和分子表示进行比较,证明了该网络从高级信息中获益。模型包括逻辑回归、支持向量机、CatBoost 和来自 Transformer 的双向编码器表示。分子表示包括扩展连接指纹、Morgan 指纹、SMILESVec 嵌入和文本。对每个模型和特征集评估了分类和回归目标。每个分类模型的目标是区分零产率和非零产率反应。在 10 种反应类型的专有数据集上对模型进行了训练和评估。此外,还在两个公共单反应类型数据集上对模型进行了基准测试。该研究还对数据、结果和错误进行了分析,并评估了立体因素、副反应、分离和纯化效率的影响。补充代码可在 https://github.com/SoftServeInc/yield-paper 上获得。

相似文献

1
Advancing molecular graphs with descriptors for the prediction of chemical reaction yields.推进分子图及其描述符以预测化学反应产率。
J Comput Chem. 2023 Jan 15;44(2):76-92. doi: 10.1002/jcc.27016. Epub 2022 Oct 20.
2
Bidirectional Graphormer for Reactivity Understanding: Neural Network Trained to Reaction Atom-to-Atom Mapping Task.双向图格默模型用于反应理解:训练用于反应原子到原子映射任务的神经网络。
J Chem Inf Model. 2022 Jul 25;62(14):3307-3315. doi: 10.1021/acs.jcim.2c00344. Epub 2022 Jul 6.
3
Machine learning algorithms for outcome prediction in (chemo)radiotherapy: An empirical comparison of classifiers.机器学习算法在(放化疗)治疗结果预测中的应用:分类器的实证比较。
Med Phys. 2018 Jul;45(7):3449-3459. doi: 10.1002/mp.12967. Epub 2018 Jun 13.
4
Molecular property prediction based on graph structure learning.基于图结构学习的分子性质预测。
Bioinformatics. 2024 May 2;40(5). doi: 10.1093/bioinformatics/btae304.
5
Ensembling machine learning models to boost molecular affinity prediction.集成机器学习模型以提高分子亲和力预测。
Comput Biol Chem. 2021 Aug;93:107529. doi: 10.1016/j.compbiolchem.2021.107529. Epub 2021 Jun 12.
6
Algebraic graph-assisted bidirectional transformers for molecular property prediction.基于代数图辅助的双向转换器在分子性质预测中的应用。
Nat Commun. 2021 Jun 10;12(1):3521. doi: 10.1038/s41467-021-23720-w.
7
Chemical reaction enhanced graph learning for molecule representation.化学反应增强图学习的分子表示。
Bioinformatics. 2024 Oct 1;40(10). doi: 10.1093/bioinformatics/btae558.
8
Unsupervised construction of computational graphs for gene expression data with explicit structural inductive biases.无监督构建具有显式结构归纳偏差的基因表达数据的计算图。
Bioinformatics. 2022 Feb 7;38(5):1320-1327. doi: 10.1093/bioinformatics/btab830.
9
Transfer Learning: Making Retrosynthetic Predictions Based on a Small Chemical Reaction Dataset Scale to a New Level.迁移学习:基于小规模化学反应数据集的逆向合成预测扩展到新的水平。
Molecules. 2020 May 19;25(10):2357. doi: 10.3390/molecules25102357.
10
FP-GNN: a versatile deep learning architecture for enhanced molecular property prediction.FP-GNN:一种用于增强分子性质预测的多功能深度学习架构。
Brief Bioinform. 2022 Nov 19;23(6). doi: 10.1093/bib/bbac408.

引用本文的文献

1
log-RRIM: Yield Prediction via Local-to-Global Reaction Representation Learning and Interaction Modeling.log-RRIM:通过局部到全局反应表示学习和相互作用建模进行产量预测。
J Chem Inf Model. 2025 Jul 29. doi: 10.1021/acs.jcim.5c01208.
2
Local reaction condition optimization via machine learning.通过机器学习优化局部反应条件
J Mol Model. 2025 Apr 23;31(5):143. doi: 10.1007/s00894-025-06365-0.
3
Investigating topological descriptor and Gibb's energy for semiconducting carbon allotrope through advanced curve fitting model.通过先进的曲线拟合模型研究半导体碳同素异形体的拓扑描述符和吉布斯自由能。
Sci Rep. 2025 Apr 12;15(1):12561. doi: 10.1038/s41598-025-97005-3.
4
Predicting and Explaining Yields with Machine Learning for Carboxylated Azoles and Beyond.利用机器学习预测和解释羧基化唑类及其他物质的产率
J Chem Inf Model. 2025 Feb 24;65(4):1862-1872. doi: 10.1021/acs.jcim.4c02336. Epub 2025 Feb 7.
5
log-RRIM: Yield Prediction via Local-to-global Reaction Representation Learning and Interaction Modeling.对数RRIM:通过局部到全局反应表示学习和相互作用建模进行产量预测
ArXiv. 2025 Mar 9:arXiv:2411.03320v4.
6
Machine learning-guided strategies for reaction conditions design and optimization.用于反应条件设计与优化的机器学习引导策略。
Beilstein J Org Chem. 2024 Oct 4;20:2476-2492. doi: 10.3762/bjoc.20.212. eCollection 2024.
7
Deep Kernel learning for reaction outcome prediction and optimization.用于反应结果预测与优化的深度核学习
Commun Chem. 2024 Jun 14;7(1):136. doi: 10.1038/s42004-024-01219-x.
8
Incorporating Synthetic Accessibility in Drug Design: Predicting Reaction Yields of Suzuki Cross-Couplings by Leveraging AbbVie's 15-Year Parallel Library Data Set.在药物设计中纳入合成可及性:利用 AbbVie 长达 15 年的平行文库数据集预测铃木交叉偶联反应产率。
J Am Chem Soc. 2024 Jun 5;146(22):15070-15084. doi: 10.1021/jacs.4c00098. Epub 2024 May 20.
9
Enhancing Generic Reaction Yield Prediction through Reaction Condition-Based Contrastive Learning.通过基于反应条件的对比学习提高通用反应产率预测
Research (Wash D C). 2024 Jan 10;7:0292. doi: 10.34133/research.0292. eCollection 2024.
10
When Yield Prediction Does Not Yield Prediction: An Overview of the Current Challenges.当产量预测无法预测时:当前挑战概述。
J Chem Inf Model. 2024 Jan 8;64(1):42-56. doi: 10.1021/acs.jcim.3c01524. Epub 2023 Dec 20.