• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

使用预训练的图神经网络改进化学反应产率预测

Improving chemical reaction yield prediction using pre-trained graph neural networks.

作者信息

Han Jongmin, Kwon Youngchun, Choi Youn-Suk, Kang Seokho

机构信息

Department of Industrial Engineering, Sungkyunkwan University, 2066 Seobu-ro, Jangan-gu, Suwon, Republic of Korea.

Samsung Advanced Institute of Technology, Samsung Electronics Co. Ltd., 130 Samsung-ro, Yeongtong-gu, Suwon, Republic of Korea.

出版信息

J Cheminform. 2024 Mar 1;16(1):25. doi: 10.1186/s13321-024-00818-z.

DOI:10.1186/s13321-024-00818-z
PMID:38429787
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC10905905/
Abstract

Graph neural networks (GNNs) have proven to be effective in the prediction of chemical reaction yields. However, their performance tends to deteriorate when they are trained using an insufficient training dataset in terms of quantity or diversity. A promising solution to alleviate this issue is to pre-train a GNN on a large-scale molecular database. In this study, we investigate the effectiveness of GNN pre-training in chemical reaction yield prediction. We present a novel GNN pre-training method for performance improvement.Given a molecular database consisting of a large number of molecules, we calculate molecular descriptors for each molecule and reduce the dimensionality of these descriptors by applying principal component analysis. We define a pre-text task by assigning a vector of principal component scores as the pseudo-label to each molecule in the database. A GNN is then pre-trained to perform the pre-text task of predicting the pseudo-label for the input molecule. For chemical reaction yield prediction, a prediction model is initialized using the pre-trained GNN and then fine-tuned with the training dataset containing chemical reactions and their yields. We demonstrate the effectiveness of the proposed method through experimental evaluation on benchmark datasets.

摘要

图神经网络(GNNs)已被证明在预测化学反应产率方面是有效的。然而,当使用数量或多样性不足的训练数据集对其进行训练时,它们的性能往往会下降。缓解这一问题的一个有前景的解决方案是在大规模分子数据库上对GNN进行预训练。在本研究中,我们研究了GNN预训练在化学反应产率预测中的有效性。我们提出了一种用于性能提升的新型GNN预训练方法。给定一个由大量分子组成的分子数据库,我们为每个分子计算分子描述符,并通过应用主成分分析来降低这些描述符的维度。我们通过为数据库中的每个分子分配一个主成分得分向量作为伪标签来定义一个预训练任务。然后对一个GNN进行预训练,以执行预测输入分子伪标签的预训练任务。对于化学反应产率预测,使用预训练的GNN初始化一个预测模型,然后使用包含化学反应及其产率的训练数据集对其进行微调。我们通过在基准数据集上的实验评估证明了所提出方法的有效性。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1ed2/10905905/300af32cd22f/13321_2024_818_Fig7_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1ed2/10905905/841f2d1ef96e/13321_2024_818_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1ed2/10905905/41b1a83c13ee/13321_2024_818_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1ed2/10905905/7adbbca9f5d7/13321_2024_818_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1ed2/10905905/e422b941954a/13321_2024_818_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1ed2/10905905/785e5c2342fe/13321_2024_818_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1ed2/10905905/3f671679cb47/13321_2024_818_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1ed2/10905905/300af32cd22f/13321_2024_818_Fig7_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1ed2/10905905/841f2d1ef96e/13321_2024_818_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1ed2/10905905/41b1a83c13ee/13321_2024_818_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1ed2/10905905/7adbbca9f5d7/13321_2024_818_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1ed2/10905905/e422b941954a/13321_2024_818_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1ed2/10905905/785e5c2342fe/13321_2024_818_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1ed2/10905905/3f671679cb47/13321_2024_818_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1ed2/10905905/300af32cd22f/13321_2024_818_Fig7_HTML.jpg

相似文献

1
Improving chemical reaction yield prediction using pre-trained graph neural networks.使用预训练的图神经网络改进化学反应产率预测
J Cheminform. 2024 Mar 1;16(1):25. doi: 10.1186/s13321-024-00818-z.
2
Adaptive Transfer of Graph Neural Networks for Few-Shot Molecular Property Prediction.图神经网络的自适应转移在少样本分子性质预测中的应用。
IEEE/ACM Trans Comput Biol Bioinform. 2023 Nov-Dec;20(6):3863-3875. doi: 10.1109/TCBB.2023.3327452. Epub 2023 Dec 25.
3
Pre-training graph neural networks for link prediction in biomedical networks.用于生物医学网络中链接预测的预训练图神经网络。
Bioinformatics. 2022 Apr 12;38(8):2254-2262. doi: 10.1093/bioinformatics/btac100.
4
Selecting molecules with diverse structures and properties by maximizing submodular functions of descriptors learned with graph neural networks.通过最大化基于图神经网络学习的描述符的子模函数来选择具有多样结构和性质的分子。
Sci Rep. 2022 Jan 21;12(1):1124. doi: 10.1038/s41598-022-04967-9.
5
MD-GNN: A mechanism-data-driven graph neural network for molecular properties prediction and new material discovery.MD-GNN:一种基于机制数据的图神经网络,用于分子性质预测和新材料发现。
J Mol Graph Model. 2023 Sep;123:108506. doi: 10.1016/j.jmgm.2023.108506. Epub 2023 May 9.
6
Graph Neural Tree: A novel and interpretable deep learning-based framework for accurate molecular property predictions.图神经网络树:一种新颖且可解释的基于深度学习的准确分子性质预测框架。
Anal Chim Acta. 2023 Mar 1;1244:340558. doi: 10.1016/j.aca.2022.340558. Epub 2022 Nov 3.
7
SLI-GNN: A Self-Learning-Input Graph Neural Network for Predicting Crystal and Molecular Properties.SLI-GNN:一种用于预测晶体和分子性质的自学习输入图神经网络。
J Phys Chem A. 2023 Jul 20;127(28):5921-5929. doi: 10.1021/acs.jpca.3c01558. Epub 2023 Jul 7.
8
Improved GNNs for Log  Prediction by Transferring Knowledge from Low-Fidelity Data.通过从低质量数据转移知识来改进图神经网络进行日志预测。
J Chem Inf Model. 2023 Apr 24;63(8):2345-2359. doi: 10.1021/acs.jcim.2c01564. Epub 2023 Mar 31.
9
SP-GNN: Learning structure and position information from graphs.SP-GNN:从图中学习结构和位置信息。
Neural Netw. 2023 Apr;161:505-514. doi: 10.1016/j.neunet.2023.01.051. Epub 2023 Feb 4.
10
Retention time prediction in hydrophilic interaction liquid chromatography with graph neural network and transfer learning.基于图神经网络和迁移学习的亲水相互作用液相色谱保留时间预测。
J Chromatogr A. 2021 Oct 25;1656:462536. doi: 10.1016/j.chroma.2021.462536. Epub 2021 Sep 7.

引用本文的文献

1
A meta-learning approach for selectivity prediction in asymmetric catalysis.一种用于不对称催化中选择性预测的元学习方法。
Nat Commun. 2025 Apr 16;16(1):3599. doi: 10.1038/s41467-025-58854-8.

本文引用的文献

1
MoCL: Data-driven Molecular Fingerprint via Knowledge-aware Contrastive Learning from Molecular Graph.MoCL:通过基于分子图的知识感知对比学习实现的数据驱动分子指纹
KDD. 2021 Aug;2021:3585-3594. doi: 10.1145/3447548.3467186. Epub 2021 Aug 14.
2
Self-Supervised Learning of Graph Neural Networks: A Unified Review.图神经网络的自监督学习:统一综述。
IEEE Trans Pattern Anal Mach Intell. 2023 Feb;45(2):2412-2429. doi: 10.1109/TPAMI.2022.3170559. Epub 2023 Jan 6.
3
Uncertainty-aware prediction of chemical reaction yields with graph neural networks.
基于图神经网络的化学反应产率不确定性感知预测。
J Cheminform. 2022 Jan 10;14(1):2. doi: 10.1186/s13321-021-00579-z.
4
A compact review of molecular property prediction with graph neural networks.图神经网络在分子性质预测中的应用综述
Drug Discov Today Technol. 2020 Dec;37:1-12. doi: 10.1016/j.ddtec.2020.11.009. Epub 2020 Dec 17.
5
Machine Learning for Chemical Reactions.机器学习在化学反应中的应用。
Chem Rev. 2021 Aug 25;121(16):10218-10239. doi: 10.1021/acs.chemrev.1c00033. Epub 2021 Jun 7.
6
An effective self-supervised framework for learning expressive molecular global representations to drug discovery.用于药物发现的学习表达性分子全局表示的有效自监督框架。
Brief Bioinform. 2021 Nov 5;22(6). doi: 10.1093/bib/bbab109.
7
Could graph neural networks learn better molecular representation for drug discovery? A comparison study of descriptor-based and graph-based models.图神经网络能否为药物发现学习更好的分子表示?基于描述符和基于图的模型的比较研究。
J Cheminform. 2021 Feb 17;13(1):12. doi: 10.1186/s13321-020-00479-8.
8
Molecular property prediction: recent trends in the era of artificial intelligence.分子性质预测:人工智能时代的最新趋势。
Drug Discov Today Technol. 2019 Dec;32-33:29-36. doi: 10.1016/j.ddtec.2020.05.001. Epub 2020 Jul 1.
9
Machine Learning Prediction of Nine Molecular Properties Based on the SMILES Representation of the QM9 Quantum-Chemistry Dataset.基于 QM9 量子化学数据集的 SMILES 表示的机器学习对 9 种分子性质的预测。
J Phys Chem A. 2020 Nov 25;124(47):9854-9866. doi: 10.1021/acs.jpca.0c05969. Epub 2020 Nov 11.
10
Comprehensive Study on Molecular Supervised Learning with Graph Neural Networks.基于图神经网络的分子监督式学习综合研究。
J Chem Inf Model. 2020 Dec 28;60(12):5936-5945. doi: 10.1021/acs.jcim.0c00416. Epub 2020 Nov 8.