Knowledge-Embedded Message-Passing Neural Networks: Improving Molecular Property Prediction with Human Knowledge.

Author information

Hasebe Tatsuya

Affiliation

Research & Development Group, Hitachi, Ltd., 832-2, Horiguchi, Hitachinaka, Ibaraki 312-0034, Japan.

Publication information

ACS Omega. 2021 Oct 14;6(42):27955-27967. doi: 10.1021/acsomega.1c03839. eCollection 2021 Oct 26.

DOI:10.1021/acsomega.1c03839

PMID:34722995

Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC8552328/

Abstract

Graph neural networks (GNNs) have become a promising method for predicting molecular properties with end-to-end supervision, because they learn molecular features directly from chemical graphs in a black-box manner. However, achieving high prediction accuracy requires training on a large amount of property data, which often entails high experimental cost. Before deep learning methods, descriptor-based quantitative structure-property relationship (QSPR) studies drew on physical and chemical knowledge to manually design descriptors that predict properties effectively. In this study, we extend the message-passing neural network (MPNN) into a novel architecture, the knowledge-embedded MPNN (KEMPNN), which can be supervised jointly with nonquantitative knowledge annotations made by human experts on a chemical graph. These annotations mark the important substructures of a molecule and their effect on the target property (e.g., positive or negative). We evaluated the KEMPNN in a small-training-data setting using the physical chemistry datasets in MoleculeNet (ESOL, FreeSolv, Lipophilicity) and a polymer property (glass-transition temperature) dataset with virtual knowledge annotations. The results demonstrate that the KEMPNN with knowledge supervision improves on the prediction accuracy of the MPNN, and that its accuracy is better than or comparable to that of descriptor-based methods even with small training data.
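
The abstract does not spell out how the knowledge annotations enter training, so the following is only a rough sketch of the general idea of supervising a message-passing model on a property label and on per-atom annotations at once. It is not the KEMPNN architecture or its loss. Every name here (`message_pass`, `knowledge_loss`, `joint_loss`, `weight`), the sum-over-neighbors update, the +1/-1 annotation encoding, and the squared-error terms are assumptions made for illustration.

```python
from typing import List, Optional, Sequence


def message_pass(features: Sequence[float],
                 neighbors: Sequence[Sequence[int]]) -> List[float]:
    """One toy message-passing step: each atom adds its neighbors' features.

    Assumption: scalar atom features and an unweighted sum, chosen only
    to keep the sketch short. Real MPNNs use learned vector updates.
    """
    return [h + sum(features[j] for j in neighbors[i])
            for i, h in enumerate(features)]


def knowledge_loss(atom_scores: Sequence[float],
                   annotations: Sequence[Optional[float]]) -> float:
    """Mean squared error between per-atom scores and expert annotations.

    Assumption: +1.0 marks a positive effect on the property, -1.0 a
    negative effect, and None means the expert left the atom unannotated.
    Unannotated atoms are skipped.
    """
    pairs = [(s, a) for s, a in zip(atom_scores, annotations) if a is not None]
    if not pairs:
        return 0.0
    return sum((s - a) ** 2 for s, a in pairs) / len(pairs)


def joint_loss(pred: float, target: float,
               atom_scores: Sequence[float],
               annotations: Sequence[Optional[float]],
               weight: float = 0.5) -> float:
    """Squared property error plus a weighted knowledge term (hypothetical)."""
    return (pred - target) ** 2 + weight * knowledge_loss(atom_scores, annotations)


# Toy example: heavy atoms of ethanol, C-C-O, indexed 0, 1, 2.
neighbors = [[1], [0, 2], [1]]
updated = message_pass([1.0, 1.0, 2.0], neighbors)

# Illustrative annotation: only the oxygen is marked as having a
# positive effect on the target property (for example, solubility).
annotations = [None, None, 1.0]
loss = joint_loss(pred=1.0, target=0.0,
                  atom_scores=[0.5, 0.0, 0.0],
                  annotations=annotations,
                  weight=0.5)
```

In this sketch the knowledge term contributes only for atoms an expert actually annotated, so a molecule with no annotations falls back to the plain property loss. The `weight` parameter is a hypothetical knob for balancing the two terms.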

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/facc/8552328/59c31cea6a3c/ao1c03839_0002.jpg
