Buterez David, Janet Jon Paul, Oglic Dino, Liò Pietro
Department of Computer Science and Technology, University of Cambridge, Cambridge, UK.
Molecular AI, BioPharmaceuticals R&D, AstraZeneca, Gothenburg, Sweden.
Nat Commun. 2025 Jun 5;16(1):5244. doi: 10.1038/s41467-025-60252-z.
There has been a recent surge in transformer-based architectures for learning on graphs, mainly motivated by attention as an effective learning mechanism and the desire to supersede the hand-crafted operators characteristic of message passing schemes. However, concerns have been raised over their empirical effectiveness, scalability, and the complexity of their pre-processing steps, especially in relation to much simpler graph neural networks that typically perform on par with them across a wide range of benchmarks. To address these shortcomings, we consider graphs as sets of edges and propose a purely attention-based approach consisting of an encoder and an attention pooling mechanism. The encoder vertically interleaves masked and vanilla self-attention modules to learn an effective representation of edges while allowing it to handle possible misspecifications in input graphs. Despite its simplicity, the approach outperforms fine-tuned message passing baselines and recently proposed transformer-based methods on more than 70 node- and graph-level tasks, including challenging long-range benchmarks. Moreover, we demonstrate state-of-the-art performance across tasks ranging from molecular and vision graphs to heterophilous node classification. The approach also outperforms graph neural networks and transformers in transfer learning settings, and scales much better than alternatives with a similar level of performance or expressive power.
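To make the architecture described in the abstract concrete, the following is a minimal PyTorch sketch of the edge-set idea: edges are treated as tokens, masked self-attention (restricted to pairs of edges that share an endpoint) is interleaved with vanilla self-attention, and a learned-query attention pooling produces a graph embedding. All names (EdgeSetEncoder, AttentionPool, edge_mask) and design details (layer counts, how the mask is built) are illustrative assumptions, not the authors' released code.

```python
# Hedged sketch of the edge-set attention idea from the abstract.
# Module names and hyperparameters are illustrative assumptions.
import torch
import torch.nn as nn


class EdgeSetEncoder(nn.Module):
    """Vertically interleaves masked and vanilla self-attention over edge tokens."""

    def __init__(self, dim: int, heads: int = 4, layers: int = 4):
        super().__init__()
        self.blocks = nn.ModuleList(
            nn.TransformerEncoderLayer(dim, heads, dim_feedforward=2 * dim,
                                       batch_first=True)
            for _ in range(layers)
        )

    def forward(self, edge_tokens: torch.Tensor, edge_mask: torch.Tensor) -> torch.Tensor:
        # edge_tokens: (batch, num_edges, dim)
        # edge_mask:   (num_edges, num_edges) bool; True = attention blocked
        for i, block in enumerate(self.blocks):
            mask = edge_mask if i % 2 == 0 else None  # masked layer, then vanilla layer
            edge_tokens = block(edge_tokens, src_mask=mask)
        return edge_tokens


class AttentionPool(nn.Module):
    """Pools the edge set into one graph embedding via a learned query vector."""

    def __init__(self, dim: int, heads: int = 4):
        super().__init__()
        self.query = nn.Parameter(torch.randn(1, 1, dim))
        self.attn = nn.MultiheadAttention(dim, heads, batch_first=True)

    def forward(self, edge_tokens: torch.Tensor) -> torch.Tensor:
        q = self.query.expand(edge_tokens.size(0), -1, -1)
        pooled, _ = self.attn(q, edge_tokens, edge_tokens)
        return pooled.squeeze(1)  # (batch, dim)


# Toy usage: a 4-node graph with 5 edges; the mask restricts the "masked"
# layers to pairs of edges that share an endpoint (line-graph adjacency).
dim, E = 32, 5
edges = torch.tensor([[0, 1], [1, 2], [2, 3], [3, 0], [1, 3]])
tokens = torch.randn(1, E, dim)  # one token per edge
share = (edges[:, None, :, None] == edges[None, :, None, :]).any(-1).any(-1)
mask = ~share  # block attention between edges with no common endpoint
graph_emb = AttentionPool(dim)(EdgeSetEncoder(dim)(tokens, mask))
print(graph_emb.shape)  # torch.Size([1, 32])
```

In this sketch the even-indexed layers attend only along the line-graph adjacency while the odd-indexed layers attend globally, mirroring the abstract's description of vertically interleaved masked and vanilla self-attention; the actual masking pattern, pooling design, and layer ordering used in the paper may differ.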