Institute of Cyberspace Security, Zhejiang University of Technology, Hangzhou, 310023, China; Binjiang Institute of Artificial Intelligence, Hangzhou, 310056, China; College of Computer Science and Technology, Zhejiang University of Technology, Hangzhou, 310023, China.
Institute of Cyberspace Security, Zhejiang University of Technology, Hangzhou, 310023, China; Binjiang Institute of Artificial Intelligence, Hangzhou, 310056, China.
Neural Netw. 2024 Dec;180:106650. doi: 10.1016/j.neunet.2024.106650. Epub 2024 Aug 23.
Real-world graphs exhibit increasing heterophily, where nodes no longer tend to be connected to nodes with the same label, challenging the homophily assumption of classical graph neural networks (GNNs) and impeding their performance. Intriguingly, from the observation of heterophilous data, we notice that certain high-order information exhibits higher homophily, which motivates us to involve high-order information in node representation learning. However, common practices in GNNs to acquire high-order information mainly through increasing model depth and altering message-passing mechanisms, which, albeit effective to a certain extent, suffer from three shortcomings: (1) over-smoothing due to excessive model depth and propagation times; (2) high-order information is not fully utilized; (3) low computational efficiency. In this regard, we design a similarity-based path sampling strategy to capture smooth paths containing high-order homophily. Then we propose a lightweight model based on multi-layer perceptrons (MLP), named PathMLP, which can encode messages carried by paths via simple transformation and concatenation operations, and effectively learn node representations in heterophilous graphs through adaptive path aggregation. Extensive experiments demonstrate that our method outperforms baselines on 16 out of 20 datasets, underlining its effectiveness and superiority in alleviating the heterophily problem. In addition, our method is immune to over-smoothing and has high computational efficiency. The source code will be available in https://github.com/Graph4Sec-Team/PathMLP.
真实世界的图表现出越来越强的异配性,节点之间不再倾向于与具有相同标签的节点相连,这挑战了传统图神经网络(GNN)的同配性假设,并阻碍了它们的性能。有趣的是,从异配数据的观察中,我们注意到某些高阶信息表现出更高的同配性,这促使我们在节点表示学习中引入高阶信息。然而,GNN 中获取高阶信息的常见做法主要是通过增加模型深度和改变消息传递机制,虽然在一定程度上有效,但存在三个缺点:(1)由于模型深度和传播次数过多而导致过度平滑;(2)高阶信息未被充分利用;(3)计算效率低。在这方面,我们设计了一种基于相似度的路径采样策略来捕捉包含高阶同配性的平滑路径。然后,我们提出了一种基于多层感知机(MLP)的轻量级模型,称为 PathMLP,它可以通过简单的变换和连接操作对路径携带的消息进行编码,并通过自适应路径聚合有效地学习异配图中的节点表示。大量实验表明,我们的方法在 20 个数据集的 16 个数据集上优于基线,突出了其在缓解异配性问题方面的有效性和优越性。此外,我们的方法对过度平滑具有免疫力,并且具有高效的计算效率。代码将在 https://github.com/Graph4Sec-Team/PathMLP 上提供。