Wei Qiang, Hu Guangmin
University of Electronic Science and Technology of China, School of Information and Communication Engineering, Chengdu, Sichuan, China.
National Key Laboratory of Science and Technology on Blind Signal Processing, Chengdu, Sichuan, China.
PeerJ Comput Sci. 2022 Mar 4;8:e901. doi: 10.7717/peerj-cs.901. eCollection 2022.
It is often the case that only a portion of the underlying network structure is observed in real-world settings. However, as most network analysis methods are built on a complete network structure, the natural questions to ask are: (a) how well these methods perform with incomplete network structure, (b) which structural observation and network analysis method to choose for a specific task, and (c) is it beneficial to complete the missing structure.
In this paper, we consider the incomplete network structure as one random sampling instance from a complete graph, and we choose graph neural networks (GNNs), which have achieved promising results on various graph learning tasks, as the representative of network analysis methods. To identify the robustness of GNNs under graph sampling scenarios, we systemically evaluated six state-of-the-art GNNs under four commonly used graph sampling methods.
We show that GNNs can still be applied on single static networks under graph sampling scenarios, and simpler GNN models are able to outperform more sophisticated ones in a fairly experimental procedure. More importantly, we find that completing the sampled subgraph does improve the performance of downstream tasks in most cases; however, completion is not always effective and needs to be evaluated for a specific dataset. Our code is available at https://github.com/weiqianglg/evaluate-GNNs-under-graph-sampling.
在现实世界的场景中,通常只能观察到底层网络结构的一部分。然而,由于大多数网络分析方法是基于完整的网络结构构建的,自然而然会产生以下问题:(a) 这些方法在不完整网络结构上的表现如何;(b) 针对特定任务应选择哪种结构观测和网络分析方法;(c) 补全缺失结构是否有益。
在本文中,我们将不完整的网络结构视为从完全图中随机抽样得到的一个实例,并选择在各种图学习任务中取得了良好效果的图神经网络(GNN)作为网络分析方法的代表。为了确定GNN在图抽样场景下的鲁棒性,我们在四种常用的图抽样方法下系统地评估了六种先进的GNN。
我们表明,在图抽样场景下,GNN仍然可以应用于单个静态网络,并且在相当的实验过程中,较简单的GNN模型能够优于更复杂的模型。更重要的是,我们发现补全抽样子图在大多数情况下确实能提高下游任务的性能;然而,补全并不总是有效的,需要针对特定数据集进行评估。我们的代码可在https://github.com/weiqianglg/evaluate-GNNs-under-graph-sampling获取。