Data Intelligence System Research Center, Institute of Computing Technology, Chinese Academy of Sciences, Beijing, 100190, Beijing, China.
Neural Netw. 2024 Apr;172:106125. doi: 10.1016/j.neunet.2024.106125. Epub 2024 Jan 17.
Graph Contrastive Learning (GCL) is increasingly employed in graph representation learning with the primary aim of learning node/graph representations from a predefined pretext task that can generalize to various downstream tasks. Meanwhile, the transition from a specific pretext task to diverse and unpredictable downstream tasks poses a significant challenge for GCL's generalization ability. Most existing GCL approaches maximize mutual information between two views derived from the original graph, either randomly or heuristically. However, the generalization ability of GCL and its theoretical principles are still less studied. In this paper, we introduce a novel metric GCL-GE, to quantify the generalization gap between predefined pretext and agnostic downstream tasks. Given the inherent intractability of GCL-GE, we leverage concepts from information theory to derive a mutual information upper bound that is independent of the downstream tasks, thus enabling the metric's optimization despite the variability in downstream tasks. Based on the theoretical insight, we propose InfoAdv, a GCL framework to directly enhance generalization by jointly optimizing GCL-GE and InfoMax. Extensive experiments validate the capability of InfoAdv to enhance performance across a wide variety of downstream tasks, demonstrating its effectiveness in improving the generalizability of GCL.
图对比学习(Graph Contrastive Learning,GCL)越来越多地被应用于图表示学习中,其主要目的是从预定义的前置任务中学习节点/图表示,这些表示可以泛化到各种下游任务中。然而,从特定的前置任务到多样化和不可预测的下游任务的转变对 GCL 的泛化能力提出了重大挑战。大多数现有的 GCL 方法通过最大化从原始图中随机或启发式得到的两个视图之间的互信息来最大化互信息。然而,GCL 的泛化能力及其理论原理仍然研究较少。在本文中,我们引入了一种新的度量 GCL-GE,用于量化预定义的前置任务和不可知的下游任务之间的泛化差距。由于 GCL-GE 的内在复杂性,我们利用信息论中的概念来推导出一个不依赖于下游任务的互信息上界,从而可以在下游任务变化的情况下优化该度量。基于理论见解,我们提出了 InfoAdv,这是一种 GCL 框架,可以通过联合优化 GCL-GE 和 InfoMax 来直接增强泛化能力。广泛的实验验证了 InfoAdv 在各种下游任务中提高性能的能力,证明了它在提高 GCL 的泛化能力方面的有效性。