School of Computer Science, Peking University, Beijing, 100871, Beijing, China.
School of Mathematical Sciences, Peking University, Beijing, 100871, Beijing, China.
Neural Netw. 2022 Jul;151:70-79. doi: 10.1016/j.neunet.2022.03.018. Epub 2022 Mar 24.
Graph classification aims to predict the property of the whole graph, which has attracted growing attention in the graph learning community. This problem has been extensively studied in the literature of both graph convolutional networks and graph kernels. Graph convolutional networks can learn effective node representations via message passing to mine graph topology in an implicit way, whereas graph kernels can explicitly utilize graph structural knowledge for classification. Due to the scarcity of labeled data in real-world applications, semi-supervised algorithms are anticipated for this problem. In this paper, we propose Graph Harmonic Neural Network (GHNN) which combines the advantages of both worlds to sufficiently leverage the unlabeled data, and thus overcomes label scarcity in semi-supervised scenarios. Specifically, our GHNN consists of a graph convolutional network (GCN) module and a graph kernel network (GKN) module that explore graph topology information from complementary perspectives. To fully leverage the unlabeled data, we develop a novel harmonic contrastive loss and a harmonic consistency loss to harmonize the training of two modules by giving priority to high-quality unlabeled data, thereby reconciling prediction consistency between both of them. In this manner, the two modules mutually enhance each other to sufficiently explore the graph topology of both labeled and unlabeled data. Extensive experiments on a variety of benchmarks demonstrate the effectiveness of our approach over competitive baselines.
图分类旨在预测整个图的属性,这在图学习社区中引起了越来越多的关注。这个问题在图卷积网络和图核文献中都得到了广泛的研究。图卷积网络可以通过消息传递学习有效的节点表示,从而以隐式的方式挖掘图拓扑结构,而图核可以显式地利用图结构知识进行分类。由于现实应用中标记数据的稀缺,预计这个问题需要使用半监督算法。在本文中,我们提出了图调和神经网络(GHNN),它结合了这两个领域的优势,充分利用未标记数据,从而克服了半监督场景中的标签稀缺问题。具体来说,我们的 GHNN 由一个图卷积网络(GCN)模块和一个图核网络(GKN)模块组成,它们从互补的角度探索图拓扑信息。为了充分利用未标记数据,我们开发了一种新的调和对比损失和调和一致性损失,通过优先考虑高质量的未标记数据来协调两个模块的训练,从而协调它们之间的预测一致性。通过这种方式,两个模块相互促进,充分挖掘有标签和无标签数据的图拓扑结构。在各种基准上的广泛实验表明,我们的方法优于竞争基线。