IEEE Trans Cybern. 2022 Dec;52(12):12771-12784. doi: 10.1109/TCYB.2021.3088880. Epub 2022 Nov 18.
Clustering techniques attempt to group objects with similar properties into a cluster. Clustering the nodes of an attributed graph, in which each node is associated with a set of feature attributes, has attracted significant attention. Graph convolutional networks (GCNs) represent an effective approach for integrating the two complementary factors of node attributes and structural information for attributed graph clustering. Smoothness is an indicator for assessing the degree of similarity of feature representations among nearby nodes in a graph. Oversmoothing in GCNs, caused by unnecessarily high orders of graph convolution, produces indistinguishable representations of nodes, such that the nodes in a graph tend to be grouped into fewer clusters, and pose a challenge due to the resulting performance drop. In this study, we propose a smoothness sensor for attributed graph clustering based on adaptive smoothness-transition graph convolutions, which senses the smoothness of a graph and adaptively terminates the current convolution once the smoothness is saturated to prevent oversmoothing. Furthermore, as an alternative to graph-level smoothness, a novel fine-grained nodewise-level assessment of smoothness is proposed, in which smoothness is computed in accordance with the neighborhood conditions of a given node at a certain order of graph convolution. In addition, a self-supervision criterion is designed considering both the tightness within clusters and the separation between clusters to guide the entire neural network training process. The experiments show that the proposed methods significantly outperform 13 other state-of-the-art baselines in terms of different metrics across five benchmark datasets. In addition, an extensive study reveals the reasons for their effectiveness and efficiency.
聚类技术试图将具有相似属性的对象组合成一个簇。对具有属性的图的节点进行聚类,其中每个节点都与一组特征属性相关联,这引起了人们的极大关注。图卷积网络(GCN)是一种有效的方法,可以将节点属性和结构信息这两个互补因素结合起来进行属性图聚类。平滑度是评估图中邻近节点特征表示之间相似程度的指标。由于图卷积的阶数过高,GCN 中会出现过度平滑,从而导致节点的表示无法区分,使得图中的节点往往会被分成较少的簇,从而导致性能下降。在这项研究中,我们提出了一种基于自适应平滑过渡图卷积的属性图聚类平滑度传感器,该传感器可以感知图的平滑度,并在平滑度达到饱和时自适应地终止当前卷积,以防止过度平滑。此外,作为图级平滑度的替代方法,我们提出了一种新的细粒度节点级平滑度评估方法,该方法根据给定节点在某一阶图卷积时的邻域条件来计算平滑度。此外,还设计了一种自监督准则,该准则同时考虑了簇内的紧密度和簇间的分离度,以指导整个神经网络训练过程。实验表明,所提出的方法在五个基准数据集的不同指标上均显著优于其他 13 种最先进的基线方法。此外,广泛的研究揭示了它们的有效性和效率的原因。