School of Information Engineering, Yancheng Institute of Technology, Yancheng, 224051, China.
School of Information Engineering, Yancheng Institute of Technology, Yancheng, 224051, China.
Neural Netw. 2023 Oct;167:118-128. doi: 10.1016/j.neunet.2023.08.020. Epub 2023 Aug 19.
Recently, deep clustering has been extensively employed for various data mining tasks, and it can be divided into auto-encoder (AE)-based and graph neural networks (GNN)-based methods. However, existing AE-based methods fall short in effectively extracting structural information, while GNN suffer from smoothing and heterophily. Although methods that combine AE and GNN achieve impressive performance, there remains an inadequate balance between preserving the raw structure and exploring the underlying structure. Accordingly, we propose a novel network named Structure-Aware Deep Clustering network (SADC). Firstly, we compute the cumulative influence of non-adjacent nodes at multiple depths and, thus, enhance the adjacency matrix. Secondly, an enhanced graph auto-encoder is designed. Thirdly, the latent space of AE is endowed with the ability to perceive the raw structure during the learning process. Besides, we design self-supervised mechanisms to achieve co-optimization of node representation learning and topology learning. A new loss function is designed to preserve the inherent structure while also allowing for exploration of latent data structure. Extensive experiments on six benchmark datasets validate that our method outperforms state-of-the-art methods.
最近,深度聚类在各种数据挖掘任务中得到了广泛应用,可以分为基于自动编码器 (AE) 和基于图神经网络 (GNN) 的方法。然而,现有的基于 AE 的方法在有效提取结构信息方面存在不足,而 GNN 则存在平滑和异质性问题。尽管结合 AE 和 GNN 的方法取得了令人印象深刻的性能,但在保留原始结构和探索潜在结构之间仍然存在不平衡。因此,我们提出了一种名为结构感知深度聚类网络 (SADC) 的新网络。首先,我们计算多个深度中非相邻节点的累积影响,从而增强邻接矩阵。其次,设计了增强图自动编码器。第三,在学习过程中,AE 的潜在空间被赋予感知原始结构的能力。此外,我们设计了自监督机制,以实现节点表示学习和拓扑学习的共同优化。设计了一个新的损失函数,以在保留固有结构的同时允许对潜在数据结构进行探索。在六个基准数据集上的广泛实验验证了我们的方法优于最先进的方法。