Wang Zhen, Li Zhaoqing, Wang Rong, Nie Feiping, Li Xuelong
IEEE Trans Pattern Anal Mach Intell. 2021 Dec;43(12):4426-4440. doi: 10.1109/TPAMI.2020.3002587. Epub 2021 Nov 3.
Spectral clustering methods are gaining more and more interests and successfully applied in many fields because of their superior performance. However, there still exist two main problems to be solved: 1) spectral clustering methods consist of two successive optimization stages-spectral embedding and spectral rotation, which may not lead to globally optimal solutions, 2) and it is known that spectral methods are time-consuming with very high computational complexity. There are methods proposed to reduce the complexity for data vectors but not for graphs that only have information about similarity matrices. In this paper, we propose a new method to solve these two challenging problems for graph clustering. In the new method, a new framework is established to perform spectral embedding and spectral rotation simultaneously. The newly designed objective function consists of both terms of embedding and rotation, and we use an improved spectral rotation method to make it mathematically rigorous for the optimization. To further accelerate the algorithm, we derive a low-dimensional representation matrix from a graph by using label propagation, with which, in return, we can reconstruct a double-stochastic and positive semidefinite similarity matrix. Experimental results demonstrate that our method has excellent performance in time cost and accuracy.
谱聚类方法因其卓越的性能而越来越受到关注,并在许多领域得到成功应用。然而,仍然存在两个主要问题有待解决:1)谱聚类方法由两个连续的优化阶段组成——谱嵌入和谱旋转,这可能无法得到全局最优解;2)众所周知,谱方法耗时且计算复杂度非常高。已经有方法被提出来降低数据向量的复杂度,但对于仅具有相似性矩阵信息的图却没有。在本文中,我们提出了一种新方法来解决图聚类中的这两个具有挑战性的问题。在新方法中,建立了一个新框架来同时执行谱嵌入和谱旋转。新设计的目标函数包含嵌入和旋转两项,并且我们使用一种改进的谱旋转方法以使优化在数学上更加严谨。为了进一步加速算法,我们通过使用标签传播从图中导出一个低维表示矩阵,反过来,我们可以用它来重建一个双随机且半正定的相似性矩阵。实验结果表明,我们的方法在时间成本和准确性方面具有优异的性能。