Fang Si-Guo, Huang Dong, Cai Xiao-Sha, Wang Chang-Dong, He Chaobo, Tang Yong
IEEE Trans Neural Netw Learn Syst. 2024 Aug;35(8):11436-11447. doi: 10.1109/TNNLS.2023.3261460. Epub 2024 Aug 5.
Although previous graph-based multi-view clustering (MVC) algorithms have gained significant progress, most of them are still faced with three limitations. First, they often suffer from high computational complexity, which restricts their applications in large-scale scenarios. Second, they usually perform graph learning either at the single-view level or at the view-consensus level, but often neglect the possibility of the joint learning of single-view and consensus graphs. Third, many of them rely on the k -means for discretization of the spectral embeddings, which lack the ability to directly learn the graph with discrete cluster structure. In light of this, this article presents an efficient MVC approach via u nified and d iscrete b ipartite g raph l earning (UDBGL). Specifically, the anchor-based subspace learning is incorporated to learn the view-specific bipartite graphs from multiple views, upon which the bipartite graph fusion is leveraged to learn a view-consensus bipartite graph with adaptive weight learning. Furthermore, the Laplacian rank constraint is imposed to ensure that the fused bipartite graph has discrete cluster structures (with a specific number of connected components). By simultaneously formulating the view-specific bipartite graph learning, the view-consensus bipartite graph learning, and the discrete cluster structure learning into a unified objective function, an efficient minimization algorithm is then designed to tackle this optimization problem and directly achieve a discrete clustering solution without requiring additional partitioning, which notably has linear time complexity in data size. Experiments on a variety of multi-view datasets demonstrate the robustness and efficiency of our UDBGL approach. The code is available at https://github.com/huangdonghere/UDBGL.
尽管先前基于图的多视图聚类(MVC)算法已取得显著进展,但其中大多数仍面临三个局限性。首先,它们常常具有较高的计算复杂度,这限制了它们在大规模场景中的应用。其次,它们通常在单视图级别或视图共识级别执行图学习,但往往忽略了单视图和共识图联合学习的可能性。第三,它们中的许多依赖k均值对谱嵌入进行离散化,这缺乏直接学习具有离散聚类结构的图的能力。鉴于此,本文提出了一种通过统一离散二分图学习(UDBGL)的高效MVC方法。具体而言,引入基于锚点的子空间学习从多个视图中学习特定于视图的二分图,在此基础上利用二分图融合通过自适应权重学习来学习视图共识二分图。此外,施加拉普拉斯秩约束以确保融合后的二分图具有离散聚类结构(具有特定数量的连通分量)。通过将特定于视图的二分图学习、视图共识二分图学习和离散聚类结构学习同时纳入一个统一的目标函数,然后设计了一种高效的最小化算法来解决此优化问题,并直接获得离散聚类解而无需额外的划分,其在数据规模上具有显著的线性时间复杂度。在各种多视图数据集上的实验证明了我们的UDBGL方法的鲁棒性和效率。代码可在https://github.com/huangdonghere/UDBGL获取。