University of the Basque Country UPV/EHU, San Sebastian, Spain; IKERBASQUE, Basque Foundation for Science, Bilbao, Spain; Ho Chi Minh City Open University, Ho Chi Minh City, Viet Nam.
University of the Basque Country UPV/EHU, San Sebastian, Spain.
Neural Netw. 2024 May;173:106197. doi: 10.1016/j.neunet.2024.106197. Epub 2024 Feb 23.
Recently, clustering data collected from various sources has become a hot topic in real-world applications. The most common methods for multi-view clustering can be divided into several categories: Spectral clustering algorithms, subspace multi-view clustering algorithms, matrix factorization approaches, and kernel methods. Despite the high performance of these methods, they directly fuse all similarity matrices of all views and separate the affinity learning process from the multiview clustering process. The performance of these algorithms can be affected by noisy affinity matrices. To overcome this drawback, this paper presents a novel method called One Step Multi-view Clustering via Consensus Graph Learning and Nonnegative Embedding (OSMGNE). Instead of directly merging the similarity matrices of different views, which may contain noise, a step of learning a consensus similarity matrix is performed. This step forces the similarity matrices of different views to be too similar, which eliminates the problem of noisy data. Moreover, the use of the nonnegative embedding matrix (soft cluster assignment matrix makes it possible to directly obtain the final clustering result without any extra step. The proposed method can solve five subtasks simultaneously. It jointly estimates the similarity matrix of all views, the similarity matrix of each view, the corresponding spectral projection matrix, the unified clustering indicator matrix, and automatically gives the weight of each view without the use of hyper-parameters. In addition, another version of our method is also studied in this paper. This method differs from the first one by using a consensus spectral projection matrix and a consensus Laplacian matrix over all views. An iterative algorithm is proposed to solve the optimization problem of these two methods. The two proposed methods are tested on several real datasets, which prove their superiority.
最近,从各种来源收集的数据聚类已成为现实应用中的热门话题。多视图聚类的最常见方法可以分为几类:谱聚类算法、子空间多视图聚类算法、矩阵分解方法和核方法。尽管这些方法性能很高,但它们直接融合了所有视图的相似性矩阵,并将亲和学习过程与多视图聚类过程分开。这些算法的性能可能会受到噪声相似性矩阵的影响。为了克服这一缺点,本文提出了一种新的方法,称为通过一致图学习和非负嵌入的一步多视图聚类(OSMGNE)。该方法不是直接合并不同视图的相似性矩阵,这些矩阵可能包含噪声,而是执行学习一致相似性矩阵的步骤。该步骤迫使不同视图的相似性矩阵过于相似,从而消除了噪声数据的问题。此外,使用非负嵌入矩阵(软聚类分配矩阵)可以直接获得最终聚类结果,而无需任何额外步骤。所提出的方法可以同时解决五个子任务。它联合估计所有视图的相似性矩阵、每个视图的相似性矩阵、相应的谱投影矩阵、统一聚类指示矩阵,并自动为每个视图分配权重,而无需使用超参数。此外,本文还研究了我们方法的另一个版本。该方法与第一个方法的不同之处在于使用了所有视图的一致谱投影矩阵和一致拉普拉斯矩阵。提出了一种迭代算法来解决这两种方法的优化问题。在几个真实数据集上对所提出的两种方法进行了测试,证明了它们的优越性。