Wan Xinhang, Liu Jiyuan, Gan Xinbiao, Liu Xinwang, Wang Siwei, Wen Yi, Wan Tianjiao, Zhu En
IEEE Trans Neural Netw Learn Syst. 2025 Mar;36(3):5774-5786. doi: 10.1109/TNNLS.2024.3378194. Epub 2025 Feb 28.
Multi-View clustering has attracted broad attention due to its capacity to utilize consistent and complementary information among views. Although tremendous progress has been made recently, most existing methods undergo high complexity, preventing them from being applied to large-scale tasks. Multi-View clustering via matrix factorization is a representative to address this issue. However, most of them map the data matrices into a fixed dimension, limiting the model's expressiveness. Moreover, a range of methods suffers from a two-step process, i.e., multimodal learning and the subsequent k-means, inevitably causing a suboptimal clustering result. In light of this, we propose a one-step multi-view clustering with diverse representation (OMVCDR) method, which incorporates multi-view learning and k-means into a unified framework. Specifically, we first project original data matrices into various latent spaces to attain comprehensive information and auto-weight them in a self-supervised manner. Then, we directly use the information matrices under diverse dimensions to obtain consensus discrete clustering labels. The unified work of representation learning and clustering boosts the quality of the final results. Furthermore, we develop an efficient optimization algorithm with proven convergence to solve the resultant problem. Comprehensive experiments on various datasets demonstrate the promising clustering performance of our proposed method. The code is publicly available at https://github.com/wanxinhang/OMVCDR.
多视图聚类因其能够利用视图间的一致和互补信息而受到广泛关注。尽管近年来取得了巨大进展,但大多数现有方法的复杂度较高,阻碍了它们应用于大规模任务。基于矩阵分解的多视图聚类是解决这一问题的一种代表性方法。然而,它们中的大多数将数据矩阵映射到固定维度,限制了模型的表现力。此外,一系列方法存在两步过程,即多模态学习和随后的k均值,不可避免地导致次优聚类结果。鉴于此,我们提出了一种具有多样表示的一步多视图聚类(OMVCDR)方法,该方法将多视图学习和k均值纳入一个统一框架。具体来说,我们首先将原始数据矩阵投影到各种潜在空间中以获取全面信息,并以自监督方式对其进行自动加权。然后,我们直接使用不同维度下的信息矩阵来获得一致的离散聚类标签。表示学习和聚类的统一工作提高了最终结果的质量。此外,我们开发了一种具有收敛性证明的高效优化算法来解决由此产生的问题。在各种数据集上的综合实验证明了我们提出的方法具有良好的聚类性能。代码可在https://github.com/wanxinhang/OMVCDR上公开获取。