Tang Jiayi, Zhao Long, Liu Xinwang
IEEE Trans Neural Netw Learn Syst. 2025 May;36(5):8317-8330. doi: 10.1109/TNNLS.2024.3424464. Epub 2025 May 2.
In contrast to traditional single-view clustering methods, multiview clustering (MVC) approaches aim to extract, analyze, and integrate structural information from diverse perspectives, providing a more comprehensive understanding of internal data structures. However, with an increasing number of views, maintaining the integrity of view information becomes challenging, giving rise to incomplete MVC (IMVC) methods. While existing IMVC methods have shown notable performance on many incomplete multiview (IMV) databases, they still grapple with two key shortcomings: 1) they treat the information of each view as a whole, disregarding the differences among samples within each view; and 2) they rely on eigenvalue and eigenvector operations on the view matrix, limiting their scalability for large-scale samples and views. To overcome these limitations, we propose a novel multiview clustering with consistent information (IMVC-CI) of sample points. Our method explores the multiview information set of sample points to extract consensus structural information and subsequently restores unknown information in each view. Importantly, our approach operates independently on individual sample points, eliminating the need for eigenvalue and eigenvector operations on the view information matrix and facilitating parallel computation. This significantly enhances algorithmic efficiency and mitigates challenges associated with dimensionality. Experimental results on various public datasets demonstrate that our algorithm outperforms state-of-the-art IMVC methods in terms of clustering performance and computational efficiency. The code for our article has been uploaded to https://github.com/PhdJiayiTang/IMVC-CI.git.
与传统的单视图聚类方法不同,多视图聚类(MVC)方法旨在从不同角度提取、分析和整合结构信息,从而更全面地理解内部数据结构。然而,随着视图数量的增加,保持视图信息的完整性变得具有挑战性,由此产生了不完全多视图聚类(IMVC)方法。虽然现有的IMVC方法在许多不完全多视图(IMV)数据库上表现出显著性能,但它们仍存在两个关键缺点:1)它们将每个视图的信息视为一个整体,忽略了每个视图内样本之间的差异;2)它们依赖于对视图矩阵进行特征值和特征向量运算,限制了其对大规模样本和视图的可扩展性。为克服这些限制,我们提出了一种具有样本点一致信息的新型多视图聚类方法(IMVC-CI)。我们的方法探索样本点的多视图信息集以提取一致的结构信息,随后恢复每个视图中的未知信息。重要的是,我们的方法在单个样本点上独立运行,无需对视图信息矩阵进行特征值和特征向量运算,并便于并行计算。这显著提高了算法效率并减轻了与维度相关的挑战。在各种公共数据集上的实验结果表明,我们的算法在聚类性能和计算效率方面优于现有最先进的IMVC方法。我们文章的代码已上传至https://github.com/PhdJiayiTang/IMVC-CI.git。