Li Zhuowen, Chen Hongmei, Xiang Biao, Yuan Zhong, Luo Chuan, Horng Shi-Jinn, Li Tianrui
School of Computing and Artificial Intelligence, Southwest Jiaotong University, Chengdu, 611756, China; National Engineering Laboratory of Integrated Transportation Big Data Application Technology, Southwest Jiaotong University, Chengdu, 611756, China; Engineering Research Center of Sustainable Urban Intelligent Transportation, Ministry of Education, Chengdu, 611756, PR China; Manufacturing Industry Chains Collaboration and Information Support Technology Key Laboratory of Sichuan Province, Southwest Jiaotong University, Chengdu, 611756, PR China.
College of Computer Science, Sichuan University, Chengdu, 610065, China.
Neural Netw. 2025 Nov;191:107836. doi: 10.1016/j.neunet.2025.107836. Epub 2025 Jul 7.
Real-world multiview data often suffers from complex missingness problems, leading to significant performance degradation of clustering methods. Existing methods usually focus only on data completion while ignoring inter-view consistency, or the recovery of missing data is unreliable. For this reason, this paper proposes an incomplete multiview clustering algorithm for recovering missing data based on multiview characteristics. Unlike existing methods, the proposed method achieves reliable recovery of missing data and clustering optimization through consistency preservation. First, a latent subspace representation shared among views is constructed, and the local structure of each view is aligned to the global consensus graph through adaptive graph learning to solve the dimensional heterogeneity problem effectively. Second, the clustering metrics of non-missing samples are used to guide the iterative optimization of missing data to ensure the distributional consistency between the complementary data and the existing instances. Finally, a view weight assignment strategy is introduced to adjust the contribution of each view according to its difference from the consensus graph. The model improves the clustering performance synchronously during the data complementation process. Experiments on multiple datasets show the superior performance of the proposed method over various approaches.
现实世界中的多视图数据常常存在复杂的缺失问题,导致聚类方法的性能显著下降。现有方法通常仅关注数据补全,却忽略了视图间的一致性,或者缺失数据的恢复并不可靠。因此,本文提出了一种基于多视图特征的用于恢复缺失数据的不完全多视图聚类算法。与现有方法不同,该方法通过一致性保持实现了缺失数据的可靠恢复和聚类优化。首先,构建视图间共享的潜在子空间表示,并通过自适应图学习将每个视图的局部结构与全局共识图对齐,从而有效解决维度异质性问题。其次,利用非缺失样本的聚类指标来指导缺失数据的迭代优化,以确保补充数据与现有实例之间的分布一致性。最后,引入视图权重分配策略,根据每个视图与共识图的差异来调整其贡献。该模型在数据补全过程中同步提高了聚类性能。在多个数据集上的实验表明,该方法相较于各种方法具有优越的性能。