Wen Yi, Wang Siwei, Liao Qing, Liang Weixuan, Liang Ke, Wan Xinhang, Liu Xinwang
IEEE Trans Neural Netw Learn Syst. 2024 Nov;35(11):16049-16063. doi: 10.1109/TNNLS.2023.3291696. Epub 2024 Oct 29.
Multi-view clustering (MVC), which effectively fuses information from multiple views for better performance, has received increasing attention. Most existing MVC methods assume that multi-view data are fully paired, which means that the mappings of all corresponding samples between views are predefined or given in advance. However, the data correspondence is often incomplete in real-world applications due to data corruption or sensor differences, referred to as the data-unpaired problem (DUP) in multi-view literature. Although several attempts have been made to address the DUP issue, they suffer from the following drawbacks: 1) most methods focus on the feature representation while ignoring the structural information of multi-view data, which is essential for clustering tasks; 2) existing methods for partially unpaired problems rely on pregiven cross-view alignment information, resulting in their inability to handle fully unpaired problems; and 3) their inevitable parameters degrade the efficiency and applicability of the models. To tackle these issues, we propose a novel parameter-free graph clustering framework termed unpaired multi-view graph clustering framework with cross-view structure matching (UPMGC-SM). Specifically, unlike the existing methods, UPMGC-SM effectively utilizes the structural information from each view to refine cross-view correspondences. Besides, our UPMGC-SM is a unified framework for both the fully and partially unpaired multi-view graph clustering. Moreover, existing graph clustering methods can adopt our UPMGC-SM to enhance their ability for unpaired scenarios. Extensive experiments demonstrate the effectiveness and generalization of our proposed framework for both paired and unpaired datasets.
多视图聚类(MVC)能够有效地融合来自多个视图的信息以获得更好的性能,因此受到了越来越多的关注。大多数现有的MVC方法都假设多视图数据是完全配对的,这意味着视图之间所有对应样本的映射都是预先定义或给定的。然而,在实际应用中,由于数据损坏或传感器差异,数据对应关系往往是不完整的,在多视图文献中被称为数据未配对问题(DUP)。尽管已经进行了几次尝试来解决DUP问题,但它们存在以下缺点:1)大多数方法侧重于特征表示,而忽略了多视图数据的结构信息,而这对于聚类任务至关重要;2)现有的部分未配对问题的方法依赖于预先给定的跨视图对齐信息,导致它们无法处理完全未配对的问题;3)它们不可避免的参数降低了模型的效率和适用性。为了解决这些问题,我们提出了一种新颖的无参数图聚类框架,称为具有跨视图结构匹配的未配对多视图图聚类框架(UPMGC-SM)。具体来说,与现有方法不同,UPMGC-SM有效地利用每个视图的结构信息来细化跨视图对应关系。此外,我们的UPMGC-SM是一个用于完全和部分未配对多视图图聚类的统一框架。而且,现有的图聚类方法可以采用我们的UPMGC-SM来增强它们在未配对场景下的能力。大量实验证明了我们提出的框架对于配对和未配对数据集的有效性和通用性。