Li Xuelong, Zhang Han, Wang Rong, Nie Feiping
IEEE Trans Pattern Anal Mach Intell. 2022 Jan;44(1):330-344. doi: 10.1109/TPAMI.2020.3011148. Epub 2021 Dec 7.
Multiview clustering partitions data into different groups according to their heterogeneous features. Most existing methods degenerate the applicability of models due to their intractable hyper-parameters triggered by various regularization terms. Moreover, traditional spectral based methods always encounter the expensive time overheads and fail in exploring the explicit clusters from graphs. In this paper, we present a scalable and parameter-free graph fusion framework for multiview clustering, seeking for a joint graph compatible across multiple views in a self-supervised weighting manner. Our formulation coalesces multiple view-wise graphs straightforward and learns the weights as well as the joint graph interactively, which could actively release the model from any weight-related hyper-parameters. Meanwhile, we manipulate the joint graph by a connectivity constraint such that the connected components indicate clusters directly. The designed algorithm is initialization-independent and time-economical which obtains the stable performance and scales well with the data size. Substantial experiments on toy data as well as real datasets are conducted that verify the superiority of the proposed method compared to the state-of-the-arts over the clustering performance and time expenditure.
多视图聚类根据数据的异构特征将其划分为不同的组。由于各种正则化项引发的难以处理的超参数,大多数现有方法降低了模型的适用性。此外,传统的基于谱的方法总是面临高昂的时间开销,并且无法从图中探索出明确的聚类。在本文中,我们提出了一种用于多视图聚类的可扩展且无参数的图融合框架,以自监督加权的方式寻找跨多个视图兼容的联合图。我们的公式直接合并多个视图的图,并以交互方式学习权重以及联合图,这可以使模型从任何与权重相关的超参数中解放出来。同时,我们通过连通性约束来处理联合图,使得连通分量直接表示聚类。所设计的算法与初始化无关且节省时间,具有稳定的性能并且随着数据规模的增大能很好地扩展。我们在玩具数据以及真实数据集上进行了大量实验,验证了所提方法在聚类性能和时间开销方面相对于现有最先进方法的优越性。