Xie Yuan, Liu Jinyan, Qu Yanyun, Tao Dacheng, Zhang Wensheng, Dai Longquan, Ma Lizhuang
IEEE Trans Neural Netw Learn Syst. 2021 Feb;32(2):868-881. doi: 10.1109/TNNLS.2020.2979685. Epub 2021 Feb 4.
In this article, we propose a multiview self-representation model for nonlinear subspaces clustering. By assuming that the heterogeneous features lie within the union of multiple linear subspaces, the recent multiview subspace learning methods aim to capture the complementary and consensus from multiple views to boost the performance. However, in real-world applications, data feature usually resides in multiple nonlinear subspaces, leading to undesirable results. To this end, we propose a kernelized version of tensor-based multiview subspace clustering, which is referred to as Kt-SVD-MSC, to jointly learn self-representation coefficients in mapped high-dimensional spaces and multiple views correlation in unified tensor space. In view-specific feature space, a kernel-induced mapping is introduced for each view to ensure the separability of self-representation coefficients. In unified tensor space, a new kind of tensor low-rank regularizer is employed on the rotated self-representation coefficient tensor to preserve the global consistency across different views. We also derive an algorithm to efficiently solve the optimization problem with all the subproblems having closed-form solutions. Furthermore, by incorporating the nonnegative and sparsity constraints, the proposed method can be easily extended to a useful variant, meaning that several useful variants can be easily constructed in a similar way. Extensive experiments of the proposed method are tested on eight challenging data sets, in which a significant (even a breakthrough) advance over state-of-the-art multiview clustering is achieved.
在本文中,我们提出了一种用于非线性子空间聚类的多视图自表示模型。通过假设异构特征位于多个线性子空间的并集中,最近的多视图子空间学习方法旨在从多个视图中捕捉互补性和一致性,以提高性能。然而,在实际应用中,数据特征通常存在于多个非线性子空间中,从而导致不理想的结果。为此,我们提出了一种基于张量的多视图子空间聚类的核化版本,称为Kt-SVD-MSC,以在映射的高维空间中联合学习自表示系数,并在统一的张量空间中学习多视图相关性。在特定视图特征空间中,为每个视图引入核诱导映射,以确保自表示系数的可分离性。在统一张量空间中,对旋转后的自表示系数张量采用一种新型的张量低秩正则化器,以保持不同视图之间的全局一致性。我们还推导了一种算法来有效解决优化问题,所有子问题都有闭式解。此外,通过纳入非负和稀疏约束,所提出的方法可以很容易地扩展为一个有用的变体,这意味着可以以类似的方式轻松构建几个有用的变体。在所提出的方法在八个具有挑战性的数据集上进行了广泛的实验,在这些实验中,相对于现有的多视图聚类方法取得了显著(甚至是突破性)的进展。