Wang Yang, Wu Lin, Lin Xuemin, Gao Junbin
IEEE Trans Neural Netw Learn Syst. 2018 Oct;29(10):4833-4843. doi: 10.1109/TNNLS.2017.2777489. Epub 2018 Jan 4.
Multiview data clustering attracts more attention than their single-view counterparts due to the fact that leveraging multiple independent and complementary information from multiview feature spaces outperforms the single one. Multiview spectral clustering aims at yielding the data partition agreement over their local manifold structures by seeking eigenvalue-eigenvector decompositions. Among all the methods, low-rank representation (LRR) is effective, by exploring the multiview consensus structures beyond the low rankness to boost the clustering performance. However, as we observed, such classical paradigm still suffers from the following stand-out limitations for multiview spectral clustering of overlooking the flexible local manifold structure, caused by aggressively enforcing the low-rank data correlation agreement among all views, and such a strategy, therefore, cannot achieve the satisfied between-views agreement; worse still, LRR is not intuitively flexible to capture the latent data clustering structures. In this paper, first, we present the structured LRR by factorizing into the latent low-dimensional data-cluster representations, which characterize the data clustering structure for each view. Upon such representation, second, the Laplacian regularizer is imposed to be capable of preserving the flexible local manifold structure for each view. Third, we present an iterative multiview agreement strategy by minimizing the divergence objective among all factorized latent data-cluster representations during each iteration of optimization process, where such latent representation from each view serves to regulate those from other views, and such an intuitive process iteratively coordinates all views to be agreeable. Fourth, we remark that such data-cluster representation can flexibly encode the data clustering structure from any view with an adaptive input cluster number. To this end, finally, a novel nonconvex objective function is proposed via the efficient alternating minimization strategy. The complexity analysis is also presented. The extensive experiments conducted against the real-world multiview data sets demonstrate the superiority over the state of the arts.
多视图数据聚类比单视图数据聚类更受关注,因为利用多视图特征空间中的多个独立且互补的信息比单一信息表现更优。多视图谱聚类旨在通过寻求特征值 - 特征向量分解,在局部流形结构上达成数据划分的一致性。在所有方法中,低秩表示(LRR)是有效的,它通过探索低秩之外的多视图共识结构来提升聚类性能。然而,正如我们所观察到的,对于多视图谱聚类,这种经典范式仍存在以下突出局限性:由于在所有视图之间强行实施低秩数据相关性一致性,从而忽略了灵活的局部流形结构,因此这种策略无法实现令人满意的视图间一致性;更糟糕的是,LRR在直观上不够灵活,难以捕捉潜在的数据聚类结构。在本文中,首先,我们通过分解为潜在的低维数据聚类表示来提出结构化LRR,这些表示刻画了每个视图的数据聚类结构。基于这种表示,其次,引入拉普拉斯正则化器以能够为每个视图保留灵活的局部流形结构。第三,我们提出一种迭代多视图一致性策略,在优化过程的每次迭代中,通过最小化所有分解后的潜在数据聚类表示之间的散度目标,其中每个视图的这种潜在表示用于调节其他视图的表示,并且这种直观的过程迭代地协调所有视图以达成一致。第四,我们指出这种数据聚类表示可以通过自适应输入聚类数量灵活地编码来自任何视图的数据聚类结构。为此,最后,通过有效的交替最小化策略提出了一种新颖的非凸目标函数。还进行了复杂度分析。针对真实世界多视图数据集进行的大量实验证明了该方法优于现有技术。