Li Zhihui, Nie Feiping, Chang Xiaojun, Yang Yi, Zhang Chengqi, Sebe Nicu
IEEE Trans Neural Netw Learn Syst. 2018 Dec;29(12):6323-6332. doi: 10.1109/TNNLS.2018.2829867. Epub 2018 May 18.
Spectral clustering (SC) has been widely applied to various computer vision tasks, where the key is to construct a robust affinity matrix for data partitioning. With the increase in visual features, conventional SC methods are facing two challenges: 1) how to effectively generate an affinity matrix based on multiple features? and 2) how to deal with high-dimensional visual features which could be redundant? To address these issues mentioned earlier, we present a new approach to: 1) learn a robust affinity matrix using multiple features, allowing us to simultaneously determine optimal weights for each feature; and 2) decide a set of optimal projection matrixes, one for each feature, that decide the lower dimensional space, as well as the optimal affinity weight of each data pair in the lower dimensional space. There are two major advantages of our new approach over the existing clustering techniques. First, our approach assigns affinity weights for data points on a per-data-pair basis. The learning procedure avoids the explicit specification of the size of the neighborhood in the affinity matrix, and the bandwidth parameter required to compute the Gaussian kernel, both of which are sensitive and yet difficult to determine beforehand. Second, the affinity weights are based on the distances in a lower dimensional space, while the low-dimensional space is inferred according to the optimized affinity weights. Both variables are jointly optimized so as to leverage mutual benefits. The experimental results outperform the compared alternatives, which indicate that the proposed method is effective in simultaneously learning the affinity graph and feature fusion, resulting in better clustering results.
谱聚类(SC)已被广泛应用于各种计算机视觉任务,其中关键在于构建一个用于数据划分的鲁棒亲和矩阵。随着视觉特征的增加,传统的谱聚类方法面临两个挑战:1)如何基于多个特征有效地生成亲和矩阵?2)如何处理可能冗余的高维视觉特征?为了解决上述问题,我们提出了一种新方法:1)使用多个特征学习一个鲁棒的亲和矩阵,使我们能够同时为每个特征确定最优权重;2)确定一组最优投影矩阵,每个特征一个,用于确定低维空间以及低维空间中每个数据对的最优亲和权重。与现有的聚类技术相比,我们的新方法有两个主要优点。首先,我们的方法基于每个数据对为数据点分配亲和权重。学习过程避免了在亲和矩阵中明确指定邻域大小以及计算高斯核所需的带宽参数,这两者都很敏感且难以预先确定。其次,亲和权重基于低维空间中的距离,而低维空间是根据优化后的亲和权重推断出来的。这两个变量共同优化以实现互利。实验结果优于比较的其他方法,表明所提出的方法在同时学习亲和图和特征融合方面是有效的,从而产生更好的聚类结果。