IEEE Trans Image Process. 2017 Jul;26(7):3317-3330. doi: 10.1109/TIP.2017.2651389. Epub 2017 Jan 11.
In this paper, we present a superpixel segmentation algorithm called linear spectral clustering (LSC), which is capable of producing superpixels with both high boundary adherence and visual compactness for natural images with low computational costs. In LSC, a normalized cuts-based formulation of image segmentation is adopted using a distance metric that measures both the color similarity and the space proximity between image pixels. However, rather than directly using the traditional eigen-based algorithm, we approximate the similarity metric through a deliberately designed kernel function such that pixel values can be explicitly mapped to a high-dimensional feature space. We then apply the conclusion that by appropriately weighting each point in this feature space, the objective functions of the weighted K-means and the normalized cuts share the same optimum points. Consequently, it is possible to optimize the cost function of the normalized cuts by iteratively applying simple K-means clustering in the proposed feature space. LSC possesses linear computational complexity and high memory efficiency, since it avoids both the decomposition of the affinity matrix and the generation of the large kernel matrix. By utilizing the underlying mathematical equivalence between the two types of seemingly different methods, LSC successfully preserves global image structures through efficient local operations. Experimental results show that LSC performs as well as or even better than the state-of-the-art superpixel segmentation algorithms in terms of several commonly used evaluation metrics in image segmentation. The applicability of LSC is further demonstrated in two related computer vision tasks.
在本文中,我们提出了一种称为线性光谱聚类(LSC)的超像素分割算法,它能够以较低的计算成本为自然图像生成具有高边界一致性和视觉紧凑性的超像素。在 LSC 中,采用基于归一化割的图像分割公式,使用一种距离度量来度量图像像素之间的颜色相似性和空间接近度。然而,我们不是直接使用传统的基于特征的算法,而是通过精心设计的核函数来近似相似性度量,使得像素值可以明确地映射到高维特征空间。然后我们得出结论,通过适当地为这个特征空间中的每个点加权,加权 K-均值和归一化割的目标函数具有相同的最优点。因此,通过在提出的特征空间中迭代应用简单的 K-均值聚类,可以优化归一化割的代价函数。LSC 具有线性计算复杂度和高内存效率,因为它避免了相似矩阵的分解和大核矩阵的生成。通过利用两种看似不同的方法之间的基础数学等价性,LSC 通过有效的局部操作成功地保留了全局图像结构。实验结果表明,在图像分割中常用的几种评估指标方面,LSC 的性能与最先进的超像素分割算法一样好,甚至更好。LSC 的适用性在两个相关的计算机视觉任务中得到了进一步证明。