BeingThere Centre, Institute for Media Innovation, Nanyang Technological University, Singapore.
IEEE Trans Image Process. 2013 Oct;22(10):4019-27. doi: 10.1109/TIP.2013.2268973. Epub 2013 Jun 14.
This paper considers the problem of automatically segmenting an image into a small number of regions that correspond to objects conveying semantics or high-level structure. Although such object-level segmentation usually requires additional high-level knowledge or learning process, we explore what low level cues can produce for this purpose. Our idea is to construct a feature vector for each pixel, which elaborately integrates spectral attributes, color Gaussian mixture models, and geodesic distance, such that it encodes global color and spatial cues as well as global structure information. Then, we formulate the Potts variational model in terms of the feature vectors to provide a variational image segmentation algorithm that is performed in the feature space. We also propose a heuristic approach to automatically select the number of segments. The use of feature attributes enables the Potts model to produce regions that are coherent in color and position, comply with global structures corresponding to objects or parts of objects and meanwhile maintain a smooth and accurate boundary. We demonstrate the effectiveness of our algorithm against the state-of-the-art with the data set from the famous Berkeley benchmark.
本文考虑了将图像自动分割成对应于语义或高层结构的少量区域的问题。尽管这种对象级别的分割通常需要额外的高级知识或学习过程,但我们探索了低层次线索可以为此目的产生什么。我们的想法是为每个像素构建一个特征向量,它精心整合了光谱属性、颜色高斯混合模型和测地距离,以便编码全局颜色和空间线索以及全局结构信息。然后,我们根据特征向量来制定 Potts 变分模型,以提供在特征空间中执行的变分图像分割算法。我们还提出了一种启发式方法来自动选择片段的数量。特征属性的使用使 Potts 模型能够生成在颜色和位置上一致的区域,符合对应于对象或对象部分的全局结构,同时保持平滑和准确的边界。我们使用著名的伯克利基准数据集展示了我们的算法与最先进算法相比的有效性。