Ding Jundi, Ma Runing, Chen Songcan
Department of Computer Science and Engineering, Nanijing University of Aeronautics and Astronautics, Nanjing, China.
IEEE Trans Image Process. 2008 Feb;17(2):204-16. doi: 10.1109/TIP.2007.912918.
This paper presents a connected coherence tree algorithm (CCTA) for image segmentation with no prior knowledge. It aims to find regions of semantic coherence based on the proposed epsilon-neighbor coherence segmentation criterion. More specifically, with an adaptive spatial scale and an appropriate intensity-difference scale, CCTA often achieves several sets of coherent neighboring pixels which maximize the probability of being a single image content (including kinds of complex backgrounds). In practice, each set of coherent neighboring pixels corresponds to a coherence class (CC). The fact that each CC just contains a single equivalence class (EC) ensures the separability of an arbitrary image theoretically. In addition, the resultant CCs are represented by tree-based data structures, named connected coherence tree (CCT)s. In this sense, CCTA is a graph-based image analysis algorithm, which expresses three advantages: 1) its fundamental idea, epsilon-neighbor coherence segmentation criterion, is easy to interpret and comprehend; 2) it is efficient due to a linear computational complexity in the number of image pixels; 3) both subjective comparisons and objective evaluation have shown that it is effective for the tasks of semantic object segmentation and figure-ground separation in a wide variety of images. Those images either contain tiny, long and thin objects or are severely degraded by noise, uneven lighting, occlusion, poor illumination, and shadow.
本文提出了一种无需先验知识的用于图像分割的连通相干树算法(CCTA)。它旨在基于所提出的ε邻域相干分割准则找到语义相干区域。更具体地说,通过自适应空间尺度和适当的强度差尺度,CCTA通常能得到几组相干相邻像素,这些像素使属于单个图像内容(包括各种复杂背景)的概率最大化。在实际应用中,每组相干相邻像素对应一个相干类(CC)。每个CC仅包含一个等价类(EC)这一事实从理论上确保了任意图像的可分离性。此外,所得的CC由基于树的数据结构表示,称为连通相干树(CCT)。从这个意义上讲,CCTA是一种基于图的图像分析算法,它具有三个优点:1)其基本思想,即ε邻域相干分割准则,易于解释和理解;2)由于其计算复杂度与图像像素数量呈线性关系,所以效率高;3)主观比较和客观评估均表明,它对于各种图像中的语义对象分割和前景 - 背景分离任务均有效。这些图像要么包含微小、细长的物体,要么因噪声、光照不均匀、遮挡、照明不佳和阴影而严重退化。