用于多媒体检索的多尺度边缘场自动目标提取
Automatic object extraction over multiscale edge field for multimedia retrieval.
作者信息
Kiranyaz Serkan, Ferreira Miguel, Gabbouj Moncef
机构信息
Institute of Signal Processing, Tampere University of Technology, FIN-33101 Tampere, Finland.
出版信息
IEEE Trans Image Process. 2006 Dec;15(12):3759-72. doi: 10.1109/tip.2006.881966.
In this work, we focus on automatic extraction of object boundaries from Canny edge field for the purpose of content-based indexing and retrieval over image and video databases. A multiscale approach is adopted where each successive scale provides further simplification of the image by removing more details, such as texture and noise, while keeping major edges. At each stage of the simplification, edges are extracted from the image and gathered in a scale-map, over which a perceptual subsegment analysis is performed in order to extract true object boundaries. The analysis is mainly motivated by Gestalt laws and our experimental results suggest a promising performance for main objects extraction, even for images with crowded textural edges and objects with color, texture, and illumination variations. Finally, integrating the whole process as feature extraction module into MUVIS framework allows us to test the mutual performance of the proposed object extraction method and subsequent shape description in the context of multimedia indexing and retrieval. A promising retrieval performance is achieved, and especially in some particular examples, the experimental results show that the proposed method presents such a retrieval performance that cannot be achieved by using other features such as color or texture.
在这项工作中,我们专注于从Canny边缘场自动提取物体边界,以便在图像和视频数据库上进行基于内容的索引和检索。我们采用了一种多尺度方法,其中每个连续的尺度通过去除更多细节(如纹理和噪声)来进一步简化图像,同时保留主要边缘。在简化的每个阶段,从图像中提取边缘并收集到一个尺度图中,在该尺度图上进行感知子分割分析,以提取真实的物体边界。该分析主要受格式塔定律的启发,我们的实验结果表明,即使对于具有密集纹理边缘的图像以及具有颜色、纹理和光照变化的物体,该方法在主要物体提取方面也具有良好的性能。最后,将整个过程作为特征提取模块集成到MUVIS框架中,使我们能够在多媒体索引和检索的背景下测试所提出的物体提取方法和后续形状描述的相互性能。我们实现了良好的检索性能,特别是在一些特定示例中,实验结果表明,所提出的方法呈现出使用颜色或纹理等其他特征无法实现的检索性能。