Suppr超能文献

视觉词歧义。

Visual word ambiguity.

机构信息

Département d'Informatique, Ecole Normale Supérieure, Paris, France.

出版信息

IEEE Trans Pattern Anal Mach Intell. 2010 Jul;32(7):1271-83. doi: 10.1109/TPAMI.2009.132.

Abstract

This paper studies automatic image classification by modeling soft assignment in the popular codebook model. The codebook model describes an image as a bag of discrete visual words selected from a vocabulary, where the frequency distributions of visual words in an image allow classification. One inherent component of the codebook model is the assignment of discrete visual words to continuous image features. Despite the clear mismatch of this hard assignment with the nature of continuous features, the approach has been successfully applied for some years. In this paper, we investigate four types of soft assignment of visual words to image features. We demonstrate that explicitly modeling visual word assignment ambiguity improves classification performance compared to the hard assignment of the traditional codebook model. The traditional codebook model is compared against our method for five well-known data sets: 15 natural scenes, Caltech-101, Caltech-256, and Pascal VOC 2007/2008. We demonstrate that large codebook vocabulary sizes completely deteriorate the performance of the traditional model, whereas the proposed model performs consistently. Moreover, we show that our method profits in high-dimensional feature spaces and reaps higher benefits when increasing the number of image categories.

摘要

本文研究了通过对流行码本模型中的软分配进行建模来实现自动图像分类。码本模型将图像描述为从词汇中选择的离散视觉单词的集合,其中图像中的视觉单词的频率分布允许进行分类。码本模型的一个固有组成部分是离散视觉单词到连续图像特征的分配。尽管这种硬分配与连续特征的性质明显不匹配,但该方法已经成功应用了多年。在本文中,我们研究了四种将视觉单词软分配给图像特征的方法。我们证明,与传统码本模型的硬分配相比,显式地对视觉单词分配的模糊性进行建模可以提高分类性能。我们将传统的码本模型与我们的方法在五个著名数据集上进行了比较:15 个自然场景、Caltech-101、Caltech-256 和 Pascal VOC 2007/2008。我们证明,大的码本词汇量会完全降低传统模型的性能,而所提出的模型则始终保持一致。此外,我们还表明,我们的方法在高维特征空间中表现良好,并且随着图像类别数量的增加,会获得更高的收益。

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验