Suppr超能文献

级联类别感知视觉搜索。

Cascade category-aware visual search.

出版信息

IEEE Trans Image Process. 2014 Jun;23(6):2514-27. doi: 10.1109/TIP.2014.2317986. Epub 2014 Apr 17.

Abstract

Incorporating image classification into image retrieval system brings many attractive advantages. For instance, the search space can be narrowed down by rejecting images in irrelevant categories of the query. The retrieved images can be more consistent in semantics by indexing and returning images in the relevant categories together. However, due to their different goals on recognition accuracy and retrieval scalability, it is hard to efficiently incorporate most image classification works into large-scale image search. To study this problem, we propose cascade category-aware visual search, which utilizes weak category clue to achieve better retrieval accuracy, efficiency, and memory consumption. To capture the category and visual clues of an image, we first learn category-visual words, which are discriminative and repeatable local features labeled with categories. By identifying category-visual words in database images, we are able to discard noisy local features and extract image visual and category clues, which are hence recorded in a hierarchical index structure. Our retrieval system narrows down the search space by: 1) filtering the noisy local features in query; 2) rejecting irrelevant categories in database; and 3) preforming discriminative visual search in relevant categories. The proposed algorithm is tested on object search, landmark search, and large-scale similar image search on the large-scale LSVRC10 data set. Although the category clue introduced is weak, our algorithm still shows substantial advantages in retrieval accuracy, efficiency, and memory consumption than the state-of-the-art.

摘要

将图像分类纳入图像检索系统带来了许多吸引人的优势。例如,可以通过拒绝查询中不相关类别的图像来缩小搜索空间。通过索引并返回相关类别的图像,可以使检索到的图像在语义上更加一致。然而,由于它们在识别准确性和检索可扩展性方面的目标不同,将大多数图像分类工作有效地纳入大规模图像搜索是很困难的。为了研究这个问题,我们提出了级联类别感知视觉搜索,它利用弱类别线索来实现更好的检索准确性、效率和内存消耗。为了捕获图像的类别和视觉线索,我们首先学习类别视觉词,这是具有类别标签的有区分性和可重复性的局部特征。通过在数据库图像中识别类别视觉词,我们能够丢弃噪声局部特征,并提取图像视觉和类别线索,这些线索将被记录在分层索引结构中。我们的检索系统通过以下方式缩小搜索空间:1)过滤查询中的噪声局部特征;2)拒绝数据库中不相关的类别;3)在相关类别中进行有区分的视觉搜索。该算法在大型 LSVRC10 数据集上的物体搜索、地标搜索和大规模相似图像搜索中进行了测试。尽管引入的类别线索较弱,但与最先进的方法相比,我们的算法在检索准确性、效率和内存消耗方面仍然具有显著优势。

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验