级联类别感知视觉搜索。

Cascade category-aware visual search.

出版信息

IEEE Trans Image Process. 2014 Jun;23(6):2514-27. doi: 10.1109/TIP.2014.2317986. Epub 2014 Apr 17.

DOI:10.1109/TIP.2014.2317986

Abstract

Incorporating image classification into image retrieval system brings many attractive advantages. For instance, the search space can be narrowed down by rejecting images in irrelevant categories of the query. The retrieved images can be more consistent in semantics by indexing and returning images in the relevant categories together. However, due to their different goals on recognition accuracy and retrieval scalability, it is hard to efficiently incorporate most image classification works into large-scale image search. To study this problem, we propose cascade category-aware visual search, which utilizes weak category clue to achieve better retrieval accuracy, efficiency, and memory consumption. To capture the category and visual clues of an image, we first learn category-visual words, which are discriminative and repeatable local features labeled with categories. By identifying category-visual words in database images, we are able to discard noisy local features and extract image visual and category clues, which are hence recorded in a hierarchical index structure. Our retrieval system narrows down the search space by: 1) filtering the noisy local features in query; 2) rejecting irrelevant categories in database; and 3) preforming discriminative visual search in relevant categories. The proposed algorithm is tested on object search, landmark search, and large-scale similar image search on the large-scale LSVRC10 data set. Although the category clue introduced is weak, our algorithm still shows substantial advantages in retrieval accuracy, efficiency, and memory consumption than the state-of-the-art.

摘要

将图像分类纳入图像检索系统带来了许多吸引人的优势。例如，可以通过拒绝查询中不相关类别的图像来缩小搜索空间。通过索引并返回相关类别的图像，可以使检索到的图像在语义上更加一致。然而，由于它们在识别准确性和检索可扩展性方面的目标不同，将大多数图像分类工作有效地纳入大规模图像搜索是很困难的。为了研究这个问题，我们提出了级联类别感知视觉搜索，它利用弱类别线索来实现更好的检索准确性、效率和内存消耗。为了捕获图像的类别和视觉线索，我们首先学习类别视觉词，这是具有类别标签的有区分性和可重复性的局部特征。通过在数据库图像中识别类别视觉词，我们能够丢弃噪声局部特征，并提取图像视觉和类别线索，这些线索将被记录在分层索引结构中。我们的检索系统通过以下方式缩小搜索空间：1）过滤查询中的噪声局部特征；2）拒绝数据库中不相关的类别；3）在相关类别中进行有区分的视觉搜索。该算法在大型 LSVRC10 数据集上的物体搜索、地标搜索和大规模相似图像搜索中进行了测试。尽管引入的类别线索较弱，但与最先进的方法相比，我们的算法在检索准确性、效率和内存消耗方面仍然具有显著优势。

相似文献

Cascade category-aware visual search.级联类别感知视觉搜索。

IEEE Trans Image Process. 2014 Jun;23(6):2514-27. doi: 10.1109/TIP.2014.2317986. Epub 2014 Apr 17.

Semantic-Aware Co-Indexing for Image Retrieval.基于语义感知的图像检索协同索引。

IEEE Trans Pattern Anal Mach Intell. 2015 Dec;37(12):2573-87. doi: 10.1109/TPAMI.2015.2417573.

Generating descriptive visual words and visual phrases for large-scale image applications.生成描述性视觉词汇和视觉短语，用于大规模图像应用。

IEEE Trans Image Process. 2011 Sep;20(9):2664-77. doi: 10.1109/TIP.2011.2128333. Epub 2011 Mar 17.

A boosting framework for visuality-preserving distance metric learning and its application to medical image retrieval.一种保持视觉保真度的距离度量学习的提升框架及其在医学图像检索中的应用。

IEEE Trans Pattern Anal Mach Intell. 2010 Jan;32(1):30-44. doi: 10.1109/TPAMI.2008.273.

IntentSearch: Capturing User Intention for One-Click Internet Image Search.意图搜索：实现一键式互联网图像搜索中的用户意图捕获。

IEEE Trans Pattern Anal Mach Intell. 2012 Jul;34(7):1342-53. doi: 10.1109/TPAMI.2011.242. Epub 2011 Dec 13.

Instance-Aware Hashing for Multi-Label Image Retrieval.实例感知哈希多标签图像检索。

IEEE Trans Image Process. 2016 Jun;25(6):2469-79. doi: 10.1109/TIP.2016.2545300. Epub 2016 Mar 22.

A statistical framework for image category search from a mental picture.一种基于心理图像进行图像类别搜索的统计框架。

IEEE Trans Pattern Anal Mach Intell. 2009 Jun;31(6):1087-101. doi: 10.1109/TPAMI.2008.259.

BSIFT: toward data-independent codebook for large scale image search.BSIFT：面向大规模图像搜索的与数据无关的码本。

IEEE Trans Image Process. 2015 Mar;24(3):967-79. doi: 10.1109/TIP.2015.2389624. Epub 2015 Jan 9.

Spatially-Constrained Similarity Measurefor Large-Scale Object Retrieval.基于空间约束的大规模目标检索相似度度量方法

IEEE Trans Pattern Anal Mach Intell. 2014 Jun;36(6):1229-41. doi: 10.1109/TPAMI.2013.237.

Modeling semantic aspects for cross-media image indexing.跨媒体图像索引的语义方面建模

IEEE Trans Pattern Anal Mach Intell. 2007 Oct;29(10):1802-17. doi: 10.1109/TPAMI.2007.1097.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验

级联类别感知视觉搜索。

Cascade category-aware visual search.

出版信息

相似文献

文献检索

文件翻译

深度研究

Suppr 超能文献

相似文献