IEEE Trans Image Process. 2014 Jun;23(6):2514-27. doi: 10.1109/TIP.2014.2317986. Epub 2014 Apr 17.
Incorporating image classification into image retrieval system brings many attractive advantages. For instance, the search space can be narrowed down by rejecting images in irrelevant categories of the query. The retrieved images can be more consistent in semantics by indexing and returning images in the relevant categories together. However, due to their different goals on recognition accuracy and retrieval scalability, it is hard to efficiently incorporate most image classification works into large-scale image search. To study this problem, we propose cascade category-aware visual search, which utilizes weak category clue to achieve better retrieval accuracy, efficiency, and memory consumption. To capture the category and visual clues of an image, we first learn category-visual words, which are discriminative and repeatable local features labeled with categories. By identifying category-visual words in database images, we are able to discard noisy local features and extract image visual and category clues, which are hence recorded in a hierarchical index structure. Our retrieval system narrows down the search space by: 1) filtering the noisy local features in query; 2) rejecting irrelevant categories in database; and 3) preforming discriminative visual search in relevant categories. The proposed algorithm is tested on object search, landmark search, and large-scale similar image search on the large-scale LSVRC10 data set. Although the category clue introduced is weak, our algorithm still shows substantial advantages in retrieval accuracy, efficiency, and memory consumption than the state-of-the-art.
将图像分类纳入图像检索系统带来了许多吸引人的优势。例如,可以通过拒绝查询中不相关类别的图像来缩小搜索空间。通过索引并返回相关类别的图像,可以使检索到的图像在语义上更加一致。然而,由于它们在识别准确性和检索可扩展性方面的目标不同,将大多数图像分类工作有效地纳入大规模图像搜索是很困难的。为了研究这个问题,我们提出了级联类别感知视觉搜索,它利用弱类别线索来实现更好的检索准确性、效率和内存消耗。为了捕获图像的类别和视觉线索,我们首先学习类别视觉词,这是具有类别标签的有区分性和可重复性的局部特征。通过在数据库图像中识别类别视觉词,我们能够丢弃噪声局部特征,并提取图像视觉和类别线索,这些线索将被记录在分层索引结构中。我们的检索系统通过以下方式缩小搜索空间:1)过滤查询中的噪声局部特征;2)拒绝数据库中不相关的类别;3)在相关类别中进行有区分的视觉搜索。该算法在大型 LSVRC10 数据集上的物体搜索、地标搜索和大规模相似图像搜索中进行了测试。尽管引入的类别线索较弱,但与最先进的方法相比,我们的算法在检索准确性、效率和内存消耗方面仍然具有显著优势。