Do Thanh-Toan, Le Khoa, Hoang Tuan, Le Huu, Nguyen Tam V, Cheung Ngai-Man
IEEE Trans Image Process. 2019 Oct;28(10):4954-4969. doi: 10.1109/TIP.2019.2913509. Epub 2019 May 8.
Representing images by compact hash codes is an attractive approach for large-scale content-based image retrieval. In most state-of-the-art hashing-based image retrieval systems, for each image, local descriptors are first aggregated as a global representation vector. This global vector is then subjected to a hashing function to generate a binary hash code. In previous works, the aggregating and the hashing processes are designed independently. Hence, these frameworks may generate suboptimal hash codes. In this paper, we first propose a novel unsupervised hashing framework in which feature aggregating and hashing are designed simultaneously and optimized jointly. Specifically, our joint optimization generates aggregated representations that can be better reconstructed by some binary codes. This leads to more discriminative binary hash codes and improved retrieval accuracy. In addition, the proposed method is flexible. It can be extended for supervised hashing. When the data label is available, the framework can be adapted to learn binary codes which minimize the reconstruction loss with respect to label vectors. Furthermore, we also propose a fast version of the state-of-the-art hashing method Binary Autoencoder to be used in our proposed frameworks. Extensive experiments on benchmark datasets under various settings show that the proposed methods outperform the state-of-the-art unsupervised and supervised hashing methods.
用紧凑哈希码表示图像是大规模基于内容的图像检索的一种有吸引力的方法。在大多数基于哈希的最先进图像检索系统中,对于每幅图像,局部描述符首先被聚合为一个全局表示向量。然后,这个全局向量经过一个哈希函数以生成一个二进制哈希码。在以往的工作中,聚合和哈希过程是独立设计的。因此,这些框架可能会生成次优的哈希码。在本文中,我们首先提出了一种新颖的无监督哈希框架,其中特征聚合和哈希是同时设计并联合优化的。具体而言,我们的联合优化生成的聚合表示可以通过一些二进制码更好地重建。这导致了更具判别力的二进制哈希码和更高的检索准确率。此外,所提出的方法具有灵活性。它可以扩展用于有监督哈希。当数据标签可用时,该框架可以进行调整以学习相对于标签向量使重建损失最小化的二进制码。此外,我们还提出了一种最先进的哈希方法二进制自动编码器的快速版本,用于我们提出的框架。在各种设置下对基准数据集进行的大量实验表明,所提出的方法优于最先进的无监督和有监督哈希方法。