Deng Cheng, Yang Erkun, Liu Tongliang, Li Jie, Liu Wei, Tao Dacheng
IEEE Trans Image Process. 2019 Aug;28(8):4032-4044. doi: 10.1109/TIP.2019.2903661. Epub 2019 Mar 13.
Hashing plays a pivotal role in nearest-neighbor searching for large-scale image retrieval. Recently, deep learning-based hashing methods have achieved promising performance. However, most of these deep methods involve discriminative models, which require large-scale, labeled training datasets, thus hindering their real-world applications. In this paper, we propose a novel strategy to exploit the semantic similarity of the training data and design an efficient generative adversarial framework to learn binary hash codes in an unsupervised manner. Specifically, our model consists of three different neural networks: an encoder network to learn hash codes from images, a generative network to generate images from hash codes, and a discriminative network to distinguish between pairs of hash codes and images. By adversarially training these networks, we successfully learn mutually coherent encoder and generative networks, and can output efficient hash codes from the encoder network. We also propose a novel strategy, which utilizes both feature and neighbor similarities, to construct a semantic similarity matrix, then use this matrix to guide the hash code learning process. Integrating the supervision of this semantic similarity matrix into the adversarial learning framework can efficiently preserve the semantic information of training data in Hamming space. The experimental results on three widely used benchmarks show that our method not only significantly outperforms several state-of-the-art unsupervised hashing methods, but also achieves comparable performance with popular supervised hashing methods.
哈希在大规模图像检索的最近邻搜索中起着关键作用。近年来,基于深度学习的哈希方法取得了令人瞩目的性能。然而,这些深度方法大多涉及判别模型,这需要大规模的带标签训练数据集,从而阻碍了它们在实际中的应用。在本文中,我们提出了一种新颖的策略来利用训练数据的语义相似性,并设计了一个高效的生成对抗框架,以无监督的方式学习二进制哈希码。具体来说,我们的模型由三个不同的神经网络组成:一个编码器网络,用于从图像中学习哈希码;一个生成网络,用于从哈希码生成图像;一个判别网络,用于区分哈希码对和图像对。通过对这些网络进行对抗训练,我们成功地学习到相互连贯的编码器和生成网络,并能从编码器网络输出高效的哈希码。我们还提出了一种新颖的策略,该策略利用特征和邻居相似性来构建语义相似性矩阵,然后使用这个矩阵来指导哈希码学习过程。将这种语义相似性矩阵的监督集成到对抗学习框架中,可以有效地在汉明空间中保留训练数据的语义信息。在三个广泛使用的基准上的实验结果表明,我们的方法不仅显著优于几种现有的无监督哈希方法,而且与流行的有监督哈希方法取得了相当的性能。