Zhou Chang, Po Lai Man, Ou Weifeng
IEEE Trans Neural Netw Learn Syst. 2022 Apr;33(4):1638-1649. doi: 10.1109/TNNLS.2020.3043103. Epub 2022 Apr 4.
Most of the deep quantization methods adopt unsupervised approaches, and the quantization process usually occurs in the Euclidean space on top of the deep feature and its approximate value. When this approach is applied to the retrieval tasks, since the internal product space of the retrieval process is different from the Euclidean space of quantization, minimizing the quantization error (QE) does not necessarily lead to a good performance on the maximum inner product search (MIPS). To solve these problems, we treat Softmax classification as vector quantization (VQ) with angular decision boundaries and propose angular deep supervised VQ (ADSVQ) for image retrieval. Our approach can simultaneously learn the discriminative feature representation and the updatable codebook, both lying on a hypersphere. To reduce the QE between centroids and deep features, two regularization terms are proposed as supervision signals to encourage the intra-class compactness and inter-class balance, respectively. ADSVQ explicitly reformulates the asymmetric distance computation in MIPS to transform the image retrieval process into a two-stage classification process. Moreover, we discuss the extension of multiple-label cases from the perspective of quantization with binary classification. Extensive experiments demonstrate that the proposed ADSVQ has excellent performance on four well-known image data sets when compared with the state-of-the-art hashing methods.
大多数深度量化方法采用无监督方法,量化过程通常发生在深度特征及其近似值之上的欧几里得空间中。当这种方法应用于检索任务时,由于检索过程的内积空间与量化的欧几里得空间不同,最小化量化误差(QE)并不一定能在最大内积搜索(MIPS)上带来良好的性能。为了解决这些问题,我们将Softmax分类视为具有角度决策边界的向量量化(VQ),并提出用于图像检索的角度深度监督VQ(ADSVQ)。我们的方法可以同时学习判别性特征表示和可更新的码本,两者都位于超球面上。为了减少质心与深度特征之间的QE,提出了两个正则化项作为监督信号,分别鼓励类内紧凑性和类间平衡。ADSVQ明确地重新制定了MIPS中的不对称距离计算,将图像检索过程转化为两阶段分类过程。此外,我们从二元分类量化的角度讨论了多标签情况的扩展。大量实验表明,与现有哈希方法相比,所提出的ADSVQ在四个著名的图像数据集上具有优异的性能。