Duan Ling-Yu, Wu Yuwei, Huang Yicheng, Wang Zhe, Yuan Junsong, Gao Wen
IEEE Trans Image Process. 2018 Mar 21. doi: 10.1109/TIP.2018.2818008.
Hashing, a widely-studied solution to the approximate nearest neighbor (ANN) search, aims to map data points in the high-dimensional Euclidean space to the low-dimensional Hamming space while preserving the similarity between original points. As directly learning binary codes can be NP-hard due to discrete constraints, a two-stage scheme, namely "projection and quantization", has already become a standard paradigm for learning similarity-preserving hash codes. However, most existing hashing methods typically separate these two stages and thus fail to investigate complementary effects of both stages. In this paper, we systematically study the relationship between "projection and quantization", and propose a novel minimal reconstruction bias hashing (MRH) method to learn compact binary codes, in which the projection learning and quantization optimizing are jointly performed. By introducing a lower bound analysis, we design an effective ternary search algorithm to solve the corresponding optimization problem. Furthermore, we conduct some insightful discussions on the proposed MRH approach, including the theoretical proof, and computational complexity. Distinct from previous works, MRH can adaptively adjust the projection dimensionality to balance the information loss between projection and quantization. The proposed framework not only provides a unique perspective to view traditional hashing methods but also evokes some other researches, e.g., guiding the design of the loss functions in deep networks. Extensive experiment results have shown that the proposed MRH significantly outperforms a variety of state-of-the-art methods over eight widely used benchmarks.
哈希作为一种针对近似最近邻(ANN)搜索进行了广泛研究的解决方案,旨在将高维欧几里得空间中的数据点映射到低维汉明空间,同时保留原始点之间的相似性。由于离散约束,直接学习二进制码可能是NP难问题,因此一种两阶段方案,即“投影和量化”,已经成为学习保留相似性的哈希码的标准范式。然而,大多数现有的哈希方法通常将这两个阶段分开,因此未能研究两个阶段的互补效应。在本文中,我们系统地研究了“投影和量化”之间的关系,并提出了一种新颖的最小重构偏差哈希(MRH)方法来学习紧凑的二进制码,其中投影学习和量化优化是联合进行的。通过引入下界分析,我们设计了一种有效的三分搜索算法来解决相应的优化问题。此外,我们对提出的MRH方法进行了一些有见地的讨论,包括理论证明和计算复杂性。与以前的工作不同,MRH可以自适应地调整投影维度,以平衡投影和量化之间的信息损失。所提出的框架不仅为看待传统哈希方法提供了独特的视角,还引发了一些其他研究,例如指导深度网络中损失函数的设计。大量实验结果表明,在八个广泛使用的基准测试中,所提出的MRH明显优于各种现有方法。