Suppr超能文献

通过联合投影学习和量化最小化重建偏差哈希

Minimizing Reconstruction Bias Hashing via Joint Projection Learning and Quantization.

作者信息

Duan Ling-Yu, Wu Yuwei, Huang Yicheng, Wang Zhe, Yuan Junsong, Gao Wen

出版信息

IEEE Trans Image Process. 2018 Mar 21. doi: 10.1109/TIP.2018.2818008.

Abstract

Hashing, a widely-studied solution to the approximate nearest neighbor (ANN) search, aims to map data points in the high-dimensional Euclidean space to the low-dimensional Hamming space while preserving the similarity between original points. As directly learning binary codes can be NP-hard due to discrete constraints, a two-stage scheme, namely "projection and quantization", has already become a standard paradigm for learning similarity-preserving hash codes. However, most existing hashing methods typically separate these two stages and thus fail to investigate complementary effects of both stages. In this paper, we systematically study the relationship between "projection and quantization", and propose a novel minimal reconstruction bias hashing (MRH) method to learn compact binary codes, in which the projection learning and quantization optimizing are jointly performed. By introducing a lower bound analysis, we design an effective ternary search algorithm to solve the corresponding optimization problem. Furthermore, we conduct some insightful discussions on the proposed MRH approach, including the theoretical proof, and computational complexity. Distinct from previous works, MRH can adaptively adjust the projection dimensionality to balance the information loss between projection and quantization. The proposed framework not only provides a unique perspective to view traditional hashing methods but also evokes some other researches, e.g., guiding the design of the loss functions in deep networks. Extensive experiment results have shown that the proposed MRH significantly outperforms a variety of state-of-the-art methods over eight widely used benchmarks.

摘要

哈希作为一种针对近似最近邻(ANN)搜索进行了广泛研究的解决方案,旨在将高维欧几里得空间中的数据点映射到低维汉明空间,同时保留原始点之间的相似性。由于离散约束,直接学习二进制码可能是NP难问题,因此一种两阶段方案,即“投影和量化”,已经成为学习保留相似性的哈希码的标准范式。然而,大多数现有的哈希方法通常将这两个阶段分开,因此未能研究两个阶段的互补效应。在本文中,我们系统地研究了“投影和量化”之间的关系,并提出了一种新颖的最小重构偏差哈希(MRH)方法来学习紧凑的二进制码,其中投影学习和量化优化是联合进行的。通过引入下界分析,我们设计了一种有效的三分搜索算法来解决相应的优化问题。此外,我们对提出的MRH方法进行了一些有见地的讨论,包括理论证明和计算复杂性。与以前的工作不同,MRH可以自适应地调整投影维度,以平衡投影和量化之间的信息损失。所提出的框架不仅为看待传统哈希方法提供了独特的视角,还引发了一些其他研究,例如指导深度网络中损失函数的设计。大量实验结果表明,在八个广泛使用的基准测试中,所提出的MRH明显优于各种现有方法。

相似文献

3
Equivalent Continuous Formulation of General Hashing Problem.通用哈希问题的等效连续公式。
IEEE Trans Cybern. 2021 Aug;51(8):4089-4099. doi: 10.1109/TCYB.2019.2894020. Epub 2021 Aug 4.
4
Structure Sensitive Hashing With Adaptive Product Quantization.结构敏感哈希与自适应乘积量化。
IEEE Trans Cybern. 2016 Oct;46(10):2252-2264. doi: 10.1109/TCYB.2015.2474742. Epub 2015 Oct 1.
5
Ordinal Constraint Binary Coding for Approximate Nearest Neighbor Search.
IEEE Trans Pattern Anal Mach Intell. 2019 Apr;41(4):941-955. doi: 10.1109/TPAMI.2018.2819978. Epub 2018 Mar 27.
6
A General Framework for Linear Distance Preserving Hashing.一种线性距离保持哈希的通用框架。
IEEE Trans Image Process. 2018 Feb;27(2):907-922. doi: 10.1109/TIP.2017.2751150. Epub 2017 Sep 11.
7
Sequential Discrete Hashing for Scalable Cross-Modality Similarity Retrieval.用于可扩展跨模态相似性检索的序贯离散哈希。
IEEE Trans Image Process. 2017 Jan;26(1):107-118. doi: 10.1109/TIP.2016.2619262. Epub 2016 Oct 19.
8
Distributed Adaptive Binary Quantization for Fast Nearest Neighbor Search.分布式自适应二进制量化用于快速最近邻搜索。
IEEE Trans Image Process. 2017 Nov;26(11):5324-5336. doi: 10.1109/TIP.2017.2729896. Epub 2017 Jul 24.
9
Large-Scale Unsupervised Hashing with Shared Structure Learning.大规模无监督哈希共享结构学习。
IEEE Trans Cybern. 2015 Sep;45(9):1811-22. doi: 10.1109/TCYB.2014.2360856. Epub 2014 Nov 20.
10
Hashing with Angular Reconstructive Embeddings.基于角度重建嵌入的哈希。
IEEE Trans Image Process. 2018 Feb;27(2):545-555. doi: 10.1109/TIP.2017.2749147. Epub 2017 Sep 4.

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验