In Defense of Locality-Sensitive Hashing.

Publication information

IEEE Trans Neural Netw Learn Syst. 2018 Jan;29(1):87-103. doi: 10.1109/TNNLS.2016.2615085. Epub 2016 Oct 24.

DOI:10.1109/TNNLS.2016.2615085

Abstract

Hashing-based semantic similarity search is becoming increasingly important for building large-scale content-based retrieval systems. State-of-the-art supervised hashing techniques use a flexible two-step strategy to learn hash functions. The first step learns binary codes for the training data by solving binary optimization problems with millions of variables, and therefore usually requires intensive computation. Despite its simplicity and efficiency, locality-sensitive hashing (LSH) has never been recognized as a good way to generate such codes, because of its poor performance in traditional approximate neighbor search. We claim in this paper that the true merit of LSH lies in transforming the semantic labels to obtain the binary codes, which yields an effective and efficient two-step hashing framework. Specifically, we develop locality-sensitive two-step hashing (LS-TSH), which generates the binary codes through LSH rather than through any complex optimization technique. Theoretically, under proper assumptions, LS-TSH is a useful LSH scheme: it preserves the label-based semantic similarity and has sublinear query complexity for hash lookup. Experimentally, LS-TSH obtains retrieval accuracy comparable to the state of the art while training two to three orders of magnitude faster.
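The two-step idea in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract does not say which LSH family LS-TSH uses, so the sketch assumes sign random projections, and it assumes a ridge-regression linear hash function for the second step. The function names (`lsh_codes_from_labels`, `fit_linear_hash`, `hash_features`) and the toy data are hypothetical.

```python
import numpy as np

def lsh_codes_from_labels(labels, n_bits, seed=0):
    """Step 1 (sketch): hash label vectors with sign random projections.

    Assumption: one Gaussian hyperplane per bit; the paper's LSH family may differ.
    """
    rng = np.random.default_rng(seed)
    planes = rng.standard_normal((labels.shape[1], n_bits))
    # Identical label vectors always receive identical codes.
    return (labels @ planes >= 0).astype(np.uint8)

def fit_linear_hash(features, codes, ridge=1e-3):
    """Step 2 (sketch): ridge regression from features to +/-1 code targets."""
    targets = 2.0 * codes - 1.0
    gram = features.T @ features + ridge * np.eye(features.shape[1])
    return np.linalg.solve(gram, features.T @ targets)

def hash_features(features, weights):
    """Hash new points by thresholding the learned linear projections."""
    return (features @ weights >= 0).astype(np.uint8)

# Toy data (hypothetical): 4 classes, 50 points each, 16-dimensional features.
rng = np.random.default_rng(1)
n_classes, n_per_class, dim, n_bits = 4, 50, 16, 12
class_ids = np.repeat(np.arange(n_classes), n_per_class)
labels = np.eye(n_classes)[class_ids]  # one-hot label vectors
centers = 5.0 * rng.standard_normal((n_classes, dim))
features = centers[class_ids] + rng.standard_normal((class_ids.size, dim))

codes = lsh_codes_from_labels(labels, n_bits)  # no binary optimization needed
weights = fit_linear_hash(features, codes)
predicted = hash_features(features, weights)
```

Step 1 here is a single matrix product followed by a sign test, which is the kind of cost saving the abstract attributes to replacing binary optimization with LSH. With multi-label vectors instead of one-hot ones, points that share more labels would tend to agree on more bits under this assumed hash family.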

Similar articles

Hierarchical Recurrent Neural Hashing for Image Retrieval With Hierarchical Convolutional Features.

IEEE Trans Image Process. 2018;27(1):106-120. doi: 10.1109/TIP.2017.2755766.

Robust hashing with local models for approximate similarity search.

IEEE Trans Cybern. 2014 Jul;44(7):1225-36. doi: 10.1109/TCYB.2013.2289351.

Unsupervised Semantic-Preserving Adversarial Hashing for Image Search.

IEEE Trans Image Process. 2019 Aug;28(8):4032-4044. doi: 10.1109/TIP.2019.2903661. Epub 2019 Mar 13.

Simultaneous Feature Aggregating and Hashing for Compact Binary Code Learning.

IEEE Trans Image Process. 2019 Oct;28(10):4954-4969. doi: 10.1109/TIP.2019.2913509. Epub 2019 May 8.

Semi-supervised hashing for large-scale search.

IEEE Trans Pattern Anal Mach Intell. 2012 Dec;34(12):2393-406. doi: 10.1109/TPAMI.2012.48.

Supervised hashing using graph cuts and boosted decision trees.

IEEE Trans Pattern Anal Mach Intell. 2015 Nov;37(11):2317-31. doi: 10.1109/TPAMI.2015.2404776.

Deep Semantic-Preserving Ordinal Hashing for Cross-Modal Similarity Search.

IEEE Trans Neural Netw Learn Syst. 2019 May;30(5):1429-1440. doi: 10.1109/TNNLS.2018.2869601. Epub 2018 Oct 1.

Deep Class-Wise Hashing: Semantics-Preserving Hashing via Class-Wise Loss.

IEEE Trans Neural Netw Learn Syst. 2020 May;31(5):1681-1695. doi: 10.1109/TNNLS.2019.2921805. Epub 2019 Jul 10.

Multimodal Discriminative Binary Embedding for Large-Scale Cross-Modal Retrieval.

IEEE Trans Image Process. 2016 Oct;25(10):4540-54. doi: 10.1109/TIP.2016.2592800. Epub 2016 Jul 18.

Cited by

An Online Hashing Algorithm for Image Retrieval Based on Optical-Sensor Network.

Sensors (Basel). 2023 Feb 25;23(5):2576. doi: 10.3390/s23052576.

A Framework for Enabling Unpaired Multi-Modal Learning for Deep Cross-Modal Hashing Retrieval.

J Imaging. 2022 Dec 15;8(12):328. doi: 10.3390/jimaging8120328.

PMID:28113786
