Distributed Complementary Binary Quantization for Joint Hash Table Learning.

Author information

Liu Xianglong, Fu Qiang, Wang Deqing, Bai Xiao, Wu Xinyu, Tao Dacheng

Publication information

IEEE Trans Neural Netw Learn Syst. 2020 Dec;31(12):5312-5323. doi: 10.1109/TNNLS.2020.2965992. Epub 2020 Nov 30.

DOI:10.1109/TNNLS.2020.2965992

PMID:32078562

Abstract

Building multiple hash tables is a highly successful technique for indexing gigantic data sets, as it can guarantee both search accuracy and efficiency. However, most existing multitable indexing solutions lack informative hash codes and strong table complementarity, and therefore suffer heavily from table redundancy. To address this problem, we propose a complementary binary quantization (CBQ) method for jointly learning multiple tables and the corresponding informative hash functions in a centralized way. Based on CBQ, we further design a distributed learning algorithm (D-CBQ) to accelerate training over large-scale distributed data sets. The proposed (D-)CBQ exploits the power of prototype-based incomplete binary coding to align the data distributions in the original space and the Hamming space, and further utilizes the nature of multi-index search to jointly reduce the quantization loss. (D-)CBQ possesses several attractive properties, including extensibility for generating long hash codes in the product space and scalability with linear training time. Extensive experiments on two popular large-scale tasks, Euclidean and semantic nearest neighbor search, demonstrate that the proposed (D-)CBQ enjoys efficient computation, informative binary quantization, and strong table complementarity, which together help it significantly outperform the state of the art, with relative performance gains of up to 57.76%.
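For readers unfamiliar with multitable indexing, the sketch below illustrates the general idea only. It is not the CBQ or D-CBQ method: the hash functions here are independent random hyperplanes (a standard locality-sensitive hashing baseline), whereas the paper learns its tables jointly. All names (`MultiTableIndex`, `make_table`, `hash_code`) and parameter values are hypothetical and chosen for illustration.

```python
import random


def make_table(dim, n_bits, rng):
    """One hash function: n_bits random hyperplanes through the origin."""
    return [[rng.gauss(0.0, 1.0) for _ in range(dim)] for _ in range(n_bits)]


def hash_code(planes, x):
    """Binary code of x: one sign bit per hyperplane, returned as a tuple."""
    return tuple(
        1 if sum(w * v for w, v in zip(plane, x)) >= 0 else 0
        for plane in planes
    )


class MultiTableIndex:
    """Several independent hash tables over the same data (illustrative only)."""

    def __init__(self, dim, n_bits, n_tables, seed=0):
        rng = random.Random(seed)
        self.planes = [make_table(dim, n_bits, rng) for _ in range(n_tables)]
        self.tables = [dict() for _ in range(n_tables)]
        self.data = []

    def add(self, x):
        """Store x and register its index in one bucket per table."""
        idx = len(self.data)
        self.data.append(x)
        for planes, table in zip(self.planes, self.tables):
            table.setdefault(hash_code(planes, x), []).append(idx)
        return idx

    def candidates(self, q):
        """Union of the buckets that q falls into across all tables."""
        found = set()
        for planes, table in zip(self.planes, self.tables):
            found.update(table.get(hash_code(planes, q), []))
        return found

    def search(self, q, k=1):
        """Rank the candidates by exact squared Euclidean distance."""
        def dist(i):
            return sum((a - b) ** 2 for a, b in zip(self.data[i], q))
        return sorted(self.candidates(q), key=dist)[:k]


if __name__ == "__main__":
    rng = random.Random(42)
    points = [[rng.gauss(0.0, 1.0) for _ in range(4)] for _ in range(200)]
    index = MultiTableIndex(dim=4, n_bits=6, n_tables=3, seed=1)
    for p in points:
        index.add(p)
    query = [0.1, -0.2, 0.3, 0.4]
    print(len(index.candidates(query)), index.search(query, k=3))
```

Each extra table can only add candidates to the union, which is why multiple tables raise recall. With independent random tables, however, the buckets often overlap heavily, which is the redundancy the abstract refers to and which jointly learned, complementary tables are meant to reduce.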

Similar articles

Distributed Adaptive Binary Quantization for Fast Nearest Neighbor Search.

IEEE Trans Image Process. 2017 Nov;26(11):5324-5336. doi: 10.1109/TIP.2017.2729896. Epub 2017 Jul 24.

Query-Adaptive Reciprocal Hash Tables for Nearest Neighbor Search.

IEEE Trans Image Process. 2016 Feb;25(2):907-19. doi: 10.1109/TIP.2015.2505180. Epub 2015 Dec 3.

Query-Adaptive Hash Code Ranking for Large-Scale Multi-View Visual Search.

IEEE Trans Image Process. 2016 Oct;25(10):4514-24. doi: 10.1109/TIP.2016.2593344. Epub 2016 Jul 19.

Structure Sensitive Hashing With Adaptive Product Quantization.

IEEE Trans Cybern. 2016 Oct;46(10):2252-2264. doi: 10.1109/TCYB.2015.2474742. Epub 2015 Oct 1.

Multimodal Discriminative Binary Embedding for Large-Scale Cross-Modal Retrieval.

IEEE Trans Image Process. 2016 Oct;25(10):4540-54. doi: 10.1109/TIP.2016.2592800. Epub 2016 Jul 18.

Minimizing Reconstruction Bias Hashing via Joint Projection Learning and Quantization.

IEEE Trans Image Process. 2018 Mar 21. doi: 10.1109/TIP.2018.2818008.

Hierarchical Recurrent Neural Hashing for Image Retrieval With Hierarchical Convolutional Features.

IEEE Trans Image Process. 2018;27(1):106-120. doi: 10.1109/TIP.2017.2755766.

Fast Exact Search in Hamming Space With Multi-Index Hashing.

IEEE Trans Pattern Anal Mach Intell. 2014 Jun;36(6):1107-19. doi: 10.1109/TPAMI.2013.231.

Unsupervised Semantic-Preserving Adversarial Hashing for Image Search.

IEEE Trans Image Process. 2019 Aug;28(8):4032-4044. doi: 10.1109/TIP.2019.2903661. Epub 2019 Mar 13.
