用于跨模态相似性搜索的深度语义保持序数哈希

Deep Semantic-Preserving Ordinal Hashing for Cross-Modal Similarity Search.

作者信息

Jin Lu, Li Kai, Li Zechao, Xiao Fu, Qi Guo-Jun, Tang Jinhui

出版信息

IEEE Trans Neural Netw Learn Syst. 2019 May;30(5):1429-1440. doi: 10.1109/TNNLS.2018.2869601. Epub 2018 Oct 1.

DOI:10.1109/TNNLS.2018.2869601

Abstract

Cross-modal hashing has attracted increasing research attention due to its efficiency for large-scale multimedia retrieval. With simultaneous feature representation and hash function learning, deep cross-modal hashing (DCMH) methods have shown superior performance. However, most existing methods on DCMH adopt binary quantization functions (e.g., [Formula: see text]) to generate hash codes, which limit the retrieval performance since binary quantization functions are sensitive to the variations of numeric values. Toward this end, we propose a novel end-to-end ranking-based hashing framework, in this paper, termed as deep semantic-preserving ordinal hashing (DSPOH), to learn hash functions with deep neural networks by exploring the ranking structure of feature dimensions. In DSPOH, the ordinal representation, which encodes the relative rank ordering of feature dimensions, is explored to generate hash codes. Such ordinal embedding benefits from the numeric stability of rank correlation measures. To make the hash codes discriminative, the ordinal representation is expected to well predict the class labels so that the ranking-based hash function learning is optimally compatible with the label predicting. Meanwhile, the intermodality similarity is preserved to guarantee that the hash codes of different modalities are consistent. Importantly, DSPOH can be effectively integrated with different types of network architectures, which demonstrates the flexibility and scalability of our proposed hashing framework. Extensive experiments on three widely used multimodal data sets show that DSPOH outperforms state of the art for cross-modal retrieval tasks.

摘要

跨模态哈希因其在大规模多媒体检索中的高效性而受到越来越多的研究关注。通过同时进行特征表示和哈希函数学习，深度跨模态哈希（DCMH）方法已展现出卓越的性能。然而，大多数现有的DCMH方法采用二进制量化函数（例如，[公式：见正文]）来生成哈希码，这限制了检索性能，因为二进制量化函数对数值变化敏感。为此，我们在本文中提出了一种新颖的基于排序的端到端哈希框架，称为深度语义保留序数哈希（DSPOH），通过探索特征维度的排序结构，利用深度神经网络学习哈希函数。在DSPOH中，探索了对特征维度的相对排序进行编码的序数表示来生成哈希码。这种序数嵌入受益于秩相关度量的数值稳定性。为使哈希码具有判别力，期望序数表示能很好地预测类别标签，从而使基于排序的哈希函数学习与标签预测最优地兼容。同时，保留跨模态相似性以确保不同模态的哈希码一致。重要的是，DSPOH可以有效地与不同类型的网络架构集成，这证明了我们提出的哈希框架的灵活性和可扩展性。在三个广泛使用的多模态数据集上进行的大量实验表明，DSPOH在跨模态检索任务中优于现有技术。

相似文献

Deep Semantic-Preserving Ordinal Hashing for Cross-Modal Similarity Search.

IEEE Trans Neural Netw Learn Syst. 2019 May;30(5):1429-1440. doi: 10.1109/TNNLS.2018.2869601. Epub 2018 Oct 1.

Hierarchical Recurrent Neural Hashing for Image Retrieval With Hierarchical Convolutional Features.

IEEE Trans Image Process. 2018;27(1):106-120. doi: 10.1109/TIP.2017.2755766.

Deep Semantic Multimodal Hashing Network for Scalable Image-Text and Video-Text Retrievals.

IEEE Trans Neural Netw Learn Syst. 2023 Apr;34(4):1838-1851. doi: 10.1109/TNNLS.2020.2997020. Epub 2023 Apr 4.

Deep Ordinal Hashing With Spatial Attention.

IEEE Trans Image Process. 2019 May;28(5):2173-2186. doi: 10.1109/TIP.2018.2883522. Epub 2018 Nov 28.

Multimodal Discriminative Binary Embedding for Large-Scale Cross-Modal Retrieval.

IEEE Trans Image Process. 2016 Oct;25(10):4540-54. doi: 10.1109/TIP.2016.2592800. Epub 2016 Jul 18.

Triplet-Based Deep Hashing Network for Cross-Modal Retrieval.

IEEE Trans Image Process. 2018 Aug;27(8):3893-3903. doi: 10.1109/TIP.2018.2821921. Epub 2018 Apr 4.

Learning Discriminative Binary Codes for Large-scale Cross-modal Retrieval.

IEEE Trans Image Process. 2017 May;26(5):2494-2507. doi: 10.1109/TIP.2017.2676345. Epub 2017 Mar 1.

Unsupervised Semantic-Preserving Adversarial Hashing for Image Search.

IEEE Trans Image Process. 2019 Aug;28(8):4032-4044. doi: 10.1109/TIP.2019.2903661. Epub 2019 Mar 13.

Label Consistent Matrix Factorization Hashing for Large-Scale Cross-Modal Similarity Search.

IEEE Trans Pattern Anal Mach Intell. 2019 Oct;41(10):2466-2479. doi: 10.1109/TPAMI.2018.2861000. Epub 2018 Jul 30.

Linear Subspace Ranking Hashing for Cross-Modal Retrieval.

IEEE Trans Pattern Anal Mach Intell. 2017 Sep;39(9):1825-1838. doi: 10.1109/TPAMI.2016.2610969. Epub 2016 Sep 19.

引用本文的文献

A Framework for Enabling Unpaired Multi-Modal Learning for Deep Cross-Modal Hashing Retrieval.

J Imaging. 2022 Dec 15;8(12):328. doi: 10.3390/jimaging8120328.

A new design of multimedia big data retrieval enabled by deep feature learning and Adaptive Semantic Similarity Function.

Multimed Syst. 2022;28(3):1039-1058. doi: 10.1007/s00530-022-00897-8. Epub 2022 Feb 5.

Deep Semantic-Preserving Reconstruction Hashing for Unsupervised Cross-Modal Retrieval.

Entropy (Basel). 2020 Nov 7;22(11):1266. doi: 10.3390/e22111266.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

用于跨模态相似性搜索的深度语义保持序数哈希

Deep Semantic-Preserving Ordinal Hashing for Cross-Modal Similarity Search.

作者信息

Jin Lu, Li Kai, Li Zechao, Xiao Fu, Qi Guo-Jun, Tang Jinhui

出版信息

IEEE Trans Neural Netw Learn Syst. 2019 May;30(5):1429-1440. doi: 10.1109/TNNLS.2018.2869601. Epub 2018 Oct 1.

DOI:10.1109/TNNLS.2018.2869601

PMID:30281496

Abstract

摘要

用于跨模态相似性搜索的深度语义保持序数哈希

Deep Semantic-Preserving Ordinal Hashing for Cross-Modal Similarity Search.

作者信息

出版信息

相似文献

引用本文的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

用于跨模态相似性搜索的深度语义保持序数哈希

Deep Semantic-Preserving Ordinal Hashing for Cross-Modal Similarity Search.

作者信息

出版信息

相似文献

引用本文的文献