• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

用于跨媒体检索的离散语义对齐哈希

Discrete Semantic Alignment Hashing for Cross-Media Retrieval.

作者信息

Yao Tao, Kong Xiangwei, Fu Haiyan, Tian Qi

出版信息

IEEE Trans Cybern. 2020 Dec;50(12):4896-4907. doi: 10.1109/TCYB.2019.2912644. Epub 2020 Dec 3.

DOI:10.1109/TCYB.2019.2912644
PMID:31107671
Abstract

Cross-media hashing, which maps data from different modalities to a low-dimensional sharing Hamming space, has attracted considerable attention due to the rapid increase of multimodal data, for example, images and texts. Recent cross-media hashing works mainly aim at learning compact hash codes to preserve the class label-based or feature-based similarities among samples. However, these methods ignore the unbalanced semantic gaps between different modalities and high-level semantic concepts, which generally results in less effective hash functions and unsatisfying retrieval performance. Specifically, the key words of texts contain semantic meanings, while the low-level features of images lack of semantic meanings. That means the semantic gap in image modality is larger than that in text modality. In this paper, we propose a simple yet effective hashing method for cross-media retrieval to address this problem, dubbed discrete semantic alignment hashing (DSAH). First, DSAH formulates to exploit collaborative filtering to mine the relations between class labels and hash codes, which can reduce memory consumption and computational cost compared to pairwise similarity. Then, the attribute of image modality is employed to align the semantic information with text modality. Finally, to further improve the quality of hash codes, we propose a discrete optimization algorithm to learn discrete hash codes directly, and each bit has a closed-form solution. Extensive experiments on multiple public databases show that our model can seamlessly incorporate attributes and achieve promising performance.

摘要

跨媒体哈希将来自不同模态的数据映射到一个低维共享汉明空间,由于多模态数据(如图像和文本)的快速增长,其已引起了广泛关注。最近的跨媒体哈希工作主要旨在学习紧凑的哈希码,以保留样本之间基于类标签或基于特征的相似性。然而,这些方法忽略了不同模态和高级语义概念之间不平衡的语义差距,这通常会导致哈希函数效率较低且检索性能不尽人意。具体而言,文本的关键词包含语义含义,而图像的低级特征缺乏语义含义。这意味着图像模态中的语义差距大于文本模态中的语义差距。在本文中,我们提出了一种简单而有效的跨媒体检索哈希方法来解决这个问题,称为离散语义对齐哈希(DSAH)。首先,DSAH制定利用协同过滤来挖掘类标签和哈希码之间的关系,与成对相似性相比,这可以减少内存消耗和计算成本。然后,利用图像模态的属性将语义信息与文本模态对齐。最后,为了进一步提高哈希码的质量,我们提出了一种离散优化算法来直接学习离散哈希码,并且每个位都有一个闭式解。在多个公共数据库上进行的大量实验表明,我们的模型可以无缝整合属性并取得有前景的性能。

相似文献

1
Discrete Semantic Alignment Hashing for Cross-Media Retrieval.用于跨媒体检索的离散语义对齐哈希
IEEE Trans Cybern. 2020 Dec;50(12):4896-4907. doi: 10.1109/TCYB.2019.2912644. Epub 2020 Dec 3.
2
Hierarchical Recurrent Neural Hashing for Image Retrieval With Hierarchical Convolutional Features.基于层次卷积特征的层次递归神经网络哈希图像检索
IEEE Trans Image Process. 2018;27(1):106-120. doi: 10.1109/TIP.2017.2755766.
3
Fast discrete cross-modal hashing with semantic consistency.快速离散跨模态哈希与语义一致性。
Neural Netw. 2020 May;125:142-152. doi: 10.1016/j.neunet.2020.01.035. Epub 2020 Feb 11.
4
Exploring Auxiliary Context: Discrete Semantic Transfer Hashing for Scalable Image Retrieval.探索辅助上下文:用于可扩展图像检索的离散语义转移哈希
IEEE Trans Neural Netw Learn Syst. 2018 Nov;29(11):5264-5276. doi: 10.1109/TNNLS.2018.2797248. Epub 2018 Feb 14.
5
Deep Semantic Multimodal Hashing Network for Scalable Image-Text and Video-Text Retrievals.用于可扩展图像-文本和视频-文本检索的深度语义多模态哈希网络
IEEE Trans Neural Netw Learn Syst. 2023 Apr;34(4):1838-1851. doi: 10.1109/TNNLS.2020.2997020. Epub 2023 Apr 4.
6
Learning Discriminative Binary Codes for Large-scale Cross-modal Retrieval.学习判别式二进制代码进行大规模跨模态检索。
IEEE Trans Image Process. 2017 May;26(5):2494-2507. doi: 10.1109/TIP.2017.2676345. Epub 2017 Mar 1.
7
Similarity-Preserving Linkage Hashing for Online Image Retrieval.用于在线图像检索的相似性保持链接哈希
IEEE Trans Image Process. 2020 Mar 24. doi: 10.1109/TIP.2020.2981879.
8
Label Consistent Matrix Factorization Hashing for Large-Scale Cross-Modal Similarity Search.用于大规模跨模态相似性搜索的标签一致矩阵分解哈希算法
IEEE Trans Pattern Anal Mach Intell. 2019 Oct;41(10):2466-2479. doi: 10.1109/TPAMI.2018.2861000. Epub 2018 Jul 30.
9
Semantic Neighbor Graph Hashing for Multimodal Retrieval.基于语义邻居图的哈希的多模态检索。
IEEE Trans Image Process. 2018 Mar;27(3):1405-1417. doi: 10.1109/TIP.2017.2776745. Epub 2017 Nov 22.
10
Multimodal Discriminative Binary Embedding for Large-Scale Cross-Modal Retrieval.多模态判别式二值嵌入的大规模跨模态检索。
IEEE Trans Image Process. 2016 Oct;25(10):4540-54. doi: 10.1109/TIP.2016.2592800. Epub 2016 Jul 18.