• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

多模态判别式二值嵌入的大规模跨模态检索。

Multimodal Discriminative Binary Embedding for Large-Scale Cross-Modal Retrieval.

出版信息

IEEE Trans Image Process. 2016 Oct;25(10):4540-54. doi: 10.1109/TIP.2016.2592800. Epub 2016 Jul 18.

DOI:10.1109/TIP.2016.2592800
PMID:27448355
Abstract

Multimodal hashing, which conducts effective and efficient nearest neighbor search across heterogeneous data on large-scale multimedia databases, has been attracting increasing interest, given the explosive growth of multimedia content on the Internet. Recent multimodal hashing research mainly aims at learning the compact binary codes to preserve semantic information given by labels. The overwhelming majority of these methods are similarity preserving approaches which approximate pairwise similarity matrix with Hamming distances between the to-be-learnt binary hash codes. However, these methods ignore the discriminative property in hash learning process, which results in hash codes from different classes undistinguished, and therefore reduces the accuracy and robustness for the nearest neighbor search. To this end, we present a novel multimodal hashing method, named multimodal discriminative binary embedding (MDBE), which focuses on learning discriminative hash codes. First, the proposed method formulates the hash function learning in terms of classification, where the binary codes generated by the learned hash functions are expected to be discriminative. And then, it exploits the label information to discover the shared structures inside heterogeneous data. Finally, the learned structures are preserved for hash codes to produce similar binary codes in the same class. Hence, the proposed MDBE can preserve both discriminability and similarity for hash codes, and will enhance retrieval accuracy. Thorough experiments on benchmark data sets demonstrate that the proposed method achieves excellent accuracy and competitive computational efficiency compared with the state-of-the-art methods for large-scale cross-modal retrieval task.

摘要

多模态散列技术在互联网上多媒体内容的爆炸式增长的背景下,因其可以在大规模多媒体数据库中对异构数据进行有效和高效的最近邻搜索而受到越来越多的关注。最近的多模态散列研究主要旨在学习紧凑的二进制代码以保留标签提供的语义信息。这些方法中的绝大多数都是相似性保留方法,即用待学习的二进制散列码之间的汉明距离来近似成对相似矩阵。然而,这些方法忽略了散列学习过程中的判别特性,导致不同类别的散列码无法区分,从而降低了最近邻搜索的准确性和鲁棒性。为此,我们提出了一种新的多模态散列方法,称为多模态判别二进制嵌入(MDBE),它专注于学习判别散列码。首先,该方法从分类的角度来表述散列函数的学习,即期望通过学习的散列函数生成的二进制代码具有判别性。然后,它利用标签信息来发现异构数据中的共享结构。最后,将学习到的结构保留在散列码中,以生成同一类中相似的二进制码。因此,所提出的 MDBE 可以同时保留散列码的判别能力和相似性,从而提高检索的准确性。在基准数据集上的大量实验表明,与大规模跨模态检索任务的最新方法相比,该方法具有出色的准确性和有竞争力的计算效率。

相似文献

1
Multimodal Discriminative Binary Embedding for Large-Scale Cross-Modal Retrieval.多模态判别式二值嵌入的大规模跨模态检索。
IEEE Trans Image Process. 2016 Oct;25(10):4540-54. doi: 10.1109/TIP.2016.2592800. Epub 2016 Jul 18.
2
Learning Discriminative Binary Codes for Large-scale Cross-modal Retrieval.学习判别式二进制代码进行大规模跨模态检索。
IEEE Trans Image Process. 2017 May;26(5):2494-2507. doi: 10.1109/TIP.2017.2676345. Epub 2017 Mar 1.
3
Deep Semantic-Preserving Ordinal Hashing for Cross-Modal Similarity Search.用于跨模态相似性搜索的深度语义保持序数哈希
IEEE Trans Neural Netw Learn Syst. 2019 May;30(5):1429-1440. doi: 10.1109/TNNLS.2018.2869601. Epub 2018 Oct 1.
4
Label Consistent Matrix Factorization Hashing for Large-Scale Cross-Modal Similarity Search.用于大规模跨模态相似性搜索的标签一致矩阵分解哈希算法
IEEE Trans Pattern Anal Mach Intell. 2019 Oct;41(10):2466-2479. doi: 10.1109/TPAMI.2018.2861000. Epub 2018 Jul 30.
5
Hierarchical Recurrent Neural Hashing for Image Retrieval With Hierarchical Convolutional Features.基于层次卷积特征的层次递归神经网络哈希图像检索
IEEE Trans Image Process. 2018;27(1):106-120. doi: 10.1109/TIP.2017.2755766.
6
Multi-Manifold Deep Discriminative Cross-Modal Hashing for Medical Image Retrieval.多流形深度判别跨模态哈希医学图像检索。
IEEE Trans Image Process. 2022;31:3371-3385. doi: 10.1109/TIP.2022.3171081. Epub 2022 May 9.
7
Unsupervised Semantic-Preserving Adversarial Hashing for Image Search.用于图像搜索的无监督语义保持对抗哈希
IEEE Trans Image Process. 2019 Aug;28(8):4032-4044. doi: 10.1109/TIP.2019.2903661. Epub 2019 Mar 13.
8
Structure Sensitive Hashing With Adaptive Product Quantization.结构敏感哈希与自适应乘积量化。
IEEE Trans Cybern. 2016 Oct;46(10):2252-2264. doi: 10.1109/TCYB.2015.2474742. Epub 2015 Oct 1.
9
Joint Specifics and Consistency Hash Learning for Large-Scale Cross-Modal Retrieval.用于大规模跨模态检索的关节细节与一致性哈希学习
IEEE Trans Image Process. 2022;31:5343-5358. doi: 10.1109/TIP.2022.3195059. Epub 2022 Aug 16.
10
Deep Semantic Multimodal Hashing Network for Scalable Image-Text and Video-Text Retrievals.用于可扩展图像-文本和视频-文本检索的深度语义多模态哈希网络
IEEE Trans Neural Netw Learn Syst. 2023 Apr;34(4):1838-1851. doi: 10.1109/TNNLS.2020.2997020. Epub 2023 Apr 4.

引用本文的文献

1
Discrete Two-Step Cross-Modal Hashing through the Exploitation of Pairwise Relations.通过利用成对关系实现离散两步跨模态哈希。
Comput Intell Neurosci. 2021 Sep 27;2021:4846043. doi: 10.1155/2021/4846043. eCollection 2021.