• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

同时半耦合字典学习在典范空间中的匹配。

Simultaneous Semi-Coupled Dictionary Learning for Matching in Canonical Space.

出版信息

IEEE Trans Image Process. 2017 Aug;26(8):3995-4004. doi: 10.1109/TIP.2017.2707858. Epub 2017 May 24.

DOI:10.1109/TIP.2017.2707858
PMID:28541900
Abstract

Cross-modal recognition and matching with privileged information are important challenging problems in the field of computer vision. The cross-modal scenario deals with matching across different modalities and needs to take care of the large variations present across and within each modality. The privileged information scenario deals with the situation that all the information available during training may not be available during the testing stage, and hence, algorithms need to leverage the extra information from the training stage itself. We show that for multi-modal data, either one of the above situations may arise if one modality is absent during testing. Here, we propose a novel framework, which can handle both these scenarios seamlessly with applications to matching multi-modal data. The proposed approach jointly uses data from the two modalities to build a canonical representation, which encompasses information from both the modalities. We explore four different types of canonical representations for different types of data. The algorithm computes dictionaries and canonical representation for data from both the modalities, such that the transformed sparse coefficients of both the modalities are equal to that of the canonical representation. The sparse coefficients are finally matched using Mahalanobis metric. Extensive experiments on different data sets, involving RGBD, text-image, and audio-image data, show the effectiveness of the proposed framework.

摘要

跨模态识别和匹配与特权信息是计算机视觉领域的重要挑战性问题。跨模态场景涉及跨不同模式的匹配,需要注意不同模式之间以及每个模式内部存在的巨大差异。特权信息场景涉及的情况是,在训练阶段可用的所有信息在测试阶段可能不可用,因此,算法需要利用训练阶段本身的额外信息。我们表明,如果在测试阶段缺少一种模式,则对于多模态数据,上述两种情况中的任何一种都可能出现。在这里,我们提出了一个新的框架,可以无缝地处理这两种情况,并应用于多模态数据的匹配。所提出的方法联合使用两个模态的数据来构建一个规范表示,该表示包含来自两个模态的信息。我们针对不同类型的数据探索了四种不同类型的规范表示。该算法为两个模态的数据计算字典和规范表示,使得两个模态的变换稀疏系数等于规范表示的稀疏系数。最后,使用马氏距离匹配稀疏系数。在涉及 RGBD、文本-图像和音频-图像数据的不同数据集上进行的广泛实验表明了所提出框架的有效性。

相似文献

1
Simultaneous Semi-Coupled Dictionary Learning for Matching in Canonical Space.同时半耦合字典学习在典范空间中的匹配。
IEEE Trans Image Process. 2017 Aug;26(8):3995-4004. doi: 10.1109/TIP.2017.2707858. Epub 2017 May 24.
2
Generalized Coupled Dictionary Learning Approach With Applications to Cross-Modal Matching.广义耦合字典学习方法及其在跨模态匹配中的应用。
IEEE Trans Image Process. 2016 Aug;25(8):3826-37. doi: 10.1109/TIP.2016.2577885. Epub 2016 Jun 7.
3
Multi-Modal Convolutional Dictionary Learning.多模态卷积字典学习
IEEE Trans Image Process. 2022;31:1325-1339. doi: 10.1109/TIP.2022.3141251. Epub 2022 Jan 25.
4
Image transformation based on learning dictionaries across image spaces.基于跨图像空间学习字典的图像变换。
IEEE Trans Pattern Anal Mach Intell. 2013 Feb;35(2):367-80. doi: 10.1109/TPAMI.2012.95.
5
Online multi-modal robust non-negative dictionary learning for visual tracking.用于视觉跟踪的在线多模态鲁棒非负字典学习
PLoS One. 2015 May 11;10(5):e0124685. doi: 10.1371/journal.pone.0124685. eCollection 2015.
6
Cross-Modality Image Synthesis via Weakly Coupled and Geometry Co-Regularized Joint Dictionary Learning.基于弱耦合和几何正则化联合字典学习的跨模态图像合成。
IEEE Trans Med Imaging. 2018 Mar;37(3):815-827. doi: 10.1109/TMI.2017.2781192.
7
Learning Stable Multilevel Dictionaries for Sparse Representations.学习用于稀疏表示的稳定多级字典。
IEEE Trans Neural Netw Learn Syst. 2015 Sep;26(9):1913-26. doi: 10.1109/TNNLS.2014.2361052. Epub 2014 Oct 16.
8
Dictionary learning for stereo image representation.立体图像表示的字典学习。
IEEE Trans Image Process. 2011 Apr;20(4):921-34. doi: 10.1109/TIP.2010.2081679. Epub 2010 Sep 30.
9
Alternatively Constrained Dictionary Learning For Image Superresolution.替代约束字典学习的图像超分辨率方法。
IEEE Trans Cybern. 2014 Mar;44(3):366-77. doi: 10.1109/TCYB.2013.2256347. Epub 2013 May 2.
10
Learning Low-Rank Class-Specific Dictionary and Sparse Intra-Class Variant Dictionary for Face Recognition.学习用于人脸识别的低秩特定类别字典和稀疏类内变体字典。
PLoS One. 2015 Nov 16;10(11):e0142403. doi: 10.1371/journal.pone.0142403. eCollection 2015.