• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

用于RGB-D动作识别的判别关系表示学习

Discriminative Relational Representation Learning for RGB-D Action Recognition.

出版信息

IEEE Trans Image Process. 2016 Jun;25(6):2856-2865. doi: 10.1109/TIP.2016.2556940. Epub 2016 Apr 20.

DOI:10.1109/TIP.2016.2556940
PMID:28113902
Abstract

This paper addresses the problem of recognizing human actions from RGB-D videos. A discriminative relational feature learning method is proposed for fusing heterogeneous RGB and depth modalities, and classifying the actions in RGB-D sequences. Our method factorizes the feature matrix of each modality, and enforces the same semantics for them in order to learn shared features from multimodal data. This allows us to capture the complex correlations between the two modalities. To improve the discriminative power of the relational features, we introduce a hinge loss to measure the classification accuracy when the features are employed for classification. This essentially performs supervised factorization, and learns discriminative features that are optimized for classification. We formulate the recognition task within a maximum margin framework, and solve the formulation using a coordinate descent algorithm. The proposed method is extensively evaluated on two public RGB-D action data sets. We demonstrate that the proposed method can learn extremely low-dimensional features with superior discriminative power, and outperforms the state-of-the-art methods. It also achieves high performance when one modality is missing in testing or training.

摘要

本文探讨了从RGB-D视频中识别人类动作的问题。提出了一种判别性关系特征学习方法,用于融合异构的RGB和深度模态,并对RGB-D序列中的动作进行分类。我们的方法对每个模态的特征矩阵进行分解,并强制它们具有相同的语义,以便从多模态数据中学习共享特征。这使我们能够捕捉两种模态之间的复杂相关性。为了提高关系特征的判别能力,我们引入了一个铰链损失来衡量当特征用于分类时的分类准确率。这本质上执行了监督分解,并学习了针对分类进行优化的判别特征。我们在最大间隔框架内制定识别任务,并使用坐标下降算法求解该公式。所提出的方法在两个公共RGB-D动作数据集上进行了广泛评估。我们证明,所提出的方法可以学习具有卓越判别能力的极低维特征,并且优于现有方法。当测试或训练中缺少一种模态时,它也能实现高性能。

相似文献

1
Discriminative Relational Representation Learning for RGB-D Action Recognition.用于RGB-D动作识别的判别关系表示学习
IEEE Trans Image Process. 2016 Jun;25(6):2856-2865. doi: 10.1109/TIP.2016.2556940. Epub 2016 Apr 20.
2
Deep Multimodal Feature Analysis for Action Recognition in RGB+D Videos.基于 RGB+D 视频的深度多模态特征分析用于动作识别
IEEE Trans Pattern Anal Mach Intell. 2018 May;40(5):1045-1058. doi: 10.1109/TPAMI.2017.2691321. Epub 2017 Apr 5.
3
Visual Recognition in RGB Images and Videos by Learning from RGB-D Data.通过从RGB-D数据中学习实现RGB图像和视频中的视觉识别
IEEE Trans Pattern Anal Mach Intell. 2018 Aug;40(8):2030-2036. doi: 10.1109/TPAMI.2017.2734890. Epub 2017 Aug 2.
4
Learning Discriminative Cross-Modality Features for RGB-D Saliency Detection.学习用于RGB-D显著性检测的判别性跨模态特征。
IEEE Trans Image Process. 2022;31:1285-1297. doi: 10.1109/TIP.2022.3140606. Epub 2022 Jan 25.
5
Homogeneous-to-Heterogeneous: Unsupervised Learning for RGB-Infrared Person Re-Identification.从同质地到异质地:RGB-红外人像再识别的无监督学习。
IEEE Trans Image Process. 2021;30:6392-6407. doi: 10.1109/TIP.2021.3092578. Epub 2021 Jul 14.
6
MMNet: A Model-Based Multimodal Network for Human Action Recognition in RGB-D Videos.MMNet:一种基于模型的 RGB-D 视频人体动作识别多模态网络。
IEEE Trans Pattern Anal Mach Intell. 2023 Mar;45(3):3522-3538. doi: 10.1109/TPAMI.2022.3177813. Epub 2023 Feb 3.
7
Learning with Privileged Information via Adversarial Discriminative Modality Distillation.通过对抗性判别模态蒸馏进行带特权信息的学习。
IEEE Trans Pattern Anal Mach Intell. 2020 Oct;42(10):2581-2593. doi: 10.1109/TPAMI.2019.2929038. Epub 2019 Jul 16.
8
ASK: Adaptively Selecting Key Local Features for RGB-D Scene Recognition.问:为RGB-D场景识别自适应选择关键局部特征。
IEEE Trans Image Process. 2021;30:2722-2733. doi: 10.1109/TIP.2021.3053459. Epub 2021 Feb 10.
9
Robust action recognition via borrowing information across video modalities.通过跨视频模态信息借用实现鲁棒的动作识别。
IEEE Trans Image Process. 2015 Feb;24(2):709-23. doi: 10.1109/TIP.2014.2385591. Epub 2014 Dec 23.
10
Semisupervised feature selection via spline regression for video semantic recognition.基于样条回归的半监督特征选择在视频语义识别中的应用。
IEEE Trans Neural Netw Learn Syst. 2015 Feb;26(2):252-64. doi: 10.1109/TNNLS.2014.2314123.

引用本文的文献

1
RGB-D Data-Based Action Recognition: A Review.基于 RGB-D 数据的动作识别:综述。
Sensors (Basel). 2021 Jun 21;21(12):4246. doi: 10.3390/s21124246.