Specificity and Latent Correlation Learning for Action Recognition Using Synthetic Multi-View Data From Depth Maps.

Publication information

IEEE Trans Image Process. 2017 Dec;26(12):5560-5574. doi: 10.1109/TIP.2017.2740122. Epub 2017 Aug 14.

DOI: 10.1109/TIP.2017.2740122
PMID: 28816663
Abstract

This paper presents a novel approach to action recognition using synthetic multi-view data from depth maps. Specifically, multiple views are first generated by rotating 3D point clouds from depth maps. A pyramid multi-view depth motion template is then adopted for multi-view action representation, characterizing the multi-scale motion and shape patterns in 3D. Empirically, in addition to the view-specific information, the latent information shared between multiple views often provides important cues for action recognition. Building on this observation and motivated by the success of the dictionary learning framework, this paper proposes to explicitly learn a view-specific dictionary (called specificity) for each view, and simultaneously learn a latent dictionary (called latent correlation) across multiple views. The resulting method, specificity and latent correlation learning, learns the specificity that captures the most discriminative features of each view and the latent correlation that contributes the inherent 3D information to multiple views. In this way, a compact and discriminative dictionary is constructed from the specificity and latent correlation for feature representation of actions. The proposed method is evaluated on the MSR Action3D, MSR Gesture3D, MSR Action Pairs, and ChaLearn multi-modal data sets, consistently achieving promising results compared with state-of-the-art methods based on depth data.
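
The view-generation step described in the abstract (rotating a 3D point cloud obtained from a depth map) can be illustrated with a minimal NumPy sketch. This is not the authors' implementation: the orthographic projection with one pixel per unit, the rotation about a vertical axis through the cloud centroid, the default angles, and all function names (`depth_to_points`, `rotate_about_y`, `points_to_depth`, `synthesize_views`) are assumptions made for illustration only.

```python
import numpy as np

def depth_to_points(depth):
    """Back-project a depth map (H, W) into an (N, 3) point cloud.

    Assumption: orthographic camera, one pixel per unit, and a depth of
    zero means "no measurement".
    """
    h, w = depth.shape
    v, u = np.nonzero(depth > 0)
    x = u - (w - 1) / 2.0
    y = v - (h - 1) / 2.0
    z = depth[v, u].astype(np.float64)
    return np.stack([x, y, z], axis=1)

def rotate_about_y(points, angle_deg):
    """Rotate points about a vertical axis through the cloud centroid."""
    theta = np.deg2rad(angle_deg)
    c, s = np.cos(theta), np.sin(theta)
    rot = np.array([[c, 0.0, s], [0.0, 1.0, 0.0], [-s, 0.0, c]])
    centroid = points.mean(axis=0)
    return (points - centroid) @ rot.T + centroid

def points_to_depth(points, shape):
    """Re-project points orthographically, keeping the nearest point per pixel."""
    h, w = shape
    u = np.rint(points[:, 0] + (w - 1) / 2.0).astype(int)
    v = np.rint(points[:, 1] + (h - 1) / 2.0).astype(int)
    z = points[:, 2]
    inside = (u >= 0) & (u < w) & (v >= 0) & (v < h) & (z > 0)
    # Z-buffer: start at +inf and take the per-pixel minimum depth.
    buf = np.full(shape, np.inf)
    np.minimum.at(buf, (v[inside], u[inside]), z[inside])
    return np.where(np.isinf(buf), 0.0, buf)

def synthesize_views(depth, angles_deg=(-30.0, 0.0, 30.0)):
    """Return one synthetic depth map per rotation angle (angles are illustrative)."""
    points = depth_to_points(depth)
    if points.shape[0] == 0:
        return [np.zeros(depth.shape) for _ in angles_deg]
    return [points_to_depth(rotate_about_y(points, a), depth.shape)
            for a in angles_deg]
```

Calling `synthesize_views(depth)` on each frame of a depth sequence would yield one synthetic sequence per angle, which could then feed a per-view motion representation. A real pipeline would likely use the sensor's intrinsics for a perspective projection and fill the holes that rotation opens in the rendered maps.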

Similar articles

1
Specificity and Latent Correlation Learning for Action Recognition Using Synthetic Multi-View Data From Depth Maps.
IEEE Trans Image Process. 2017 Dec;26(12):5560-5574. doi: 10.1109/TIP.2017.2740122. Epub 2017 Aug 14.
2
Cross-View Action Recognition via Transferable Dictionary Learning.
IEEE Trans Image Process. 2016 May;25(6):2542-56. doi: 10.1109/TIP.2016.2548242.
3
Multi-Domain & Multi-Task Learning for Human Action Recognition.
IEEE Trans Image Process. 2018 Sep 28. doi: 10.1109/TIP.2018.2872879.
4
Exploring 3D Human Action Recognition Using STACOG on Multi-View Depth Motion Maps Sequences.
Sensors (Basel). 2021 May 24;21(11):3642. doi: 10.3390/s21113642.
5
Multiview Latent Space Learning With Feature Redundancy Minimization.
IEEE Trans Cybern. 2020 Apr;50(4):1655-1668. doi: 10.1109/TCYB.2018.2883673. Epub 2018 Dec 14.
6
Person Re-Identification by Cross-View Multi-Level Dictionary Learning.
IEEE Trans Pattern Anal Mach Intell. 2018 Dec;40(12):2963-2977. doi: 10.1109/TPAMI.2017.2764893. Epub 2017 Oct 26.
7
Robust 3D Hand Pose Estimation From Single Depth Images Using Multi-View CNNs.
IEEE Trans Image Process. 2018 Sep;27(9):4422-4436. doi: 10.1109/TIP.2018.2834824.
8
Discriminative margin-sensitive autoencoder for collective multi-view disease analysis.
Neural Netw. 2020 Mar;123:94-107. doi: 10.1016/j.neunet.2019.11.013. Epub 2019 Dec 2.
9
Multi-channel EEG-based sleep stage classification with joint collaborative representation and multiple kernel learning.
J Neurosci Methods. 2015 Oct 30;254:94-101. doi: 10.1016/j.jneumeth.2015.07.006. Epub 2015 Jul 17.
10
Multi-View Multi-Instance Learning Based on Joint Sparse Representation and Multi-View Dictionary Learning.
IEEE Trans Pattern Anal Mach Intell. 2017 Dec;39(12):2554-2560. doi: 10.1109/TPAMI.2017.2669303. Epub 2017 Feb 14.

Cited by

1
Computer-Aided Multi-Target Management of Emergent Alzheimer's Disease.
Bioinformation. 2018 May 5;14(4):167-180. doi: 10.6026/97320630014167. eCollection 2018.