• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

用于基于骨架的高性能人体动作识别的视图自适应神经网络。

View Adaptive Neural Networks for High Performance Skeleton-Based Human Action Recognition.

作者信息

Zhang Pengfei, Lan Cuiling, Xing Junliang, Zeng Wenjun, Xue Jianru, Zheng Nanning

出版信息

IEEE Trans Pattern Anal Mach Intell. 2019 Aug;41(8):1963-1978. doi: 10.1109/TPAMI.2019.2896631. Epub 2019 Jan 31.

DOI:10.1109/TPAMI.2019.2896631
PMID:30714909
Abstract

Skeleton-based human action recognition has recently attracted increasing attention thanks to the accessibility and the popularity of 3D skeleton data. One of the key challenges in action recognition lies in the large variations of action representations when they are captured from different viewpoints. In order to alleviate the effects of view variations, this paper introduces a novel view adaptation scheme, which automatically determines the virtual observation viewpoints over the course of an action in a learning based data driven manner. Instead of re-positioning the skeletons using a fixed human-defined prior criterion, we design two view adaptive neural networks, i.e., VA-RNN and VA-CNN, which are respectively built based on the recurrent neural network (RNN) with the Long Short-term Memory (LSTM) and the convolutional neural network (CNN). For each network, a novel view adaptation module learns and determines the most suitable observation viewpoints, and transforms the skeletons to those viewpoints for the end-to-end recognition with a main classification network. Ablation studies find that the proposed view adaptive models are capable of transforming the skeletons of various views to much more consistent virtual viewpoints. Therefore, the models largely eliminate the influence of the viewpoints, enabling the networks to focus on the learning of action-specific features and thus resulting in superior performance. In addition, we design a two-stream scheme (referred to as VA-fusion) that fuses the scores of the two networks to provide the final prediction, obtaining enhanced performance. Moreover, random rotation of skeleton sequences is employed to improve the robustness of view adaptation models and alleviate overfitting during training. Extensive experimental evaluations on five challenging benchmarks demonstrate the effectiveness of the proposed view-adaptive networks and superior performance over state-of-the-art approaches.

摘要

基于骨骼的人体动作识别近年来因3D骨骼数据的可获取性和普及性而受到越来越多的关注。动作识别中的关键挑战之一在于从不同视角捕捉动作表示时存在的巨大变化。为了减轻视角变化的影响,本文引入了一种新颖的视角适应方案,该方案以基于学习的数据驱动方式在动作过程中自动确定虚拟观察视角。我们不是使用固定的人为定义的先验标准来重新定位骨骼,而是设计了两个视角自适应神经网络,即VA - RNN和VA - CNN,它们分别基于带有长短期记忆(LSTM)的递归神经网络(RNN)和卷积神经网络(CNN)构建。对于每个网络,一个新颖的视角适应模块学习并确定最合适的观察视角,并将骨骼转换到这些视角,以便与主分类网络进行端到端识别。消融研究发现,所提出的视角自适应模型能够将各种视角的骨骼转换为更加一致的虚拟视角。因此,这些模型在很大程度上消除了视角的影响,使网络能够专注于动作特定特征的学习,从而获得卓越的性能。此外,我们设计了一种双流方案(称为VA融合),融合两个网络的得分以提供最终预测,从而获得增强的性能。此外,采用骨骼序列的随机旋转来提高视角适应模型的鲁棒性并减轻训练期间的过拟合。在五个具有挑战性的基准上进行的广泛实验评估证明了所提出的视角自适应网络的有效性以及相对于现有最先进方法的卓越性能。

相似文献

1
View Adaptive Neural Networks for High Performance Skeleton-Based Human Action Recognition.用于基于骨架的高性能人体动作识别的视图自适应神经网络。
IEEE Trans Pattern Anal Mach Intell. 2019 Aug;41(8):1963-1978. doi: 10.1109/TPAMI.2019.2896631. Epub 2019 Jan 31.
2
Skeleton-Based Action Recognition Based on Distance Vector and Multihigh View Adaptive Networks.基于距离向量和多高视自适应网络的骨架动作识别。
Comput Intell Neurosci. 2021 Aug 18;2021:1507770. doi: 10.1155/2021/1507770. eCollection 2021.
3
Skeleton-Based Human Action Recognition With Global Context-Aware Attention LSTM Networks.基于骨架的全局上下文感知注意力 LSTM 网络的人体动作识别。
IEEE Trans Image Process. 2018 Apr;27(4):1586-1599. doi: 10.1109/TIP.2017.2785279.
4
Spatio⁻Temporal Image Representation of 3D Skeletal Movements for View-Invariant Action Recognition with Deep Convolutional Neural Networks.用于深度卷积神经网络的视图不变动作识别的3D骨骼运动的时空图像表示
Sensors (Basel). 2019 Apr 24;19(8):1932. doi: 10.3390/s19081932.
5
Adaptive Attention Memory Graph Convolutional Networks for Skeleton-Based Action Recognition.用于基于骨架的动作识别的自适应注意力记忆图卷积网络
Sensors (Basel). 2021 Oct 12;21(20):6761. doi: 10.3390/s21206761.
6
Representation Learning of Temporal Dynamics for Skeleton-Based Action Recognition.基于骨架的动作识别的时态动力学表示学习。
IEEE Trans Image Process. 2016 Jul;25(7):3010-3022. doi: 10.1109/TIP.2016.2552404. Epub 2016 Apr 8.
7
Modality Compensation Network: Cross-Modal Adaptation for Action Recognition.模态补偿网络:用于动作识别的跨模态自适应
IEEE Trans Image Process. 2020 Jan 23. doi: 10.1109/TIP.2020.2967577.
8
View-Invariant Human Action Recognition Based on a 3D Bio-Constrained Skeleton Model.基于三维生物约束骨骼模型的视图不变人体动作识别。
IEEE Trans Image Process. 2019 Aug;28(8):3959-3972. doi: 10.1109/TIP.2019.2907048. Epub 2019 Mar 22.
9
Skeleton-Based Action Recognition with Multi-Stream Adaptive Graph Convolutional Networks.基于多流自适应图卷积网络的骨架动作识别
IEEE Trans Image Process. 2020 Oct 9;PP. doi: 10.1109/TIP.2020.3028207.
10
Spatio-Temporal Attention-Based LSTM Networks for 3D Action Recognition and Detection.基于时空注意力的 LSTM 网络用于 3D 动作识别与检测。
IEEE Trans Image Process. 2018 Jul;27(7):3459-3471. doi: 10.1109/TIP.2018.2818328.

引用本文的文献

1
A novel multi-modal rehabilitation monitoring over human motion intention recognition.一种基于人体运动意图识别的新型多模态康复监测。
Front Bioeng Biotechnol. 2025 Jul 17;13:1568690. doi: 10.3389/fbioe.2025.1568690. eCollection 2025.
2
Semantics-Assisted Training Graph Convolution Network for Skeleton-Based Action Recognition.用于基于骨架的动作识别的语义辅助训练图卷积网络
Sensors (Basel). 2025 Mar 15;25(6):1841. doi: 10.3390/s25061841.
3
Estimating a 3D Human Skeleton from a Single RGB Image by Fusing Predicted Depths from Multiple Virtual Viewpoints.
通过融合多个虚拟视角预测的深度信息从单张RGB图像估计三维人体骨骼
Sensors (Basel). 2024 Dec 15;24(24):8017. doi: 10.3390/s24248017.
4
BodyFlow: An Open-Source Library for Multimodal Human Activity Recognition.BodyFlow:用于多模态人体活动识别的开源库。
Sensors (Basel). 2024 Oct 19;24(20):6729. doi: 10.3390/s24206729.
5
A Survey on 3D Skeleton-Based Action Recognition Using Learning Method.基于学习方法的三维骨骼动作识别研究
Cyborg Bionic Syst. 2024 May 16;5:0100. doi: 10.34133/cbsystems.0100. eCollection 2024.
6
Spatial-Temporal Self-Attention Enhanced Graph Convolutional Networks for Fitness Yoga Action Recognition.基于时空自注意力增强图卷积网络的健身瑜伽动作识别。
Sensors (Basel). 2023 May 14;23(10):4741. doi: 10.3390/s23104741.
7
Geometric Deep Neural Network Using Rigid and Non-Rigid Transformations for Landmark-Based Human Behavior Analysis.基于刚性和非刚性变换的几何深度学习网络在基于关键点的人体行为分析中的应用。
IEEE Trans Pattern Anal Mach Intell. 2023 Nov;45(11):13314-13327. doi: 10.1109/TPAMI.2023.3291663. Epub 2023 Oct 3.
8
Multi-view image-based behavior classification of wet-dog shake in Kainate rat model.基于多视图图像的红藻氨酸盐大鼠模型中湿狗摇行为分类
Front Behav Neurosci. 2023 May 2;17:1148549. doi: 10.3389/fnbeh.2023.1148549. eCollection 2023.
9
Generalized Pose Decoupled Network for Unsupervised 3D Skeleton Sequence-Based Action Representation Learning.用于基于无监督3D骨架序列的动作表示学习的广义姿态解耦网络。
Cyborg Bionic Syst. 2022;2022:0002. doi: 10.34133/cbsystems.0002. Epub 2022 Dec 30.
10
Explainable fMRI-based brain decoding via spatial temporal-pyramid graph convolutional network.基于可解释功能磁共振成像的时空金字塔图卷积网络脑解码。
Hum Brain Mapp. 2023 May;44(7):2921-2935. doi: 10.1002/hbm.26255. Epub 2023 Feb 28.