A Survey on 3D Skeleton-Based Action Recognition Using Learning Method

Authors

Ren Bin, Liu Mengyuan, Ding Runwei, Liu Hong

Affiliations

University of Pisa, Pisa, Italy.

University of Trento, Trento, Italy.

Publication information

Cyborg Bionic Syst. 2024 May 16;5:0100. doi: 10.34133/cbsystems.0100. eCollection 2024.

DOI:10.34133/cbsystems.0100

PMID:38757045

Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC11096730/

Abstract

Three-dimensional skeleton-based action recognition (3D SAR) has attracted considerable attention within the computer vision community, owing to the inherent advantages offered by skeleton data. As a result, a large body of work, including methods based on conventional handcrafted features and on learned feature extraction, has been produced over the years. However, prior surveys on action recognition have primarily focused on video or red-green-blue (RGB) data-dominated approaches, with limited coverage of skeleton data. Furthermore, despite the extensive application of deep learning methods in this field, no introductory or comprehensive review has been written from the perspective of deep learning architectures. To address these limitations, this survey first underscores the importance of action recognition and emphasizes the significance of 3D skeleton data as a valuable modality. Subsequently, we provide a comprehensive introduction to mainstream action recognition techniques based on 4 fundamental deep architectures, i.e., recurrent neural networks, convolutional neural networks, graph convolutional networks, and Transformers. All methods with the corresponding architectures are then presented in a data-driven manner with detailed discussion. Finally, we offer insights into the current largest 3D skeleton dataset, NTU-RGB+D, and its new edition, NTU-RGB+D 120, along with an overview of several top-performing algorithms on these datasets. To the best of our knowledge, this survey is the first comprehensive discussion of deep learning-based action recognition using 3D skeleton data.


https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3d25/11096730/27ad4df443df/cbsystems.0100.fig.001.jpg

Similar articles

Multi-scale and attention enhanced graph convolution network for skeleton-based violence action recognition.

Front Neurorobot. 2022 Dec 15;16:1091361. doi: 10.3389/fnbot.2022.1091361. eCollection 2022.

Deep Learning for Human Activity Recognition on 3D Human Skeleton: Survey and Comparative Study.

Sensors (Basel). 2023 May 27;23(11):5121. doi: 10.3390/s23115121.

Adaptive Attention Memory Graph Convolutional Networks for Skeleton-Based Action Recognition.

Sensors (Basel). 2021 Oct 12;21(20):6761. doi: 10.3390/s21206761.

Dynamic Edge Convolutional Neural Network for Skeleton-Based Human Action Recognition.

Sensors (Basel). 2023 Jan 10;23(2):778. doi: 10.3390/s23020778.

GAS-GCN: Gated Action-Specific Graph Convolutional Networks for Skeleton-Based Action Recognition.

Sensors (Basel). 2020 Jun 21;20(12):3499. doi: 10.3390/s20123499.

Multi-Modality Adaptive Feature Fusion Graph Convolutional Network for Skeleton-Based Action Recognition.

Sensors (Basel). 2023 Jun 7;23(12):5414. doi: 10.3390/s23125414.

Feedback Graph Convolutional Network for Skeleton-Based Action Recognition.

IEEE Trans Image Process. 2022;31:164-175. doi: 10.1109/TIP.2021.3129117. Epub 2021 Dec 2.

RGB-D Data-Based Action Recognition: A Review.

Sensors (Basel). 2021 Jun 21;21(12):4246. doi: 10.3390/s21124246.

TFC-GCN: Lightweight Temporal Feature Cross-Extraction Graph Convolutional Network for Skeleton-Based Action Recognition.

Sensors (Basel). 2023 Jun 15;23(12):5593. doi: 10.3390/s23125593.

Cited by

BioCompNet: A Deep Learning Workflow Enabling Automated Body Composition Analysis toward Precision Management of Cardiometabolic Disorders.

Cyborg Bionic Syst. 2025 Aug 20;6:0381. doi: 10.34133/cbsystems.0381. eCollection 2025.

Predictive power of the HEMPA risk assessment method for musculoskeletal disorders in nurses and caregivers: insights and implications.

BMC Nurs. 2025 Jul 1;24(1):811. doi: 10.1186/s12912-025-03297-1.

Construction of intelligent gymnastics teaching model based on neural network and artificial intelligence.

Sci Rep. 2025 Jul 1;15(1):22105. doi: 10.1038/s41598-025-06839-4.

Machine Learning for Human Activity Recognition: State-of-the-Art Techniques and Emerging Trends.

J Imaging. 2025 Mar 20;11(3):91. doi: 10.3390/jimaging11030091.

Sci Rep. 2024 Dec 28;14(1):30908. doi: 10.1038/s41598-024-81762-8.

Improved Generalizability in Medical Computer Vision: Hyperbolic Deep Learning in Multi-Modality Neuroimaging.

J Imaging. 2024 Dec 12;10(12):319. doi: 10.3390/jimaging10120319.

Multi-Stream Fusion Network for Skeleton-Based Construction Worker Action Recognition.

Sensors (Basel). 2023 Nov 23;23(23):9350. doi: 10.3390/s23239350.

Multi-Camera-Based Human Activity Recognition for Human-Robot Collaboration in Construction.

Sensors (Basel). 2023 Aug 7;23(15):6997. doi: 10.3390/s23156997.

Deep Learning for Human Activity Recognition on 3D Human Skeleton: Survey and Comparative Study.

Sensors (Basel). 2023 May 27;23(11):5121. doi: 10.3390/s23115121.

Using EfficientNet-B7 (CNN), Variational Auto Encoder (VAE) and Siamese Twins' Networks to Evaluate Human Exercises as Super Objects in a TSSCI Images.

J Pers Med. 2023 May 22;13(5):874. doi: 10.3390/jpm13050874.
