IEEE Trans Image Process. 2018 Sep;27(9):4382-4394. doi: 10.1109/TIP.2018.2837386.
Skeleton-based action recognition has recently become popular owing to the development of cost-effective depth sensors and fast pose estimation algorithms. Traditional methods based on pose descriptors often fail on large-scale datasets because of the limited representational power of engineered features. Recent recurrent neural network (RNN) based approaches mostly focus on the temporal evolution of body joints and neglect their geometric relations. In this paper, we aim to leverage the geometric relations among joints for action recognition. We introduce three primitive geometries: joints, edges, and surfaces. Accordingly, a generic end-to-end RNN-based network is designed to accommodate the three inputs. For action recognition, a novel viewpoint transformation layer and temporal dropout layers are employed in the RNN-based network to learn robust representations. For action detection, we first perform frame-wise action classification and then apply a novel multi-scale sliding window algorithm. Experiments on large-scale 3D action recognition benchmark datasets show that joints, edges, and surfaces are effective and complementary for different actions. Our approaches dramatically outperform existing state-of-the-art methods on both action recognition and action detection.
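The three primitive geometries named in the abstract can be illustrated with a minimal sketch, assuming skeleton frames arrive as (J, 3) arrays of 3D joint coordinates. The bone list, joint triplets, and toy skeleton below are hypothetical placeholders for illustration, not the paper's exact definitions: edges are taken here as displacement vectors between connected joints, and surfaces as unit normals of planes spanned by joint triplets.

```python
import numpy as np

def edge_features(joints, bones):
    """Edges: displacement vectors between connected joint pairs."""
    return np.array([joints[b] - joints[a] for a, b in bones])

def surface_features(joints, triplets):
    """Surfaces: unit normals of planes spanned by joint triplets."""
    normals = []
    for a, b, c in triplets:
        n = np.cross(joints[b] - joints[a], joints[c] - joints[a])
        normals.append(n / (np.linalg.norm(n) + 1e-8))  # avoid div by 0
    return np.array(normals)

# Hypothetical 4-joint skeleton frame: hip, spine, shoulder, elbow.
frame = np.array([[0.0, 0.0, 0.0],
                  [0.0, 0.5, 0.0],
                  [0.2, 0.9, 0.0],
                  [0.5, 0.9, 0.1]])
bones = [(0, 1), (1, 2), (2, 3)]       # assumed connectivity
triplets = [(0, 1, 2), (1, 2, 3)]      # assumed joint triplets

edges = edge_features(frame, bones)        # shape (3, 3)
surfaces = surface_features(frame, triplets)  # shape (2, 3)
print(edges.shape, surfaces.shape)
```

In the paper's pipeline these per-frame joint, edge, and surface features would each feed one branch of the RNN-based network; the sketch only shows how such geometric inputs could be derived from raw joint coordinates.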