• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于多流自适应图卷积网络的骨架动作识别

Skeleton-Based Action Recognition with Multi-Stream Adaptive Graph Convolutional Networks.

作者信息

Shi Lei, Zhang Yifan, Cheng Jian, Lu Hanqing

出版信息

IEEE Trans Image Process. 2020 Oct 9;PP. doi: 10.1109/TIP.2020.3028207.

DOI:10.1109/TIP.2020.3028207
PMID:33035162
Abstract

Graph convolutional networks (GCNs), which generalize CNNs to more generic non-Euclidean structures, have achieved remarkable performance for skeleton-based action recognition. However, there still exist several issues in the previous GCN-based models. First, the topology of the graph is set heuristically and fixed over all the model layers and input data. This may not be suitable for the hierarchy of the GCN model and the diversity of the data in action recognition tasks. Second, the second-order information of the skeleton data, i.e., the length and orientation of the bones, is rarely investigated, which is naturally more informative and discriminative for the human action recognition. In this work, we propose a novel multi-stream attention-enhanced adaptive graph convolutional neural network (MS-AAGCN) for skeleton-based action recognition. The graph topology in our model can be either uniformly or individually learned based on the input data in an end-to-end manner. This data-driven approach increases the flexibility of the model for graph construction and brings more generality to adapt to various data samples. Besides, the proposed adaptive graph convolutional layer is further enhanced by a spatial-temporal-channel attention module, which helps the model pay more attention to important joints, frames and features. Moreover, the information of both the joints and bones, together with their motion information, are simultaneously modeled in a multi-stream framework, which shows notable improvement for the recognition accuracy. Extensive experiments on the two large-scale datasets, NTU-RGBD and Kinetics-Skeleton, demonstrate that the performance of our model exceeds the state-of-the-art with a significant margin.

摘要

图卷积网络(GCN)将卷积神经网络推广到更通用的非欧几里得结构,在基于骨架的动作识别方面取得了显著性能。然而,基于GCN的先前模型仍然存在几个问题。首先,图的拓扑结构是通过启发式设置的,并且在所有模型层和输入数据上都是固定的。这可能不适用于GCN模型的层次结构以及动作识别任务中数据的多样性。其次,骨架数据的二阶信息,即骨骼的长度和方向,很少被研究,而这些信息对于人类动作识别自然更具信息量和判别力。在这项工作中,我们提出了一种用于基于骨架的动作识别的新型多流注意力增强自适应图卷积神经网络(MS-AAGCN)。我们模型中的图拓扑可以基于输入数据以端到端的方式统一或单独学习。这种数据驱动的方法增加了模型在图构建方面的灵活性,并带来了更强的通用性以适应各种数据样本。此外,所提出的自适应图卷积层通过时空通道注意力模块进一步增强,这有助于模型更加关注重要的关节、帧和特征。而且,关节和骨骼的信息及其运动信息在多流框架中同时建模,这在识别准确率方面显示出显著提高。在两个大规模数据集NTU-RGBD和Kinetics-Skeleton上进行的大量实验表明,我们模型的性能大幅超越了当前的最优水平。

相似文献

1
Skeleton-Based Action Recognition with Multi-Stream Adaptive Graph Convolutional Networks.基于多流自适应图卷积网络的骨架动作识别
IEEE Trans Image Process. 2020 Oct 9;PP. doi: 10.1109/TIP.2020.3028207.
2
Adaptive Attention Memory Graph Convolutional Networks for Skeleton-Based Action Recognition.用于基于骨架的动作识别的自适应注意力记忆图卷积网络
Sensors (Basel). 2021 Oct 12;21(20):6761. doi: 10.3390/s21206761.
3
Multi-Modality Adaptive Feature Fusion Graph Convolutional Network for Skeleton-Based Action Recognition.基于骨架的动作识别的多模态自适应特征融合图卷积网络。
Sensors (Basel). 2023 Jun 7;23(12):5414. doi: 10.3390/s23125414.
4
Shallow Graph Convolutional Network for Skeleton-Based Action Recognition.基于骨架的动作识别的浅层图卷积网络。
Sensors (Basel). 2021 Jan 11;21(2):452. doi: 10.3390/s21020452.
5
Multi-scale and attention enhanced graph convolution network for skeleton-based violence action recognition.用于基于骨架的暴力行为识别的多尺度注意力增强图卷积网络。
Front Neurorobot. 2022 Dec 15;16:1091361. doi: 10.3389/fnbot.2022.1091361. eCollection 2022.
6
Graph Diffusion Convolutional Network for Skeleton Based Semantic Recognition of Two-Person Actions.基于骨架的两人动作语义识别的图扩散卷积网络。
IEEE Trans Pattern Anal Mach Intell. 2023 Jul;45(7):8477-8493. doi: 10.1109/TPAMI.2023.3238411. Epub 2023 Jun 5.
7
GAS-GCN: Gated Action-Specific Graph Convolutional Networks for Skeleton-Based Action Recognition.GAS-GCN:基于骨骼的动作识别的门控动作特定图卷积网络。
Sensors (Basel). 2020 Jun 21;20(12):3499. doi: 10.3390/s20123499.
8
TFC-GCN: Lightweight Temporal Feature Cross-Extraction Graph Convolutional Network for Skeleton-Based Action Recognition.TFC-GCN:基于骨架的动作识别的轻量级时间特征交叉提取图卷积网络。
Sensors (Basel). 2023 Jun 15;23(12):5593. doi: 10.3390/s23125593.
9
Feedback Graph Convolutional Network for Skeleton-Based Action Recognition.用于基于骨架的动作识别的反馈图卷积网络
IEEE Trans Image Process. 2022;31:164-175. doi: 10.1109/TIP.2021.3129117. Epub 2021 Dec 2.
10
Enhanced Adjacency Matrix-Based Lightweight Graph Convolution Network for Action Recognition.基于增强邻接矩阵的轻量级图卷积网络用于动作识别
Sensors (Basel). 2023 Jul 14;23(14):6397. doi: 10.3390/s23146397.

引用本文的文献

1
Fusion of Multimodal Spatio-Temporal Features and 3D Deformable Convolution Based on Sign Language Recognition in Sensor Networks.基于传感器网络中手语识别的多模态时空特征融合与3D可变形卷积
Sensors (Basel). 2025 Jul 13;25(14):4378. doi: 10.3390/s25144378.
2
A Comprehensive Methodological Survey of Human Activity Recognition Across Diverse Data Modalities.跨多种数据模态的人类活动识别综合方法学综述
Sensors (Basel). 2025 Jun 27;25(13):4028. doi: 10.3390/s25134028.
3
A Structured and Methodological Review on Multi-View Human Activity Recognition for Ambient Assisted Living.
面向环境辅助生活的多视图人类活动识别的结构化与方法学综述
J Imaging. 2025 Jun 3;11(6):182. doi: 10.3390/jimaging11060182.
4
Action recognition using part and attention enhanced feature fusion.基于部分与注意力增强特征融合的动作识别
Sci Rep. 2025 May 29;15(1):18780. doi: 10.1038/s41598-025-02461-6.
5
The application of suitable sports games for junior high school students based on deep learning and artificial intelligence.基于深度学习和人工智能的适合初中生的体育游戏应用
Sci Rep. 2025 May 16;15(1):17056. doi: 10.1038/s41598-025-01941-z.
6
Semantics-Assisted Training Graph Convolution Network for Skeleton-Based Action Recognition.用于基于骨架的动作识别的语义辅助训练图卷积网络
Sensors (Basel). 2025 Mar 15;25(6):1841. doi: 10.3390/s25061841.
7
Closed-loop rehabilitation of upper-limb dyskinesia after stroke: from natural motion to neuronal microfluidics.中风后上肢运动障碍的闭环康复:从自然运动到神经微流体
J Neuroeng Rehabil. 2025 Apr 19;22(1):87. doi: 10.1186/s12984-025-01617-9.
8
MAF-Net: A multimodal data fusion approach for human action recognition.MAF-Net:一种用于人类动作识别的多模态数据融合方法。
PLoS One. 2025 Apr 9;20(4):e0319656. doi: 10.1371/journal.pone.0319656. eCollection 2025.
9
A Spatial-Temporal Multi-Feature Network (STMF-Net) for Skeleton-Based Construction Worker Action Recognition.一种用于基于骨架的建筑工人动作识别的时空多特征网络(STMF-Net)。
Sensors (Basel). 2024 Nov 22;24(23):7455. doi: 10.3390/s24237455.
10
Sarcopenia diagnosis using skeleton-based gait sequence and foot-pressure image datasets.基于骨骼的步态序列和足底压力图像数据集进行肌肉减少症诊断。
Front Public Health. 2024 Nov 27;12:1443188. doi: 10.3389/fpubh.2024.1443188. eCollection 2024.