• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

具有鲁棒解耦损失的多维细化图卷积网络用于基于细粒度骨架的动作识别

Multidimensional Refinement Graph Convolutional Network With Robust Decouple Loss for Fine-Grained Skeleton-Based Action Recognition.

作者信息

Liu Sheng-Lan, Ding Yu-Ning, Zhang Jin-Rong, Liu Kai-Yuan, Zhang Si-Fan, Wang Fei-Long, Huang Gao

出版信息

IEEE Trans Neural Netw Learn Syst. 2025 Apr;36(4):7615-7626. doi: 10.1109/TNNLS.2024.3384770. Epub 2025 Apr 4.

DOI:10.1109/TNNLS.2024.3384770
PMID:38619962
Abstract

Graph convolutional networks (GCNs) have been widely used in skeleton-based action recognition. However, existing approaches are limited in fine-grained action recognition due to the similarity of interclass data. Moreover, the noisy data from pose extraction increase the challenge of fine-grained recognition. In this work, we propose a flexible attention block called channel-variable spatial-temporal attention (CVSTA) to enhance the discriminative power of spatial-temporal joints and obtain a more compact intraclass feature distribution. Based on CVSTA, we construct a multidimensional refinement GCN (MDR-GCN) that can improve the discrimination among channel-, joint-, and frame-level features for fine-grained actions. Furthermore, we propose a robust decouple loss (RDL) that significantly boosts the effect of the CVSTA and reduces the impact of noise. The proposed method combining MDR-GCN with RDL outperforms the known state-of-the-art skeleton-based approaches on fine-grained datasets, FineGym99 and FSD-10, and also on the coarse NTU-RGB + D 120 dataset and NTU-RGB + D X-view version. Our code is publicly available at https://github.com/dingyn-Reno/MDR-GCN.

摘要

图卷积网络(GCN)已被广泛应用于基于骨架的动作识别。然而,由于类间数据的相似性,现有的方法在细粒度动作识别方面存在局限性。此外,姿态提取过程中产生的噪声数据增加了细粒度识别的挑战。在这项工作中,我们提出了一种灵活的注意力模块,称为通道可变时空注意力(CVSTA),以增强时空关节的判别能力,并获得更紧凑的类内特征分布。基于CVSTA,我们构建了一个多维细化GCN(MDR-GCN),它可以提高细粒度动作在通道、关节和帧级特征之间的辨别能力。此外,我们还提出了一种鲁棒解耦损失(RDL),它显著增强了CVSTA的效果,并降低了噪声的影响。将MDR-GCN与RDL相结合的方法在细粒度数据集FineGym99和FSD-10以及粗糙的NTU-RGB + D 120数据集和NTU-RGB + D X-view版本上优于已知的基于骨架的先进方法。我们的代码可在https://github.com/dingyn-Reno/MDR-GCN上公开获取。

相似文献

1
Multidimensional Refinement Graph Convolutional Network With Robust Decouple Loss for Fine-Grained Skeleton-Based Action Recognition.具有鲁棒解耦损失的多维细化图卷积网络用于基于细粒度骨架的动作识别
IEEE Trans Neural Netw Learn Syst. 2025 Apr;36(4):7615-7626. doi: 10.1109/TNNLS.2024.3384770. Epub 2025 Apr 4.
2
Feedback Graph Convolutional Network for Skeleton-Based Action Recognition.用于基于骨架的动作识别的反馈图卷积网络
IEEE Trans Image Process. 2022;31:164-175. doi: 10.1109/TIP.2021.3129117. Epub 2021 Dec 2.
3
TFC-GCN: Lightweight Temporal Feature Cross-Extraction Graph Convolutional Network for Skeleton-Based Action Recognition.TFC-GCN:基于骨架的动作识别的轻量级时间特征交叉提取图卷积网络。
Sensors (Basel). 2023 Jun 15;23(12):5593. doi: 10.3390/s23125593.
4
Graph Diffusion Convolutional Network for Skeleton Based Semantic Recognition of Two-Person Actions.基于骨架的两人动作语义识别的图扩散卷积网络。
IEEE Trans Pattern Anal Mach Intell. 2023 Jul;45(7):8477-8493. doi: 10.1109/TPAMI.2023.3238411. Epub 2023 Jun 5.
5
Adaptive Attention Memory Graph Convolutional Networks for Skeleton-Based Action Recognition.用于基于骨架的动作识别的自适应注意力记忆图卷积网络
Sensors (Basel). 2021 Oct 12;21(20):6761. doi: 10.3390/s21206761.
6
GAS-GCN: Gated Action-Specific Graph Convolutional Networks for Skeleton-Based Action Recognition.GAS-GCN:基于骨骼的动作识别的门控动作特定图卷积网络。
Sensors (Basel). 2020 Jun 21;20(12):3499. doi: 10.3390/s20123499.
7
Multi-scale and attention enhanced graph convolution network for skeleton-based violence action recognition.用于基于骨架的暴力行为识别的多尺度注意力增强图卷积网络。
Front Neurorobot. 2022 Dec 15;16:1091361. doi: 10.3389/fnbot.2022.1091361. eCollection 2022.
8
Graph Convolutional Networks for multi-modal robotic martial arts leg pose recognition.用于多模态机器人武术腿部姿势识别的图卷积网络
Front Neurorobot. 2025 Jan 20;18:1520983. doi: 10.3389/fnbot.2024.1520983. eCollection 2024.
9
Shallow Graph Convolutional Network for Skeleton-Based Action Recognition.基于骨架的动作识别的浅层图卷积网络。
Sensors (Basel). 2021 Jan 11;21(2):452. doi: 10.3390/s21020452.
10
Two-stream spatio-temporal GCN-transformer networks for skeleton-based action recognition.用于基于骨架的动作识别的双流时空GCN-Transformer网络
Sci Rep. 2025 Feb 10;15(1):4982. doi: 10.1038/s41598-025-87752-8.