• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于骨架的人体动作识别的动态边缘卷积神经网络。

Dynamic Edge Convolutional Neural Network for Skeleton-Based Human Action Recognition.

机构信息

School of Electronics and Information Engineering, Korea Aerospace University, Goyang 10540, Republic of Korea.

出版信息

Sensors (Basel). 2023 Jan 10;23(2):778. doi: 10.3390/s23020778.

DOI:10.3390/s23020778
PMID:36679576
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC9864180/
Abstract

To provide accessible, intelligent, and efficient remote access such as the internet of things, rehabilitation, autonomous driving, virtual games, and healthcare, human action recognition (HAR) has gained much attention among computer vision researchers. Several methods have already been addressed to ensure effective and efficient action recognition based on different perspectives including data modalities, feature design, network configuration, and application domains. In this article, we design a new deep learning model by integrating criss-cross attention and edge convolution to extract discriminative features from the skeleton sequence for action recognition. The attention mechanism is applied in spatial and temporal directions to pursue the intra- and inter-frame relationships. Then, several edge convolutional layers are conducted to explore the geometric relationships among the neighboring joints in the human body. The proposed model is dynamically updated after each layer by recomputing the graph on the basis of k-nearest joints for learning local and global information in action sequences. We used publicly available benchmark skeleton datasets such as UTD-MHAD (University of Texas at Dallas multimodal human action dataset) and MSR-Action3D (Microsoft action 3D) to evaluate the proposed method. We also investigated the proposed method with different configurations of network architectures to assure effectiveness and robustness. The proposed method achieved average accuracies of 99.53% and 95.64% on the UTD-MHAD and MSR-Action3D datasets, respectively, outperforming state-of-the-art methods.

摘要

为了提供物联网、康复、自动驾驶、虚拟游戏和医疗保健等方面的可访问、智能和高效的远程访问,人体动作识别 (HAR) 在计算机视觉研究人员中引起了广泛关注。已经提出了几种方法,以确保基于不同视角(包括数据模态、特征设计、网络配置和应用领域)的有效和高效的动作识别。在本文中,我们设计了一种新的深度学习模型,通过集成交叉注意和边缘卷积从骨架序列中提取判别特征,用于动作识别。注意力机制应用于空间和时间方向,以研究帧内和帧间关系。然后,进行了几个边缘卷积层,以探索人体中相邻关节之间的几何关系。该模型通过在基于 k-最近邻关节的基础上重新计算图,在动作序列中学习局部和全局信息,从而在每层后动态更新。我们使用了公开可用的骨架基准数据集,如 UTD-MHAD(达拉斯大学多模态人体动作数据集)和 MSR-Action3D(微软动作 3D),来评估所提出的方法。我们还研究了具有不同网络架构配置的所提出的方法,以确保有效性和鲁棒性。所提出的方法在 UTD-MHAD 和 MSR-Action3D 数据集上的平均准确率分别为 99.53%和 95.64%,优于最先进的方法。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f1df/9864180/19fdac1ce61d/sensors-23-00778-g010.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f1df/9864180/647e70581af2/sensors-23-00778-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f1df/9864180/ed731f343375/sensors-23-00778-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f1df/9864180/0b771766bfe8/sensors-23-00778-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f1df/9864180/c01230ca0231/sensors-23-00778-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f1df/9864180/c8d27fb56dde/sensors-23-00778-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f1df/9864180/68387a2bef24/sensors-23-00778-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f1df/9864180/d75521b44797/sensors-23-00778-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f1df/9864180/0c0db5552f59/sensors-23-00778-g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f1df/9864180/05b4eb8dfa65/sensors-23-00778-g009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f1df/9864180/19fdac1ce61d/sensors-23-00778-g010.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f1df/9864180/647e70581af2/sensors-23-00778-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f1df/9864180/ed731f343375/sensors-23-00778-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f1df/9864180/0b771766bfe8/sensors-23-00778-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f1df/9864180/c01230ca0231/sensors-23-00778-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f1df/9864180/c8d27fb56dde/sensors-23-00778-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f1df/9864180/68387a2bef24/sensors-23-00778-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f1df/9864180/d75521b44797/sensors-23-00778-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f1df/9864180/0c0db5552f59/sensors-23-00778-g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f1df/9864180/05b4eb8dfa65/sensors-23-00778-g009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f1df/9864180/19fdac1ce61d/sensors-23-00778-g010.jpg

相似文献

1
Dynamic Edge Convolutional Neural Network for Skeleton-Based Human Action Recognition.基于骨架的人体动作识别的动态边缘卷积神经网络。
Sensors (Basel). 2023 Jan 10;23(2):778. doi: 10.3390/s23020778.
2
A union of deep learning and swarm-based optimization for 3D human action recognition.基于深度学习和群体智能优化的三维人体动作识别方法。
Sci Rep. 2022 Mar 31;12(1):5494. doi: 10.1038/s41598-022-09293-8.
3
Adaptive Attention Memory Graph Convolutional Networks for Skeleton-Based Action Recognition.用于基于骨架的动作识别的自适应注意力记忆图卷积网络
Sensors (Basel). 2021 Oct 12;21(20):6761. doi: 10.3390/s21206761.
4
Graph Edge Convolutional Neural Networks for Skeleton-Based Action Recognition.基于骨架的动作识别的图边缘卷积神经网络。
IEEE Trans Neural Netw Learn Syst. 2020 Aug;31(8):3047-3060. doi: 10.1109/TNNLS.2019.2935173. Epub 2019 Sep 17.
5
Enhancing Human Activity Recognition through Integrated Multimodal Analysis: A Focus on RGB Imaging, Skeletal Tracking, and Pose Estimation.通过集成多模态分析增强人类活动识别:重点关注 RGB 成像、骨骼跟踪和姿势估计。
Sensors (Basel). 2024 Jul 17;24(14):4646. doi: 10.3390/s24144646.
6
Multi-View Human Action Recognition Using Skeleton Based-FineKNN with Extraneous Frame Scrapping Technique.基于骨架的 FineKNN 与额外帧剔除技术的多视角人体动作识别。
Sensors (Basel). 2023 Mar 2;23(5):2745. doi: 10.3390/s23052745.
7
Hypergraph Neural Network for Skeleton-Based Action Recognition.基于骨架的动作识别的超图神经网络。
IEEE Trans Image Process. 2021;30:2263-2275. doi: 10.1109/TIP.2021.3051495. Epub 2021 Jan 26.
8
View-Invariant Human Action Recognition Based on a 3D Bio-Constrained Skeleton Model.基于三维生物约束骨骼模型的视图不变人体动作识别。
IEEE Trans Image Process. 2019 Aug;28(8):3959-3972. doi: 10.1109/TIP.2019.2907048. Epub 2019 Mar 22.
9
Skeleton Graph-Neural-Network-Based Human Action Recognition: A Survey.基于骨架图神经网络的人体动作识别研究综述
Sensors (Basel). 2022 Mar 8;22(6):2091. doi: 10.3390/s22062091.
10
Deep Learning for Human Activity Recognition on 3D Human Skeleton: Survey and Comparative Study.基于 3D 人体骨骼的人类活动识别深度学习:综述与比较研究。
Sensors (Basel). 2023 May 27;23(11):5121. doi: 10.3390/s23115121.

引用本文的文献

1
Ethnic dance movement instruction guided by artificial intelligence and 3D convolutional neural networks.由人工智能和3D卷积神经网络指导的民族舞蹈动作教学
Sci Rep. 2025 May 15;15(1):16856. doi: 10.1038/s41598-025-01879-2.
2
A bioinspired in-materia analog photoelectronic reservoir computing for human action processing.一种用于人类动作处理的受生物启发的材料内模拟光电储层计算。
Nat Commun. 2025 Mar 6;16(1):2263. doi: 10.1038/s41467-025-56899-3.
3
Adopting Graph Neural Networks to Analyze Human-Object Interactions for Inferring Activities of Daily Living.

本文引用的文献

1
A Comprehensive Review of Recent Deep Learning Techniques for Human Activity Recognition.深度学习技术在人体活动识别中的研究进展综述
Comput Intell Neurosci. 2022 Apr 20;2022:8323962. doi: 10.1155/2022/8323962. eCollection 2022.
2
Incomplete Region Estimation and Restoration of 3D Point Cloud Human Face Datasets.三维点云人脸数据集的不完全区域估计和恢复。
Sensors (Basel). 2022 Jan 18;22(3):723. doi: 10.3390/s22030723.
3
A Comprehensive Survey on Graph Neural Networks.图神经网络综述。
采用图神经网络分析人与物体的交互以推断日常生活活动
Sensors (Basel). 2024 Apr 17;24(8):2567. doi: 10.3390/s24082567.
4
C-DTW for Human Action Recognition Based on Nanogenerator.基于纳米发电机的人体动作识别的 C-DTW 方法。
Sensors (Basel). 2023 Aug 17;23(16):7230. doi: 10.3390/s23167230.
5
KNN-Based Machine Learning Classifier Used on Deep Learned Spatial Motion Features for Human Action Recognition.基于K近邻算法的机器学习分类器在深度学习的空间运动特征用于人体动作识别中的应用
Entropy (Basel). 2023 May 25;25(6):844. doi: 10.3390/e25060844.
IEEE Trans Neural Netw Learn Syst. 2021 Jan;32(1):4-24. doi: 10.1109/TNNLS.2020.2978386. Epub 2021 Jan 4.
4
View-invariant Deep Architecture for Human Action Recognition using Two-stream Motion and Shape Temporal Dynamics.使用双流运动和形状时间动态的用于人类动作识别的视图不变深度架构
IEEE Trans Image Process. 2020 Jan 15. doi: 10.1109/TIP.2020.2965299.
5
A Deep Learning Framework for Assessing Physical Rehabilitation Exercises.深度学习框架用于评估物理康复运动。
IEEE Trans Neural Syst Rehabil Eng. 2020 Feb;28(2):468-477. doi: 10.1109/TNSRE.2020.2966249. Epub 2020 Jan 13.
6
A vision-based system for intelligent monitoring: human behaviour analysis and privacy by context.一种用于智能监控的基于视觉的系统:人类行为分析与情境隐私。
Sensors (Basel). 2014 May 20;14(5):8895-925. doi: 10.3390/s140508895.
7
Unobtrusive sensing and wearable devices for health informatics.用于健康信息学的非侵入式传感与可穿戴设备。
IEEE Trans Biomed Eng. 2014 May;61(5):1538-54. doi: 10.1109/TBME.2014.2309951.