• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于二维卷积神经网络和特征融合的动态手势识别。

Dynamic gesture recognition based on 2D convolutional neural network and feature fusion.

机构信息

College of Automation, Chongqing University of Posts and Telecommunications, Chongqing, 400065, China.

College of Computer Science, Chongqing University, Chongqing, 400044, China.

出版信息

Sci Rep. 2022 Mar 14;12(1):4345. doi: 10.1038/s41598-022-08133-z.

DOI:10.1038/s41598-022-08133-z
PMID:35288612
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC8921226/
Abstract

Gesture recognition is one of the most popular techniques in the field of computer vision today. In recent years, many algorithms for gesture recognition have been proposed, but most of them do not have a good balance between recognition efficiency and accuracy. Therefore, proposing a dynamic gesture recognition algorithm that balances efficiency and accuracy is still a meaningful work. Currently, most of the commonly used dynamic gesture recognition algorithms are based on 3D convolutional neural networks. Although 3D convolutional neural networks consider both spatial and temporal features, the networks are too complex, which is the main reason for the low efficiency of the algorithms. To improve this problem, we propose a recognition method based on a strategy combining 2D convolutional neural networks with feature fusion. The original keyframes and optical flow keyframes are used to represent spatial and temporal features respectively, which are then sent to the 2D convolutional neural network for feature fusion and final recognition. To ensure the quality of the extracted optical flow graph without increasing the complexity of the network, we use the fractional-order method to extract the optical flow graph, creatively combine fractional calculus and deep learning. Finally, we use Cambridge Hand Gesture dataset and Northwestern University Hand Gesture dataset to verify the effectiveness of our algorithm. The experimental results show that our algorithm has a high accuracy while ensuring low network complexity.

摘要

手势识别是计算机视觉领域中最流行的技术之一。近年来,已经提出了许多手势识别算法,但大多数算法在识别效率和准确性之间都没有很好的平衡。因此,提出一种平衡效率和准确性的动态手势识别算法仍然是一项有意义的工作。目前,大多数常用的动态手势识别算法都是基于 3D 卷积神经网络的。虽然 3D 卷积神经网络同时考虑了空间和时间特征,但网络过于复杂,这是算法效率低的主要原因。为了解决这个问题,我们提出了一种基于 2D 卷积神经网络与特征融合相结合的策略的识别方法。原始关键帧和光流关键帧分别表示空间和时间特征,然后将它们发送到 2D 卷积神经网络进行特征融合和最终识别。为了确保提取的光流图的质量而不增加网络的复杂性,我们使用分数阶方法提取光流图,创造性地将分数阶微积分和深度学习结合起来。最后,我们使用剑桥手部动作数据集和西北大学手部动作数据集验证了我们算法的有效性。实验结果表明,我们的算法在保证网络复杂度低的同时具有很高的准确性。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2076/8921226/3fa80477bddb/41598_2022_8133_Fig12_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2076/8921226/fe2325cb28c6/41598_2022_8133_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2076/8921226/916c7a7c8c32/41598_2022_8133_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2076/8921226/b8ea0078c044/41598_2022_8133_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2076/8921226/d716e950d837/41598_2022_8133_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2076/8921226/13c8185072fe/41598_2022_8133_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2076/8921226/2c590bd92f4f/41598_2022_8133_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2076/8921226/8d2432933f13/41598_2022_8133_Fig7_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2076/8921226/c5caa11b172d/41598_2022_8133_Fig8_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2076/8921226/d519a5269127/41598_2022_8133_Fig9_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2076/8921226/c59fda3d638c/41598_2022_8133_Fig10_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2076/8921226/71718a915e78/41598_2022_8133_Fig11_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2076/8921226/3fa80477bddb/41598_2022_8133_Fig12_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2076/8921226/fe2325cb28c6/41598_2022_8133_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2076/8921226/916c7a7c8c32/41598_2022_8133_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2076/8921226/b8ea0078c044/41598_2022_8133_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2076/8921226/d716e950d837/41598_2022_8133_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2076/8921226/13c8185072fe/41598_2022_8133_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2076/8921226/2c590bd92f4f/41598_2022_8133_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2076/8921226/8d2432933f13/41598_2022_8133_Fig7_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2076/8921226/c5caa11b172d/41598_2022_8133_Fig8_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2076/8921226/d519a5269127/41598_2022_8133_Fig9_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2076/8921226/c59fda3d638c/41598_2022_8133_Fig10_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2076/8921226/71718a915e78/41598_2022_8133_Fig11_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2076/8921226/3fa80477bddb/41598_2022_8133_Fig12_HTML.jpg

相似文献

1
Dynamic gesture recognition based on 2D convolutional neural network and feature fusion.基于二维卷积神经网络和特征融合的动态手势识别。
Sci Rep. 2022 Mar 14;12(1):4345. doi: 10.1038/s41598-022-08133-z.
2
Dynamic Gesture Recognition Algorithm Based on 3D Convolutional Neural Network.基于三维卷积神经网络的动态手势识别算法。
Comput Intell Neurosci. 2021 Aug 16;2021:4828102. doi: 10.1155/2021/4828102. eCollection 2021.
3
MSFF-Net: Multi-Stream Feature Fusion Network for surface electromyography gesture recognition.MSFF-Net:用于表面肌电信号手势识别的多流特征融合网络。
PLoS One. 2022 Nov 7;17(11):e0276436. doi: 10.1371/journal.pone.0276436. eCollection 2022.
4
Transformer-based hand gesture recognition from instantaneous to fused neural decomposition of high-density EMG signals.基于Transformer 的手 gestures 识别,来自高密度 EMG 信号的即时融合神经分解。
Sci Rep. 2023 Jul 7;13(1):11000. doi: 10.1038/s41598-023-36490-w.
5
Hand gesture recognition using sEMG signals with a multi-stream time-varying feature enhancement approach.基于多流时变特征增强方法的 sEMG 信号手势识别。
Sci Rep. 2024 Sep 27;14(1):22061. doi: 10.1038/s41598-024-72996-7.
6
Surface EMG-Based Instantaneous Hand Gesture Recognition Using Convolutional Neural Network with the Transfer Learning Method.基于表面肌电信号的卷积神经网络与迁移学习方法的瞬时手势识别。
Sensors (Basel). 2021 Apr 5;21(7):2540. doi: 10.3390/s21072540.
7
Dual Stream Long Short-Term Memory Feature Fusion Classifier for Surface Electromyography Gesture Recognition.双通道长短时记忆特征融合分类器用于表面肌电手势识别。
Sensors (Basel). 2024 Jun 4;24(11):3631. doi: 10.3390/s24113631.
8
Multimode Gesture Recognition Algorithm Based on Convolutional Long Short-Term Memory Network.基于卷积长短期记忆网络的多模式手势识别算法。
Comput Intell Neurosci. 2022 Mar 2;2022:4068414. doi: 10.1155/2022/4068414. eCollection 2022.
9
Multi-Category Gesture Recognition Modeling Based on sEMG and IMU Signals.基于 sEMG 和 IMU 信号的多类别手势识别建模。
Sensors (Basel). 2022 Aug 5;22(15):5855. doi: 10.3390/s22155855.
10
Hand Gesture Recognition based on Surface Electromyography using Convolutional Neural Network with Transfer Learning Method.基于卷积神经网络的迁移学习方法的表面肌电手势识别。
IEEE J Biomed Health Inform. 2021 Apr;25(4):1292-1304. doi: 10.1109/JBHI.2020.3009383. Epub 2021 Apr 6.

引用本文的文献

1
HAVIT: research on vision-language gesture interaction mechanism for smart furniture.哈维特:智能家具视觉-语言-手势交互机制研究
Sci Rep. 2025 Jul 28;15(1):27423. doi: 10.1038/s41598-025-10758-9.
2
Local pattern aware 3D video swin transformer with masked autoencoding for realtime augmented reality gesture interaction.用于实时增强现实手势交互的具有掩码自动编码的局部模式感知3D视频斯温变压器
Sci Rep. 2025 Jul 1;15(1):21318. doi: 10.1038/s41598-025-05935-9.
3
Gesture-controlled reconfigurable metasurface system based on surface electromyography for real-time electromagnetic wave manipulation.

本文引用的文献

1
Dynamic Gesture Recognition Using Surface EMG Signals Based on Multi-Stream Residual Network.基于多流残差网络的表面肌电信号动态手势识别
Front Bioeng Biotechnol. 2021 Oct 22;9:779353. doi: 10.3389/fbioe.2021.779353. eCollection 2021.
2
Squeeze-and-Excitation Networks.挤压激励网络。
IEEE Trans Pattern Anal Mach Intell. 2020 Aug;42(8):2011-2023. doi: 10.1109/TPAMI.2019.2913372. Epub 2019 Apr 29.
3
Fractional-order variational optical flow model for motion estimation.分数阶变分光流模型的运动估计。
基于表面肌电图的手势控制可重构超表面系统用于实时电磁波操纵。
Nanophotonics. 2025 Jan 6;14(1):107-119. doi: 10.1515/nanoph-2024-0572. eCollection 2025 Jan.
4
Intelligent Gesture Recognition Based on Screen Reflectance Multi-Band Spectral Features.基于屏幕反射多波段光谱特征的智能手势识别
Sensors (Basel). 2024 Aug 26;24(17):5519. doi: 10.3390/s24175519.
5
Integration of Tracking, Re-Identification, and Gesture Recognition for Facilitating Human-Robot Interaction.用于促进人机交互的跟踪、重新识别和手势识别的集成。
Sensors (Basel). 2024 Jul 25;24(15):4850. doi: 10.3390/s24154850.
6
Verification of Criterion-Related Validity for Developing a Markerless Hand Tracking Device.开发无标记手部跟踪设备的标准关联效度验证
Biomimetics (Basel). 2024 Jul 2;9(7):400. doi: 10.3390/biomimetics9070400.
7
Upper limb modeling and motion extraction based on multi-space-fusion.基于多空间融合的上肢建模与运动提取。
Sci Rep. 2023 Sep 26;13(1):16101. doi: 10.1038/s41598-023-36767-0.
Philos Trans A Math Phys Eng Sci. 2013 Apr 1;371(1990):20120148. doi: 10.1098/rsta.2012.0148. Print 2013 May 13.