• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

轻量级深度学习技术与高级处理的实时手势识别。

Light-Weight Deep Learning Techniques with Advanced Processing for Real-Time Hand Gesture Recognition.

机构信息

Department of Computer Engineering, Gachon University, Seongnam 1342, Republic of Korea.

Informatics Department, Electronics Research Institute (ERI), Cairo 11843, Egypt.

出版信息

Sensors (Basel). 2022 Dec 20;23(1):2. doi: 10.3390/s23010002.

DOI:10.3390/s23010002
PMID:36616601
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC9823561/
Abstract

In the discipline of hand gesture and dynamic sign language recognition, deep learning approaches with high computational complexity and a wide range of parameters have been an extremely remarkable success. However, the implementation of sign language recognition applications for mobile phones with restricted storage and computing capacities is usually greatly constrained by those limited resources. In light of this situation, we suggest lightweight deep neural networks with advanced processing for real-time dynamic sign language recognition (DSLR). This paper presents a DSLR application to minimize the gap between hearing-impaired communities and regular society. The DSLR application was developed using two robust deep learning models, the GRU and the 1D CNN, combined with the MediaPipe framework. In this paper, the authors implement advanced processes to solve most of the DSLR problems, especially in real-time detection, e.g., differences in depth and location. The solution method consists of three main parts. First, the input dataset is preprocessed with our algorithm to standardize the number of frames. Then, the MediaPipe framework extracts hands and poses landmarks (features) to detect and locate them. Finally, the features of the models are passed after processing the unification of the depth and location of the body to recognize the DSL accurately. To accomplish this, the authors built a new American video-based sign dataset and named it DSL-46. DSL-46 contains 46 daily used signs that were presented with all the needed details and properties for recording the new dataset. The results of the experiments show that the presented solution method can recognize dynamic signs extremely fast and accurately, even in real-time detection. The DSLR reaches an accuracy of 98.8%, 99.84%, and 88.40% on the DSL-46, LSA64, and LIBRAS-BSL datasets, respectively.

摘要

在手势和动态手语识别领域,具有高计算复杂度和广泛参数的深度学习方法取得了非常显著的成功。然而,对于存储和计算能力有限的手机,手语识别应用的实现通常受到这些有限资源的极大限制。针对这种情况,我们提出了一种用于实时动态手语识别(DSLR)的轻量级深度学习网络,该网络具有先进的处理能力。本文提出了一种 DSLR 应用,旨在缩小听障人士和普通社会之间的差距。该 DSLR 应用是使用两个强大的深度学习模型(GRU 和 1D CNN)与 MediaPipe 框架结合开发的。在本文中,作者实现了先进的处理过程,以解决大多数 DSLR 问题,特别是在实时检测方面,例如深度和位置的差异。该解决方案方法包括三个主要部分。首先,使用我们的算法预处理输入数据集,以标准化帧数。然后,MediaPipe 框架提取手部和姿势地标(特征)来检测和定位它们。最后,将模型的特征传递后,对深度和位置的统一进行处理,以准确识别 DSL。为此,作者构建了一个新的基于美国视频的手语数据集,并将其命名为 DSL-46。DSL-46 包含 46 个日常使用的手语,这些手语具有记录新数据集所需的所有详细信息和属性。实验结果表明,所提出的解决方案方法可以非常快速和准确地识别动态手语,即使是在实时检测中。在 DSL-46、LSA64 和 LIBRAS-BSL 数据集上,DSLR 的准确率分别达到 98.8%、99.84%和 88.40%。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/575c/9823561/a2a0765599e3/sensors-23-00002-g011.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/575c/9823561/c90575514fe9/sensors-23-00002-g0A1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/575c/9823561/31d7973cf39d/sensors-23-00002-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/575c/9823561/b569ec2b7f07/sensors-23-00002-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/575c/9823561/05784aceb2d5/sensors-23-00002-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/575c/9823561/cbbc5171140a/sensors-23-00002-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/575c/9823561/883afad7c814/sensors-23-00002-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/575c/9823561/c9ccd23b6307/sensors-23-00002-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/575c/9823561/5292d0559819/sensors-23-00002-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/575c/9823561/7258ffb1d339/sensors-23-00002-g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/575c/9823561/84484347f313/sensors-23-00002-g009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/575c/9823561/27fe170fd52f/sensors-23-00002-g010.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/575c/9823561/a2a0765599e3/sensors-23-00002-g011.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/575c/9823561/c90575514fe9/sensors-23-00002-g0A1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/575c/9823561/31d7973cf39d/sensors-23-00002-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/575c/9823561/b569ec2b7f07/sensors-23-00002-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/575c/9823561/05784aceb2d5/sensors-23-00002-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/575c/9823561/cbbc5171140a/sensors-23-00002-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/575c/9823561/883afad7c814/sensors-23-00002-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/575c/9823561/c9ccd23b6307/sensors-23-00002-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/575c/9823561/5292d0559819/sensors-23-00002-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/575c/9823561/7258ffb1d339/sensors-23-00002-g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/575c/9823561/84484347f313/sensors-23-00002-g009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/575c/9823561/27fe170fd52f/sensors-23-00002-g010.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/575c/9823561/a2a0765599e3/sensors-23-00002-g011.jpg

相似文献

1
Light-Weight Deep Learning Techniques with Advanced Processing for Real-Time Hand Gesture Recognition.轻量级深度学习技术与高级处理的实时手势识别。
Sensors (Basel). 2022 Dec 20;23(1):2. doi: 10.3390/s23010002.
2
A Deep Learning-Based End-to-End Composite System for Hand Detection and Gesture Recognition.基于深度学习的手检测与手势识别端到端复合系统。
Sensors (Basel). 2019 Nov 30;19(23):5282. doi: 10.3390/s19235282.
3
Exploiting domain transformation and deep learning for hand gesture recognition using a low-cost dataglove.利用领域变换和深度学习,使用低成本数据手套进行手势识别。
Sci Rep. 2022 Dec 12;12(1):21446. doi: 10.1038/s41598-022-25108-2.
4
Dynamic Japanese Sign Language Recognition Throw Hand Pose Estimation Using Effective Feature Extraction and Classification Approach.基于有效特征提取和分类方法的动态日本手语识别投手姿势估计
Sensors (Basel). 2024 Jan 26;24(3):826. doi: 10.3390/s24030826.
5
An sEMG-Controlled 3D Game for Rehabilitation Therapies: Real-Time Time Hand Gesture Recognition Using Deep Learning Techniques.基于肌电信号控制的三维游戏用于康复治疗:使用深度学习技术的实时手部手势识别。
Sensors (Basel). 2020 Nov 12;20(22):6451. doi: 10.3390/s20226451.
6
Real-Time Hand Gesture Recognition Using Fine-Tuned Convolutional Neural Network.基于微调卷积神经网络的实时手势识别。
Sensors (Basel). 2022 Jan 18;22(3):706. doi: 10.3390/s22030706.
7
Real-Time Arabic Sign Language Recognition Using a Hybrid Deep Learning Model.基于混合深度学习模型的实时阿拉伯手语识别
Sensors (Basel). 2024 Jun 6;24(11):3683. doi: 10.3390/s24113683.
8
Sensor Fusion of Motion-Based Sign Language Interpretation with Deep Learning.基于运动的手语解释的传感器融合与深度学习。
Sensors (Basel). 2020 Nov 2;20(21):6256. doi: 10.3390/s20216256.
9
Sign Language Recognition Method Based on Palm Definition Model and Multiple Classification.基于手掌定义模型和多分类的手语识别方法。
Sensors (Basel). 2022 Sep 1;22(17):6621. doi: 10.3390/s22176621.
10
Dynamic Hand Gesture Recognition Based on a Leap Motion Controller and Two-Layer Bidirectional Recurrent Neural Network.基于 Leap Motion 控制器和两层双向递归神经网络的动态手势识别。
Sensors (Basel). 2020 Apr 8;20(7):2106. doi: 10.3390/s20072106.

引用本文的文献

1
Static and Dynamic Hand Gestures: A Review of Techniques of Virtual Reality Manipulation.静态和动态手势:虚拟现实操作技术综述。
Sensors (Basel). 2024 Jun 9;24(12):3760. doi: 10.3390/s24123760.

本文引用的文献

1
Review of deep learning: concepts, CNN architectures, challenges, applications, future directions.深度学习综述:概念、卷积神经网络架构、挑战、应用及未来方向。
J Big Data. 2021;8(1):53. doi: 10.1186/s40537-021-00444-8. Epub 2021 Mar 31.
2
Convolutional neural network models for cancer type prediction based on gene expression.基于基因表达的癌症类型预测卷积神经网络模型。
BMC Med Genomics. 2020 Apr 3;13(Suppl 5):44. doi: 10.1186/s12920-020-0677-2.
3
Chinese Sign Language Recognition Based on an Optimized Tree-Structure Framework.基于优化树结构框架的中国手语识别
IEEE J Biomed Health Inform. 2017 Jul;21(4):994-1004. doi: 10.1109/JBHI.2016.2560907. Epub 2016 May 3.