• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于 Mediapipe 优化的 GRU 模型的印地语手语识别。

An integrated mediapipe-optimized GRU model for Indian sign language recognition.

机构信息

School of Computer Science and Engineering, Kyungpook National University, Buk-gu, Daegu, 41566, South Korea.

Division of Computer Engineering, Hankuk University of Foreign Studies, Seoul, South Korea.

出版信息

Sci Rep. 2022 Jul 13;12(1):11964. doi: 10.1038/s41598-022-15998-7.

DOI:10.1038/s41598-022-15998-7
PMID:35831393
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC9279358/
Abstract

Sign language recognition is challenged by problems, such as accurate tracking of hand gestures, occlusion of hands, and high computational cost. Recently, it has benefited from advancements in deep learning techniques. However, these larger complex approaches cannot manage long-term sequential data and they are characterized by poor information processing and learning efficiency in capturing useful information. To overcome these challenges, we propose an integrated MediaPipe-optimized gated recurrent unit (MOPGRU) model for Indian sign language recognition. Specifically, we improved the update gate of the standard GRU cell by multiplying it by the reset gate to discard the redundant information from the past in one screening. By obtaining feedback from the resultant of the reset gate, additional attention is shown to the present input. Additionally, we replace the hyperbolic tangent activation in standard GRUs with exponential linear unit activation and SoftMax with Softsign activation in the output layer of the GRU cell. Thus, our proposed MOPGRU model achieved better prediction accuracy, high learning efficiency, information processing capability, and faster convergence than other sequential models.

摘要

手语识别受到一些问题的挑战,例如对手势的准确跟踪、手部遮挡和计算成本高。最近,它受益于深度学习技术的进步。然而,这些更大更复杂的方法无法处理长期的序列数据,并且在捕获有用信息方面的信息处理和学习效率较差。为了克服这些挑战,我们提出了一种用于印度手语识别的集成 MediaPipe 优化门控循环单元(MOPGRU)模型。具体来说,我们通过将其乘以重置门来改进标准 GRU 单元的更新门,以在一次筛选中丢弃过去的冗余信息。通过从重置门的结果中获得反馈,对当前输入给予额外的关注。此外,我们在 GRU 单元的输出层中用指数线性单元激活替换标准 GRU 中的双曲正切激活,用 Softsign 激活替换 SoftMax。因此,与其他序列模型相比,我们提出的 MOPGRU 模型实现了更好的预测精度、高效率的学习、信息处理能力和更快的收敛速度。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ca51/9279358/ab55fb1d7efe/41598_2022_15998_Fig8_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ca51/9279358/80a288fe66c7/41598_2022_15998_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ca51/9279358/a10224e3d9a6/41598_2022_15998_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ca51/9279358/d388492e75fb/41598_2022_15998_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ca51/9279358/a59824d4b10c/41598_2022_15998_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ca51/9279358/aa9750bcd588/41598_2022_15998_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ca51/9279358/5cf0d6c87650/41598_2022_15998_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ca51/9279358/521d491ec96e/41598_2022_15998_Fig7_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ca51/9279358/ab55fb1d7efe/41598_2022_15998_Fig8_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ca51/9279358/80a288fe66c7/41598_2022_15998_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ca51/9279358/a10224e3d9a6/41598_2022_15998_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ca51/9279358/d388492e75fb/41598_2022_15998_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ca51/9279358/a59824d4b10c/41598_2022_15998_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ca51/9279358/aa9750bcd588/41598_2022_15998_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ca51/9279358/5cf0d6c87650/41598_2022_15998_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ca51/9279358/521d491ec96e/41598_2022_15998_Fig7_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ca51/9279358/ab55fb1d7efe/41598_2022_15998_Fig8_HTML.jpg

相似文献

1
An integrated mediapipe-optimized GRU model for Indian sign language recognition.基于 Mediapipe 优化的 GRU 模型的印地语手语识别。
Sci Rep. 2022 Jul 13;12(1):11964. doi: 10.1038/s41598-022-15998-7.
2
Light-Weight Deep Learning Techniques with Advanced Processing for Real-Time Hand Gesture Recognition.轻量级深度学习技术与高级处理的实时手势识别。
Sensors (Basel). 2022 Dec 20;23(1):2. doi: 10.3390/s23010002.
3
Sign Language Recognition Method Based on Palm Definition Model and Multiple Classification.基于手掌定义模型和多分类的手语识别方法。
Sensors (Basel). 2022 Sep 1;22(17):6621. doi: 10.3390/s22176621.
4
Dynamic Japanese Sign Language Recognition Throw Hand Pose Estimation Using Effective Feature Extraction and Classification Approach.基于有效特征提取和分类方法的动态日本手语识别投手姿势估计
Sensors (Basel). 2024 Jan 26;24(3):826. doi: 10.3390/s24030826.
5
American Sign Language Words Recognition of Skeletal Videos Using Processed Video Driven Multi-Stacked Deep LSTM.基于处理视频驱动的多层堆叠深度 LSTM 的骨骼视频的美国手语词识别。
Sensors (Basel). 2022 Feb 11;22(4):1406. doi: 10.3390/s22041406.
6
Hypertuned Deep Convolutional Neural Network for Sign Language Recognition.超调深度卷积神经网络的手语识别。
Comput Intell Neurosci. 2022 Apr 30;2022:1450822. doi: 10.1155/2022/1450822. eCollection 2022.
7
Privacy-Preserving British Sign Language Recognition Using Deep Learning.基于深度学习的隐私保护的英国手语识别。
Annu Int Conf IEEE Eng Med Biol Soc. 2022 Jul;2022:4316-4319. doi: 10.1109/EMBC48229.2022.9871491.
8
Bioinspired Photoreceptors with Neural Network for Recognition and Classification of Sign Language Gesture.基于神经网络的仿生光感受器,用于识别和分类手语手势。
Sensors (Basel). 2023 Dec 6;23(24):9646. doi: 10.3390/s23249646.
9
Sensor Fusion of Motion-Based Sign Language Interpretation with Deep Learning.基于运动的手语解释的传感器融合与深度学习。
Sensors (Basel). 2020 Nov 2;20(21):6256. doi: 10.3390/s20216256.
10
Hand gestures for emergency situations: A video dataset based on words from Indian sign language.紧急情况下的手势:基于印度手语词汇的视频数据集
Data Brief. 2020 Jul 11;31:106016. doi: 10.1016/j.dib.2020.106016. eCollection 2020 Aug.

引用本文的文献

1
Low-cost computation for isolated sign language video recognition with multiple reservoir computing.用于基于多水库计算的孤立手语视频识别的低成本计算
PLoS One. 2025 Jul 30;20(7):e0322717. doi: 10.1371/journal.pone.0322717. eCollection 2025.
2
An artificial intelligence powered study of enlarged facial pore prevalence on one million Chinese from different age groups and its correlation with environmental factors.一项针对中国不同年龄段的一百万人面部毛孔粗大流行情况及其与环境因素相关性的人工智能研究。
Skin Res Technol. 2024 Sep;30(9):e70025. doi: 10.1111/srt.70025.
3
The forecasting of surface displacement for tunnel slopes utilizing the WD-IPSO-GRU model.
利用WD-IPSO-GRU模型预测隧道边坡的地表位移
Sci Rep. 2024 Sep 5;14(1):20717. doi: 10.1038/s41598-024-71742-3.
4
AI and augmented reality for 3D Indian dance pose reconstruction cultural revival.人工智能和增强现实技术用于 3D 印度舞蹈姿势重建与文化复兴。
Sci Rep. 2024 Apr 4;14(1):7906. doi: 10.1038/s41598-024-58680-w.
5
Artificial intelligence analysis of over a million Chinese men and women reveals level of dark circle in the facial skin aging process.对超过 100 万中国男女的人工智能分析揭示了面部皮肤衰老过程中黑眼圈的程度。
Skin Res Technol. 2023 Nov;29(11):e13492. doi: 10.1111/srt.13492.
6
Domain Adaptation with Contrastive Simultaneous Multi-Loss Training for Hand Gesture Recognition.基于对比同时多损失训练的域自适应的手势识别。
Sensors (Basel). 2023 Mar 22;23(6):3332. doi: 10.3390/s23063332.