• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于创新手部姿势的手语识别:使用混合元启发式优化算法与深度学习模型助力听力障碍者

Innovative hand pose based sign language recognition using hybrid metaheuristic optimization algorithms with deep learning model for hearing impaired persons.

作者信息

Alabduallah Bayan, Al Dayil Reham, Alkharashi Abdulwhab, Alneil Amani A

机构信息

Department of Information Systems, College of Computer and Information Sciences, Princess Nourah Bint Abdulrahman University, P.O. Box 84428, 11671, Riyadh, Saudi Arabia.

Applied College, Imam Mohammad Ibn Saud Islamic University (IMSIU), 11432, Riyadh, Saudi Arabia.

出版信息

Sci Rep. 2025 Mar 18;15(1):9320. doi: 10.1038/s41598-025-93559-4.

DOI:10.1038/s41598-025-93559-4
PMID:40102499
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11920037/
Abstract

Sign language (SL) is an effective mode of communication, which uses visual-physical methods like hand signals, expressions, and body actions to communicate between the difficulty of hearing and the deaf community, produce opinions, and carry significant conversations. SL recognition (SLR), the procedure of automatically identifying and construing gestures of SL, has gotten considerable attention recently owing to its latent link to the lack of communication between the deaf and the hearing world. Hand gesture detection is its domain, in which computer vision (CV) and artificial intelligence (AI) help deliver non-verbal communication between computers and humans by classifying the significant movements of the human hands. The emergence and constant growth of DL approaches have delivered motivation and momentum for evolving SLR. Therefore, this manuscript presents an Innovative Sign Language Recognition using Hand Pose with Hybrid Metaheuristic Optimization Algorithms in Deep Learning (ISLRHP-HMOADL) technique for Hearing-Impaired Persons. The main objective of the ISLRHP-HMOADL technique focused on hand pose recognition to improve the efficiency and accuracy of sign interpretation for hearing-impaired persons. Initially, the ISLRHP-HMOADL model performs image pre-processing using a wiener filter (WF) to enhance image quality by reducing noise. Furthermore, the fusion of three models, ResNeXt101, VGG19, and vision transformer (ViT), is employed for feature extraction to capture diverse and intricate spatial and contextual details from the images. The bidirectional gated recurrent unit (BiGRU) classifier is implemented for hand pose recognition. To further optimize the performance of the model, the ISLRHP-HMOADL model implements the hybrid crow search-improved grey wolf optimization (CS-IGWO) model for parameter tuning, achieving a finely-tuned configuration that enhances classification accuracy and robustness. A comprehensive experimental study is accomplished under the ASL alphabet dataset to exhibit the improved performance of the ISLRHP-HMOADL model. The comparative results of the ISLRHP-HMOADL model illustrated a superior accuracy value of 99.57% over existing techniques.

摘要

手语(SL)是一种有效的交流方式,它使用诸如手势、表情和身体动作等视觉物理方法,在听力障碍者和聋人社区之间进行交流、表达观点并进行重要对话。手语识别(SLR)是自动识别和解释手语手势的过程,由于其与聋人和听力世界之间沟通障碍的潜在联系,最近受到了相当大的关注。手势检测是其领域,其中计算机视觉(CV)和人工智能(AI)通过对人类手部的重要动作进行分类,帮助实现计算机与人类之间的非语言交流。深度学习(DL)方法的出现和不断发展为手语识别的发展提供了动力和契机。因此,本文提出了一种用于听力障碍者的深度学习中基于手部姿态与混合元启发式优化算法的创新手语识别(ISLRHP-HMOADL)技术。ISLRHP-HMOADL技术的主要目标集中在手部姿态识别上,以提高听力障碍者手语解释的效率和准确性。最初,ISLRHP-HMOADL模型使用维纳滤波器(WF)进行图像预处理,通过减少噪声来提高图像质量。此外,采用ResNeXt101、VGG19和视觉Transformer(ViT)这三种模型的融合进行特征提取,以从图像中捕获多样且复杂的空间和上下文细节。双向门控循环单元(BiGRU)分类器用于手部姿态识别。为了进一步优化模型性能,ISLRHP-HMOADL模型采用混合乌鸦搜索-改进灰狼优化(CS-IGWO)模型进行参数调整,实现精细调整的配置,提高分类准确性和鲁棒性。在美式手语字母数据集下完成了全面的实验研究,以展示ISLRHP-HMOADL模型的改进性能。ISLRHP-HMOADL模型的比较结果表明,与现有技术相比,其准确率高达99.57%。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0fe4/11920037/1e6696483c15/41598_2025_93559_Fig17_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0fe4/11920037/82ff4abd7f07/41598_2025_93559_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0fe4/11920037/2fe6d98b19e5/41598_2025_93559_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0fe4/11920037/f00d5bc9dc81/41598_2025_93559_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0fe4/11920037/211bd4ccb86f/41598_2025_93559_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0fe4/11920037/a3d5450e1902/41598_2025_93559_Figa_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0fe4/11920037/65bba2c97c36/41598_2025_93559_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0fe4/11920037/50ca58aa4b7f/41598_2025_93559_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0fe4/11920037/702b46439278/41598_2025_93559_Fig7_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0fe4/11920037/33c3e7fcae86/41598_2025_93559_Fig8_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0fe4/11920037/b05497b6e1ec/41598_2025_93559_Fig9_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0fe4/11920037/cf2adbcd4f5c/41598_2025_93559_Fig10_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0fe4/11920037/f87d5740ccb6/41598_2025_93559_Fig11_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0fe4/11920037/6709dd78f994/41598_2025_93559_Fig12_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0fe4/11920037/66273e874186/41598_2025_93559_Fig13_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0fe4/11920037/223651761493/41598_2025_93559_Fig14_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0fe4/11920037/73087375153e/41598_2025_93559_Fig15_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0fe4/11920037/2212b4fa9d6d/41598_2025_93559_Fig16_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0fe4/11920037/1e6696483c15/41598_2025_93559_Fig17_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0fe4/11920037/82ff4abd7f07/41598_2025_93559_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0fe4/11920037/2fe6d98b19e5/41598_2025_93559_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0fe4/11920037/f00d5bc9dc81/41598_2025_93559_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0fe4/11920037/211bd4ccb86f/41598_2025_93559_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0fe4/11920037/a3d5450e1902/41598_2025_93559_Figa_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0fe4/11920037/65bba2c97c36/41598_2025_93559_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0fe4/11920037/50ca58aa4b7f/41598_2025_93559_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0fe4/11920037/702b46439278/41598_2025_93559_Fig7_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0fe4/11920037/33c3e7fcae86/41598_2025_93559_Fig8_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0fe4/11920037/b05497b6e1ec/41598_2025_93559_Fig9_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0fe4/11920037/cf2adbcd4f5c/41598_2025_93559_Fig10_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0fe4/11920037/f87d5740ccb6/41598_2025_93559_Fig11_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0fe4/11920037/6709dd78f994/41598_2025_93559_Fig12_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0fe4/11920037/66273e874186/41598_2025_93559_Fig13_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0fe4/11920037/223651761493/41598_2025_93559_Fig14_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0fe4/11920037/73087375153e/41598_2025_93559_Fig15_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0fe4/11920037/2212b4fa9d6d/41598_2025_93559_Fig16_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0fe4/11920037/1e6696483c15/41598_2025_93559_Fig17_HTML.jpg

相似文献

1
Innovative hand pose based sign language recognition using hybrid metaheuristic optimization algorithms with deep learning model for hearing impaired persons.基于创新手部姿势的手语识别:使用混合元启发式优化算法与深度学习模型助力听力障碍者
Sci Rep. 2025 Mar 18;15(1):9320. doi: 10.1038/s41598-025-93559-4.
2
IoT-driven smart assistive communication system for the hearing impaired with hybrid deep learning models for sign language recognition.基于物联网驱动的智能辅助通信系统,用于听力障碍者,采用混合深度学习模型进行手语识别。
Sci Rep. 2025 Feb 20;15(1):6192. doi: 10.1038/s41598-025-89975-1.
3
Atom Search Optimization with Deep Learning Enabled Arabic Sign Language Recognition for Speaking and Hearing Disability Persons.基于深度学习的原子搜索优化算法用于聋哑人士阿拉伯语手语识别
Healthcare (Basel). 2022 Aug 24;10(9):1606. doi: 10.3390/healthcare10091606.
4
Dynamic Japanese Sign Language Recognition Throw Hand Pose Estimation Using Effective Feature Extraction and Classification Approach.基于有效特征提取和分类方法的动态日本手语识别投手姿势估计
Sensors (Basel). 2024 Jan 26;24(3):826. doi: 10.3390/s24030826.
5
Automated sign language detection and classification using reptile search algorithm with hybrid deep learning.使用带有混合深度学习的爬虫搜索算法进行自动手语检测与分类
Heliyon. 2023 Dec 8;10(1):e23252. doi: 10.1016/j.heliyon.2023.e23252. eCollection 2024 Jan 15.
6
Real-Time Arabic Sign Language Recognition Using a Hybrid Deep Learning Model.基于混合深度学习模型的实时阿拉伯手语识别
Sensors (Basel). 2024 Jun 6;24(11):3683. doi: 10.3390/s24113683.
7
Real-Time American Sign Language Interpretation Using Deep Learning and Keypoint Tracking.使用深度学习和关键点跟踪的实时美国手语翻译
Sensors (Basel). 2025 Mar 28;25(7):2138. doi: 10.3390/s25072138.
8
Sensor Fusion of Motion-Based Sign Language Interpretation with Deep Learning.基于运动的手语解释的传感器融合与深度学习。
Sensors (Basel). 2020 Nov 2;20(21):6256. doi: 10.3390/s20216256.
9
Efhamni: A Deep Learning-Based Saudi Sign Language Recognition Application.埃法赫尼:一种基于深度学习的沙特手语识别应用。
Sensors (Basel). 2024 May 14;24(10):3112. doi: 10.3390/s24103112.
10
HGR-ViT: Hand Gesture Recognition with Vision Transformer.HGR-ViT:基于视觉Transformer 的手势识别
Sensors (Basel). 2023 Jun 14;23(12):5555. doi: 10.3390/s23125555.

本文引用的文献

1
An integrated binary metaheuristic approach in dynamic unit commitment and economic emission dispatch for hybrid energy systems.一种用于混合能源系统动态机组组合与经济排放调度的集成二元元启发式方法。
Sci Rep. 2024 Oct 14;14(1):23964. doi: 10.1038/s41598-024-75743-0.
2
CHD7 variants associated with hearing loss and enlargement of the vestibular aqueduct.与听力损失和前庭导水管扩大相关的 CHD7 变异体。
Hum Genet. 2023 Oct;142(10):1499-1517. doi: 10.1007/s00439-023-02581-x. Epub 2023 Sep 5.
3
Improved ultrasound image quality with pixel-based beamforming using a Wiener-filter and a SNR-dependent coherence factor.
基于维纳滤波和 SNR 相关相干因子的像素波束形成可提高超声图像质量。
Ultrasonics. 2022 Feb;119:106594. doi: 10.1016/j.ultras.2021.106594. Epub 2021 Sep 25.
4
American Sign Language Alphabet Recognition by Extracting Feature from Hand Pose Estimation.从手姿态估计中提取特征实现美国手语字母识别。
Sensors (Basel). 2021 Aug 31;21(17):5856. doi: 10.3390/s21175856.