• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

ASLDetect:使用类似ResNet和U-Net组件的阿拉伯手语检测

ASLDetect: Arabic sign language detection using ResNet and U-Net like component.

作者信息

Alasmari Naif, Asiri Sultan

机构信息

Computer Science Department, Applied College, Muhayil, King Khalid University, Abha, Saudi Arabia.

Center for Artificial Intelligence (CAI), King Khalid University, Abha, 61421, Saudi Arabia.

出版信息

Sci Rep. 2025 May 23;15(1):18012. doi: 10.1038/s41598-025-01588-w.

DOI:10.1038/s41598-025-01588-w
PMID:40410252
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC12102344/
Abstract

Sign languages are essential for communication among over 430 million deaf and hard-of-hearing individuals worldwide. However, recognizing Arabic Sign Language (ArSL) in real-world settings remains challenging due to issues like background noise, lighting variations, and hand occlusions. These limitations hinder the effectiveness of existing systems in applications such as assistive technologies and education. To tackle these challenges, we propose ASLDetect, a new model for ArSL recognition that leverages ResNet for feature extraction and a U-Net-based architecture for accurate gesture segmentation. Our method includes preprocessing steps like resizing images to 64 64 pixels, normalization, and selective augmentation to improve robustness in diverse environments. We evaluated ASLDetect on two datasets: ArASL2018, which features plain backgrounds, and ArASL2021, which includes more complex and diverse environments. On ArASL2018, ASLDetect achieved an accuracy of 99.35%, surpassing ResNet34 (99.08%), T-SignSys (97.92%), and UrSL-CNN (0.98%). For ArASL2021, we applied transfer learning from our ArASL2018-trained model, significantly improving performance and reaching 86.84% accuracy-outperforming ResNet34 (82.5%), T-SignSys (58.98%), and UrSL-CNN (0.49%). These results highlight ASLDetect's accuracy, robustness, and adaptability.

摘要

手语对于全球超过4.3亿失聪和听力障碍人士之间的交流至关重要。然而,在现实场景中识别阿拉伯手语(ArSL)仍然具有挑战性,因为存在背景噪音、光照变化和手部遮挡等问题。这些限制阻碍了现有系统在辅助技术和教育等应用中的有效性。为了应对这些挑战,我们提出了ASLDetect,一种用于ArSL识别的新模型,它利用ResNet进行特征提取,并采用基于U-Net的架构进行精确的手势分割。我们的方法包括预处理步骤,如图像调整为64×64像素、归一化和选择性增强,以提高在不同环境中的鲁棒性。我们在两个数据集上评估了ASLDetect:具有纯色背景的ArASL2018和包含更复杂多样环境的ArASL2021。在ArASL2018上,ASLDetect的准确率达到99.35%,超过了ResNet34(99.08%)、T-SignSys(97.92%)和UrSL-CNN(0.98%)。对于ArASL2021,我们应用了从在ArASL2018上训练的模型进行迁移学习,显著提高了性能,准确率达到86.84%,超过了ResNet34(82.5%)、T-SignSys(58.98%)和UrSL-CNN(0.49%)。这些结果突出了ASLDetect的准确性、鲁棒性和适应性。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d30a/12102344/1a2897fd96e7/41598_2025_1588_Fig7_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d30a/12102344/00bdc50ff24c/41598_2025_1588_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d30a/12102344/c33a35821e67/41598_2025_1588_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d30a/12102344/82cbe59855e1/41598_2025_1588_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d30a/12102344/ef421db37b05/41598_2025_1588_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d30a/12102344/eb0bece0e210/41598_2025_1588_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d30a/12102344/bfcf9522350c/41598_2025_1588_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d30a/12102344/1a2897fd96e7/41598_2025_1588_Fig7_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d30a/12102344/00bdc50ff24c/41598_2025_1588_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d30a/12102344/c33a35821e67/41598_2025_1588_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d30a/12102344/82cbe59855e1/41598_2025_1588_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d30a/12102344/ef421db37b05/41598_2025_1588_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d30a/12102344/eb0bece0e210/41598_2025_1588_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d30a/12102344/bfcf9522350c/41598_2025_1588_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d30a/12102344/1a2897fd96e7/41598_2025_1588_Fig7_HTML.jpg

相似文献

1
ASLDetect: Arabic sign language detection using ResNet and U-Net like component.ASLDetect:使用类似ResNet和U-Net组件的阿拉伯手语检测
Sci Rep. 2025 May 23;15(1):18012. doi: 10.1038/s41598-025-01588-w.
2
Real-Time Arabic Sign Language Recognition Using a Hybrid Deep Learning Model.基于混合深度学习模型的实时阿拉伯手语识别
Sensors (Basel). 2024 Jun 6;24(11):3683. doi: 10.3390/s24113683.
3
Continuous Arabic Sign Language Recognition Models.连续阿拉伯手语识别模型。
Sensors (Basel). 2025 May 5;25(9):2916. doi: 10.3390/s25092916.
4
Human-Computer Interaction with Hand Gesture Recognition Using ResNet and MobileNet.基于 ResNet 和 MobileNet 的手势识别的人机交互
Comput Intell Neurosci. 2022 Mar 26;2022:8777355. doi: 10.1155/2022/8777355. eCollection 2022.
5
Two-Stream Mixed Convolutional Neural Network for American Sign Language Recognition.双流混合卷积神经网络在美手势语识别中的应用。
Sensors (Basel). 2022 Aug 9;22(16):5959. doi: 10.3390/s22165959.
6
IoT-driven smart assistive communication system for the hearing impaired with hybrid deep learning models for sign language recognition.基于物联网驱动的智能辅助通信系统,用于听力障碍者,采用混合深度学习模型进行手语识别。
Sci Rep. 2025 Feb 20;15(1):6192. doi: 10.1038/s41598-025-89975-1.
7
Efhamni: A Deep Learning-Based Saudi Sign Language Recognition Application.埃法赫尼:一种基于深度学习的沙特手语识别应用。
Sensors (Basel). 2024 May 14;24(10):3112. doi: 10.3390/s24103112.
8
Innovative hand pose based sign language recognition using hybrid metaheuristic optimization algorithms with deep learning model for hearing impaired persons.基于创新手部姿势的手语识别:使用混合元启发式优化算法与深度学习模型助力听力障碍者
Sci Rep. 2025 Mar 18;15(1):9320. doi: 10.1038/s41598-025-93559-4.
9
Real-Time Hand Gesture Recognition Using Fine-Tuned Convolutional Neural Network.基于微调卷积神经网络的实时手势识别。
Sensors (Basel). 2022 Jan 18;22(3):706. doi: 10.3390/s22030706.
10
Machine Learning and Deep Learning Approaches for Arabic Sign Language Recognition: A Decade Systematic Literature Review.用于阿拉伯手语识别的机器学习和深度学习方法:十年系统文献综述
Sensors (Basel). 2024 Dec 5;24(23):7798. doi: 10.3390/s24237798.

本文引用的文献

1
Human-Computer Interaction with Hand Gesture Recognition Using ResNet and MobileNet.基于 ResNet 和 MobileNet 的手势识别的人机交互
Comput Intell Neurosci. 2022 Mar 26;2022:8777355. doi: 10.1155/2022/8777355. eCollection 2022.
2
ArASL: Arabic Alphabets Sign Language Dataset.ArASL:阿拉伯字母手语数据集。
Data Brief. 2019 Feb 23;23:103777. doi: 10.1016/j.dib.2019.103777. eCollection 2019 Apr.