• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

双语手语识别:一种基于YOLOv11的孟加拉语和英语字母模型。

Bilingual Sign Language Recognition: A YOLOv11-Based Model for Bangla and English Alphabets.

作者信息

Navin Nawshin, Farid Fahmid Al, Rakin Raiyen Z, Tanzim Sadman S, Rahman Mashrur, Rahman Shakila, Uddin Jia, Karim Hezerul Abdul

机构信息

Department of Computer Science, American International University-Bangladesh, Dhaka 1229, Bangladesh.

Centre for Image and Vision Computing (CIVC), COE for Artificial Intelligence, Faculty of Artificial Intelligence and Engineering (FAIE), Multimedia University, Cyberjaya 63100, Selangor, Malaysia.

出版信息

J Imaging. 2025 Apr 27;11(5):134. doi: 10.3390/jimaging11050134.

DOI:10.3390/jimaging11050134
PMID:40422991
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC12112066/
Abstract

Communication through sign language effectively helps both hearing- and speaking-impaired individuals connect. However, there are problems with the interlingual communication between Bangla Sign Language (BdSL) and English Sign Language (ASL) due to the absence of a unified system. This study aims to introduce a detection system that incorporates these two sign languages to enhance the flow of communication for those who use these forms of sign language. This study developed and tested a deep learning-based sign-language detection system that can recognize both BdSL and ASL alphabets concurrently in real time. The approach uses a YOLOv11 object detection architecture that has been trained with an open-source dataset on a set of 9556 images containing 64 different letter signs from both languages. Data preprocessing was applied to enhance the performance of the model. Evaluation criteria, including the precision, recall, mAP, and other parameter values were also computed to evaluate the model. The performance analysis of the proposed method shows a precision of 99.12% and average recall rates of 99.63% in 30 epochs. The studies show that the proposed model outperforms the current techniques in sign language recognition (SLR) and can be used in communicating assistive technologies and human-computer interaction systems.

摘要

通过手语进行交流有效地帮助了听力和语言有障碍的人建立联系。然而,由于缺乏统一的系统,孟加拉手语(BdSL)和美国手语(ASL)之间的跨语言交流存在问题。本研究旨在引入一个整合这两种手语的检测系统,以促进使用这些手语形式的人的交流顺畅。本研究开发并测试了一种基于深度学习的手语检测系统,该系统能够实时同时识别BdSL和ASL字母。该方法使用YOLOv11目标检测架构,该架构已在一个开源数据集上进行训练,该数据集包含一组9556张图像,其中有来自两种语言的64种不同字母手势。应用了数据预处理以提高模型的性能。还计算了包括精度、召回率、平均精度均值(mAP)和其他参数值在内的评估标准来评估模型。所提方法的性能分析表明,在30个轮次中精度为99.12%,平均召回率为99.63%。研究表明,所提模型在手语识别(SLR)方面优于当前技术,可用于通信辅助技术和人机交互系统。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/66c5/12112066/b6986cfeb521/jimaging-11-00134-g015.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/66c5/12112066/b12418fbf5c4/jimaging-11-00134-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/66c5/12112066/3eda60305f8e/jimaging-11-00134-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/66c5/12112066/39cdea91f337/jimaging-11-00134-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/66c5/12112066/5f29b9549dda/jimaging-11-00134-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/66c5/12112066/7407b0095689/jimaging-11-00134-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/66c5/12112066/1d606617084f/jimaging-11-00134-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/66c5/12112066/e1f255b94aee/jimaging-11-00134-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/66c5/12112066/3adc912f0fb7/jimaging-11-00134-g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/66c5/12112066/8faa9a6752da/jimaging-11-00134-g009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/66c5/12112066/54997c9a294f/jimaging-11-00134-g010.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/66c5/12112066/1e58aef5d53d/jimaging-11-00134-g011.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/66c5/12112066/213b89d58945/jimaging-11-00134-g012.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/66c5/12112066/97a8fecb5cbb/jimaging-11-00134-g013.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/66c5/12112066/563ce1f21bf9/jimaging-11-00134-g014.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/66c5/12112066/b6986cfeb521/jimaging-11-00134-g015.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/66c5/12112066/b12418fbf5c4/jimaging-11-00134-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/66c5/12112066/3eda60305f8e/jimaging-11-00134-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/66c5/12112066/39cdea91f337/jimaging-11-00134-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/66c5/12112066/5f29b9549dda/jimaging-11-00134-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/66c5/12112066/7407b0095689/jimaging-11-00134-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/66c5/12112066/1d606617084f/jimaging-11-00134-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/66c5/12112066/e1f255b94aee/jimaging-11-00134-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/66c5/12112066/3adc912f0fb7/jimaging-11-00134-g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/66c5/12112066/8faa9a6752da/jimaging-11-00134-g009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/66c5/12112066/54997c9a294f/jimaging-11-00134-g010.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/66c5/12112066/1e58aef5d53d/jimaging-11-00134-g011.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/66c5/12112066/213b89d58945/jimaging-11-00134-g012.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/66c5/12112066/97a8fecb5cbb/jimaging-11-00134-g013.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/66c5/12112066/563ce1f21bf9/jimaging-11-00134-g014.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/66c5/12112066/b6986cfeb521/jimaging-11-00134-g015.jpg

相似文献

1
Bilingual Sign Language Recognition: A YOLOv11-Based Model for Bangla and English Alphabets.双语手语识别:一种基于YOLOv11的孟加拉语和英语字母模型。
J Imaging. 2025 Apr 27;11(5):134. doi: 10.3390/jimaging11050134.
2
Bangla Sign Language (BdSL) Alphabets and Numerals Classification Using a Deep Learning Model.使用深度学习模型对孟加拉手语(BdSL)字母和数字进行分类。
Sensors (Basel). 2022 Jan 12;22(2):574. doi: 10.3390/s22020574.
3
BDSL 49: A comprehensive dataset of Bangla sign language.孟加拉语手语49:孟加拉语手语综合数据集。
Data Brief. 2023 Jun 18;49:109329. doi: 10.1016/j.dib.2023.109329. eCollection 2023 Aug.
4
BdSL47: A complete depth-based Bangla sign alphabet and digit dataset.BdSL47:一个完整的基于深度的孟加拉语手语字母和数字数据集。
Data Brief. 2023 Nov 11;51:109799. doi: 10.1016/j.dib.2023.109799. eCollection 2023 Dec.
5
Atom Search Optimization with Deep Learning Enabled Arabic Sign Language Recognition for Speaking and Hearing Disability Persons.基于深度学习的原子搜索优化算法用于聋哑人士阿拉伯语手语识别
Healthcare (Basel). 2022 Aug 24;10(9):1606. doi: 10.3390/healthcare10091606.
6
KU-BdSL: An open dataset for Bengali sign language recognition.KU-BdSL:一个用于孟加拉语手语识别的开放数据集。
Data Brief. 2023 Nov 11;51:109797. doi: 10.1016/j.dib.2023.109797. eCollection 2023 Dec.
7
BdSLW-11: Dataset of Bangladeshi sign language words for recognizing 11 daily useful BdSL words.BdSLW - 11:用于识别11个日常有用的孟加拉国手语单词的孟加拉国手语单词数据集。
Data Brief. 2022 Nov 13;45:108747. doi: 10.1016/j.dib.2022.108747. eCollection 2022 Dec.
8
Real-Time American Sign Language Interpretation Using Deep Learning and Keypoint Tracking.使用深度学习和关键点跟踪的实时美国手语翻译
Sensors (Basel). 2025 Mar 28;25(7):2138. doi: 10.3390/s25072138.
9
IoT-driven smart assistive communication system for the hearing impaired with hybrid deep learning models for sign language recognition.基于物联网驱动的智能辅助通信系统,用于听力障碍者,采用混合深度学习模型进行手语识别。
Sci Rep. 2025 Feb 20;15(1):6192. doi: 10.1038/s41598-025-89975-1.
10
Development of Sign Language Motion Recognition System for Hearing-Impaired People Using Electromyography Signal.基于肌电信号的听障人士手语运动识别系统的开发。
Sensors (Basel). 2020 Oct 14;20(20):5807. doi: 10.3390/s20205807.

本文引用的文献

1
A Static Sign Language Recognition Method Enhanced with Self-Attention Mechanisms.基于自注意力机制的静态手语识别方法。
Sensors (Basel). 2024 Oct 29;24(21):6921. doi: 10.3390/s24216921.
2
BdSL47: A complete depth-based Bangla sign alphabet and digit dataset.BdSL47:一个完整的基于深度的孟加拉语手语字母和数字数据集。
Data Brief. 2023 Nov 11;51:109799. doi: 10.1016/j.dib.2023.109799. eCollection 2023 Dec.
3
Real-Time Obstacle Detection with YOLOv8 in a WSN Using UAV Aerial Photography.基于无人机航空摄影的无线传感器网络中使用YOLOv8进行实时障碍物检测
J Imaging. 2023 Oct 10;9(10):216. doi: 10.3390/jimaging9100216.
4
American Sign Language Alphabet Recognition by Extracting Feature from Hand Pose Estimation.从手姿态估计中提取特征实现美国手语字母识别。
Sensors (Basel). 2021 Aug 31;21(17):5856. doi: 10.3390/s21175856.
5
Text Data Augmentation for Deep Learning.用于深度学习的文本数据增强
J Big Data. 2021;8(1):101. doi: 10.1186/s40537-021-00492-0. Epub 2021 Jul 19.
6
Global, regional, and national incidence, prevalence, and years lived with disability for 310 diseases and injuries, 1990-2015: a systematic analysis for the Global Burden of Disease Study 2015.1990 - 2015年全球、区域和国家310种疾病和损伤的发病率、患病率及伤残调整生命年:全球疾病负担研究2015的系统分析
Lancet. 2016 Oct 8;388(10053):1545-1602. doi: 10.1016/S0140-6736(16)31678-6.