• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

BdSL47:一个完整的基于深度的孟加拉语手语字母和数字数据集。

BdSL47: A complete depth-based Bangla sign alphabet and digit dataset.

作者信息

Rayeed S M, Tuba Sidratul Tamzida, Mahmud Hasan, Mazumder Mumtahin Habib Ullah, Mukta Saddam Hossain, Hasan Kamrul

机构信息

Systems and Software Lab (SSL), Department of Computer Science and Engineering (CSE), Islamic University of Technology (IUT), Board Bazar, Gazipur 1704, Bangladesh.

Department of Computer Science and Engineering (CSE), United International University (UIU), United City, Madani Avenue, Dhaka1212, Bangladesh.

出版信息

Data Brief. 2023 Nov 11;51:109799. doi: 10.1016/j.dib.2023.109799. eCollection 2023 Dec.

DOI:10.1016/j.dib.2023.109799
PMID:38075615
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC10700367/
Abstract

Sign Language Recognition (SLR) is crucial for enabling communication between the deaf-mute and hearing communities. Nevertheless, the development of a comprehensive sign language dataset is a challenging task due to the complexity and variations in hand gestures. This challenge is particularly evident in the case of Bangla Sign Language (BdSL), where the limited availability of depth datasets impedes accurate recognition. To address this issue, we propose BdSL47, an open-access depth dataset for 47 one-handed static signs (10 digits, from ০ to ৯; and 37 letters, from অ to ँ) of BdSL. The dataset was created using the MediaPipe framework for extracting depth information. To classify the signs, we developed an Artificial Neural Network (ANN) model with a 63-node input layer, a 47-node output layer, and 4 hidden layers that included dropout in the last two hidden layers, an Adam optimizer, and a ReLU activation function. Based on the selected hyperparameters, the proposed ANN model effectively learns the spatial relationships and patterns from the depth-based gestural input features and gives an F1 score of 97.84 %, indicating the effectiveness of the approach compared to the baselines provided. The availability of BdSL47 as a comprehensive dataset can have an impact on improving the accuracy of SLR for BdSL using more advanced deep-learning models.

摘要

手语识别(SLR)对于实现聋哑人与听力正常人群之间的交流至关重要。然而,由于手势的复杂性和多样性,开发一个全面的手语数据集是一项具有挑战性的任务。在孟加拉手语(BdSL)的情况下,这一挑战尤为明显,深度数据集的有限可用性阻碍了准确识别。为了解决这个问题,我们提出了BdSL47,这是一个用于BdSL的47种单手静态手势(10个数字,从০到৯;以及37个字母,从অ到ঁ)的开放获取深度数据集。该数据集是使用MediaPipe框架创建的,用于提取深度信息。为了对手势进行分类,我们开发了一个人工神经网络(ANN)模型,该模型具有一个63节点的输入层、一个47节点的输出层和4个隐藏层,其中最后两个隐藏层包含随机失活,使用Adam优化器和ReLU激活函数。基于选定的超参数,所提出的ANN模型有效地从基于深度的手势输入特征中学习空间关系和模式,并给出了97.84%的F1分数,表明与提供的基线相比该方法的有效性。BdSL47作为一个全面数据集的可用性可能会对使用更先进的深度学习模型提高BdSL的SLR准确性产生影响。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4b5b/10700367/5723301498f4/gr9.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4b5b/10700367/7ee0ed32f8f4/gr1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4b5b/10700367/cc1212ca124a/gr2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4b5b/10700367/ee94e70b8209/gr3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4b5b/10700367/4ee18deec6ff/gr4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4b5b/10700367/cb5aca29fe44/gr5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4b5b/10700367/7fe13036d881/gr6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4b5b/10700367/2aed3e9aee8b/gr7.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4b5b/10700367/13018396b0f2/gr8.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4b5b/10700367/5723301498f4/gr9.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4b5b/10700367/7ee0ed32f8f4/gr1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4b5b/10700367/cc1212ca124a/gr2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4b5b/10700367/ee94e70b8209/gr3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4b5b/10700367/4ee18deec6ff/gr4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4b5b/10700367/cb5aca29fe44/gr5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4b5b/10700367/7fe13036d881/gr6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4b5b/10700367/2aed3e9aee8b/gr7.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4b5b/10700367/13018396b0f2/gr8.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4b5b/10700367/5723301498f4/gr9.jpg

相似文献

1
BdSL47: A complete depth-based Bangla sign alphabet and digit dataset.BdSL47:一个完整的基于深度的孟加拉语手语字母和数字数据集。
Data Brief. 2023 Nov 11;51:109799. doi: 10.1016/j.dib.2023.109799. eCollection 2023 Dec.
2
Bangla Sign Language (BdSL) Alphabets and Numerals Classification Using a Deep Learning Model.使用深度学习模型对孟加拉手语(BdSL)字母和数字进行分类。
Sensors (Basel). 2022 Jan 12;22(2):574. doi: 10.3390/s22020574.
3
BDSL 49: A comprehensive dataset of Bangla sign language.孟加拉语手语49:孟加拉语手语综合数据集。
Data Brief. 2023 Jun 18;49:109329. doi: 10.1016/j.dib.2023.109329. eCollection 2023 Aug.
4
KU-BdSL: An open dataset for Bengali sign language recognition.KU-BdSL:一个用于孟加拉语手语识别的开放数据集。
Data Brief. 2023 Nov 11;51:109797. doi: 10.1016/j.dib.2023.109797. eCollection 2023 Dec.
5
BdSLW-11: Dataset of Bangladeshi sign language words for recognizing 11 daily useful BdSL words.BdSLW - 11:用于识别11个日常有用的孟加拉国手语单词的孟加拉国手语单词数据集。
Data Brief. 2022 Nov 13;45:108747. doi: 10.1016/j.dib.2022.108747. eCollection 2022 Dec.
6
Light-Weight Deep Learning Techniques with Advanced Processing for Real-Time Hand Gesture Recognition.轻量级深度学习技术与高级处理的实时手势识别。
Sensors (Basel). 2022 Dec 20;23(1):2. doi: 10.3390/s23010002.
7
Dynamic Japanese Sign Language Recognition Throw Hand Pose Estimation Using Effective Feature Extraction and Classification Approach.基于有效特征提取和分类方法的动态日本手语识别投手姿势估计
Sensors (Basel). 2024 Jan 26;24(3):826. doi: 10.3390/s24030826.
8
Atom Search Optimization with Deep Learning Enabled Arabic Sign Language Recognition for Speaking and Hearing Disability Persons.基于深度学习的原子搜索优化算法用于聋哑人士阿拉伯语手语识别
Healthcare (Basel). 2022 Aug 24;10(9):1606. doi: 10.3390/healthcare10091606.
9
Sign Language Recognition Method Based on Palm Definition Model and Multiple Classification.基于手掌定义模型和多分类的手语识别方法。
Sensors (Basel). 2022 Sep 1;22(17):6621. doi: 10.3390/s22176621.
10
American Sign Language Alphabet Recognition by Extracting Feature from Hand Pose Estimation.从手姿态估计中提取特征实现美国手语字母识别。
Sensors (Basel). 2021 Aug 31;21(17):5856. doi: 10.3390/s21175856.

引用本文的文献

1
Bilingual Sign Language Recognition: A YOLOv11-Based Model for Bangla and English Alphabets.双语手语识别:一种基于YOLOv11的孟加拉语和英语字母模型。
J Imaging. 2025 Apr 27;11(5):134. doi: 10.3390/jimaging11050134.

本文引用的文献

1
Empirical Study on Classifiers for Earlier Prediction of COVID-19 Infection Cure and Death Rate in the Indian States.关于印度各邦新冠病毒感染治愈率和死亡率早期预测分类器的实证研究
Healthcare (Basel). 2022 Jan 2;10(1):85. doi: 10.3390/healthcare10010085.
2
Traffic accident detection and condition analysis based on social networking data.基于社交网络数据的交通事故检测与状态分析。
Accid Anal Prev. 2021 Mar;151:105973. doi: 10.1016/j.aap.2021.105973. Epub 2021 Jan 15.