

A computer vision-based system for recognition and classification of Urdu sign language dataset.

Author information

Zahid Hira, Rashid Munaf, Syed Sidra Abid, Ullah Rafi, Asif Muhammad, Khan Muzammil, Abdul Mujeeb Amenah, Haider Khan Ali

Affiliations

Biomedical Engineering Department and Electrical Engineering Department, Ziauddin University, Karachi, Pakistan.

Electrical Engineering Department and Software Engineering Department, Ziauddin University, Karachi, Pakistan.

Publication information

PeerJ Comput Sci. 2022 Dec 14;8:e1174. doi: 10.7717/peerj-cs.1174. eCollection 2022.

Abstract

Human beings rely heavily on social communication as one of the major aspects of communication. Language is the most effective means of verbal and nonverbal communication and association. To bridge the communication gap between deaf and non-deaf communities, sign language is widely used. According to the World Federation of the Deaf, there are about 70 million deaf people around the globe and about 300 sign languages in use. Hence, the structured form of hand gestures involving visual motions and signs is used as a communication system to help the deaf and speech-impaired community in daily interaction. The aim is to collect a dataset of Urdu sign language (USL) and test it with machine learning classifiers. The proposed system is divided into four main stages: data collection, data acquisition, model training, and model testing. The USL dataset, which comprises 1,560 images, was created by photographing various hand positions with a camera. This work provides a strategy for automated identification of USL numbers based on a bag-of-words (BoW) paradigm. For classification, support vector machine (SVM), random forest, and K-nearest neighbor (K-NN) classifiers are used with the BoW histogram bin frequencies as features. The proposed technique outperforms others in number classification, attaining accuracies of 88%, 90%, and 84% for random forest, SVM, and K-NN, respectively.
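The abstract outlines a bag-of-visual-words pipeline: local descriptors are extracted from each gesture image, clustered into a visual vocabulary, and each image is then represented by a histogram of visual-word frequencies that feeds the SVM, random forest, and K-NN classifiers. The minimal sketch below illustrates that kind of pipeline; the descriptor type (ORB), vocabulary size (100 visual words), train/test split, and classifier hyperparameters are assumptions for illustration, not details taken from the paper.

```python
# Hypothetical sketch of a bag-of-visual-words (BoW) classification pipeline.
# ORB descriptors, 100 visual words, and the dataset layout are assumptions.
import cv2
import numpy as np
from sklearn.cluster import KMeans
from sklearn.ensemble import RandomForestClassifier
from sklearn.metrics import accuracy_score
from sklearn.model_selection import train_test_split
from sklearn.neighbors import KNeighborsClassifier
from sklearn.svm import SVC


def extract_descriptors(image_paths):
    """Compute ORB descriptors for each grayscale hand-gesture image."""
    orb = cv2.ORB_create()
    per_image = []
    for path in image_paths:
        img = cv2.imread(path, cv2.IMREAD_GRAYSCALE)
        _, desc = orb.detectAndCompute(img, None)
        per_image.append(desc if desc is not None else np.empty((0, 32)))
    return per_image


def build_vocabulary(per_image_desc, n_words=100):
    """Cluster all local descriptors into a visual vocabulary (assumed size)."""
    stacked = np.vstack([d for d in per_image_desc if len(d)]).astype(np.float64)
    return KMeans(n_clusters=n_words, n_init=10, random_state=0).fit(stacked)


def bow_histograms(per_image_desc, vocab):
    """Encode each image as a normalized histogram of visual-word frequencies."""
    hists = np.zeros((len(per_image_desc), vocab.n_clusters))
    for i, desc in enumerate(per_image_desc):
        if len(desc):
            words = vocab.predict(desc.astype(np.float64))
            counts = np.bincount(words, minlength=vocab.n_clusters)
            hists[i] = counts / counts.sum()
    return hists


def train_and_evaluate(image_paths, labels):
    """Train SVM, random forest, and K-NN on BoW histograms and report accuracy."""
    per_image_desc = extract_descriptors(image_paths)
    vocab = build_vocabulary(per_image_desc)
    X = bow_histograms(per_image_desc, vocab)
    X_tr, X_te, y_tr, y_te = train_test_split(
        X, labels, test_size=0.2, stratify=labels, random_state=0)
    classifiers = {
        "SVM": SVC(kernel="rbf"),
        "Random Forest": RandomForestClassifier(n_estimators=100, random_state=0),
        "K-NN": KNeighborsClassifier(n_neighbors=5),
    }
    for name, clf in classifiers.items():
        clf.fit(X_tr, y_tr)
        print(name, accuracy_score(y_te, clf.predict(X_te)))


# Usage (paths and labels are placeholders for the 1,560-image USL dataset):
# train_and_evaluate(["usl/0/img_001.jpg", ...], [0, ...])
```

In a pipeline like this, the vocabulary size and the choice of local descriptor largely determine how discriminative the histograms are, so both would normally be tuned on the training split of the USL dataset.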


https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cc22/10281630/a92d5c198b22/peerj-cs-08-1174-g001.jpg
