Alayed Asmaa
Department of Software Engineering, College of Computing, Umm Al-Qura University, Makkah 21955, Saudi Arabia.
Sensors (Basel). 2024 Dec 5;24(23):7798. doi: 10.3390/s24237798.
Sign language (SL) is a means of communication that is used to bridge the gap between the deaf, hearing-impaired, and others. For Arabic speakers who are hard of hearing or deaf, Arabic Sign Language (ArSL) is a form of nonverbal communication. The development of effective Arabic sign language recognition (ArSLR) tools helps facilitate this communication, especially for people who are not familiar with ArSLR. Although researchers have investigated various machine learning (ML) and deep learning (DL) methods and techniques that affect the performance of ArSLR systems, a systematic review of these methods is lacking. The objectives of this study are to present a comprehensive overview of research on ArSL recognition and present insights from previous research papers. In this study, a systematic literature review of ArSLR based on ML/DL methods and techniques published between 2014 and 2023 is conducted. Three online databases are used: Web of Science (WoS), IEEE Xplore, and Scopus. Each study has undergone the proper screening processes, which include inclusion and exclusion criteria. Throughout this systematic review, PRISMA guidelines have been appropriately followed and applied. The results of this screening are divided into two parts: analysis of all the datasets utilized in the reviewed papers, underscoring their characteristics and importance, and discussion of the ML/DL techniques' potential and limitations. From the 56 articles included in this study, it was noticed that most of the research papers focus on fingerspelling and isolated word recognition rather than continuous sentence recognition, and the vast majority of them are vision-based approaches. The challenges remaining in the field and future research directions in this area of study are also discussed.
手语是一种用于弥合聋人、听力受损者和其他人之间沟通障碍的交流方式。对于讲阿拉伯语的听力障碍或失聪者来说,阿拉伯手语(ArSL)是一种非语言交流形式。有效的阿拉伯手语识别(ArSLR)工具的开发有助于促进这种交流,特别是对于不熟悉ArSLR的人。尽管研究人员已经研究了各种影响ArSLR系统性能的机器学习(ML)和深度学习(DL)方法及技术,但缺乏对这些方法的系统综述。本研究的目的是全面概述阿拉伯手语识别研究,并呈现以往研究论文的见解。在本研究中,对2014年至2023年间发表的基于ML/DL方法和技术的ArSLR进行了系统的文献综述。使用了三个在线数据库:科学网(WoS)、IEEE Xplore和Scopus。每项研究都经过了适当的筛选过程,包括纳入和排除标准。在整个系统综述过程中,适当遵循和应用了PRISMA指南。筛选结果分为两部分:分析综述论文中使用的所有数据集,强调其特征和重要性,以及讨论ML/DL技术的潜力和局限性。从本研究纳入的56篇文章中可以注意到,大多数研究论文关注手指拼写和孤立单词识别,而非连续句子识别,并且其中绝大多数是基于视觉的方法。还讨论了该研究领域中仍然存在的挑战以及未来的研究方向。