Agab Salah Eddine, Chelali Fatma Zohra
Speech communication and signal processing laboratory, Faculty of Electrical Engineering, University of Sciences and Technology Houari Boumediene (USTHB), Box n°: 32 El Alia, 16111 Algiers, Algeria.
Multimed Tools Appl. 2023 Jan 31:1-31. doi: 10.1007/s11042-023-14433-x.
In recent years, researchers have been focusing on developing Human-Computer Interfaces that are fast, intuitive, and allow direct interaction with the computing environment. One of the most natural ways of communication is hand gestures. In this context, many systems were developed to recognize hand gestures using numerous vision-based techniques, these systems are highly affected by acquisition constraints, such as resolution, noise, lighting condition, hand shape, and pose. To enhance the performance under such constraints, we propose a static and dynamic hand gesture recognition system, which utilizes the Dual-Tree Complex Wavelet Transform to produce an approximation image characterized by less noise and redundancy. Subsequently, the Histogram of Oriented Gradients is applied to the resulting image to extract relevant information and produce a compact features vector. For classification, we compare the performance of three Artificial Neural Networks, namely, MLP, PNN, and RBNN. Random Decision Forest and SVM classifiers are also used to ameliorate the efficiency of our system. Experimental evaluation is performed on four datasets composed of alphabet signs and dynamic gestures. The obtained results demonstrate the efficiency of the combined features, for which the achieved recognition rates were comparable to the state-of-the-art.
近年来,研究人员一直致力于开发快速、直观且能与计算环境直接交互的人机界面。最自然的交流方式之一是手势。在此背景下,人们开发了许多系统,利用多种基于视觉的技术来识别手势,但这些系统极易受到采集限制的影响,如分辨率、噪声、光照条件、手的形状和姿势等。为了在这些限制条件下提高性能,我们提出了一种静态和动态手势识别系统,该系统利用双树复小波变换生成具有较少噪声和冗余的近似图像。随后,将定向梯度直方图应用于所得图像,以提取相关信息并生成紧凑的特征向量。为了进行分类,我们比较了三种人工神经网络,即多层感知器(MLP)、概率神经网络(PNN)和径向基神经网络(RBNN)的性能。还使用随机决策森林和支持向量机分类器来提高我们系统的效率。在由字母符号和动态手势组成的四个数据集上进行了实验评估。所得结果证明了组合特征的有效性,其识别率与当前最先进的水平相当。