复杂场景下基于多特征融合模型的毫米波手势识别

Millimeter wave gesture recognition using multi-feature fusion models in complex scenes.

作者信息

Hao Zhanjun, Sun Zhizhou, Li Fenfang, Wang Ruidong, Peng Jianxiang

机构信息

College of Computer Science and Engineering, Northwest Normal University, Lanzhou, 730070, China.

Gansu Province Internet of Things Engineering Research Centre, Northwest Normal University, Lanzhou, 730070, China.

出版信息

Sci Rep. 2024 Jun 14;14(1):13758. doi: 10.1038/s41598-024-64576-6.

DOI:10.1038/s41598-024-64576-6

PMID:38877076

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11178827/

Abstract

As a form of body language, the gesture plays an important role in smart homes, game interactions, and sign language communication, etc. The gesture recognition methods have been carried out extensively. The existing methods have inherent limitations regarding user experience, visual environment, and recognition granularity. Millimeter wave radar provides an effective method for the problems lie ahead gesture recognition because of the advantage of considerable bandwidth and high precision perception. Interfering factors and the complexity of the model raise an enormous challenge to the practical application of gesture recognition methods as the millimeter wave radar is applied to complex scenes. Based on multi-feature fusion, a gesture recognition method for complex scenes is proposed in this work. We collected data in variety places to improve sample reliability, filtered clutters to improve the signal's signal-to-noise ratio (SNR), and then obtained multi features involves range-time map (RTM), Doppler-time map (DTM) and angle-time map (ATM) and fused them to enhance the richness and expression ability of the features. A lightweight neural network model multi-CNN-LSTM is designed to gestures recognition. This model consists of three convolutional neural network (CNN) for three obtained features and one long short-term memory (LSTM) for temporal features. We analyzed the performance and complexity of the model and verified the effectiveness of feature extraction. Numerous experiments have shown that this method has generalization ability, adaptability, and high robustness in complex scenarios. The recognition accuracy of 14 experimental gestures reached 97.28%.

摘要

作为一种肢体语言形式，手势在智能家居、游戏交互和手语通信等方面发挥着重要作用。手势识别方法已经得到了广泛的研究。现有方法在用户体验、视觉环境和识别粒度方面存在固有局限性。毫米波雷达由于具有带宽大、感知精度高的优点，为解决手势识别面临的问题提供了一种有效方法。然而，当毫米波雷达应用于复杂场景时，干扰因素和模型的复杂性对手势识别方法的实际应用提出了巨大挑战。基于此，本文提出了一种基于多特征融合的复杂场景手势识别方法。我们在多个地点收集数据以提高样本可靠性，过滤杂波以提高信号的信噪比（SNR），然后获得包括距离-时间图（RTM）、多普勒-时间图（DTM）和角度-时间图（ATM）在内的多特征，并将它们融合以增强特征的丰富性和表达能力。设计了一种轻量级神经网络模型多卷积神经网络-长短期记忆网络（multi-CNN-LSTM）用于手势识别。该模型由三个用于处理三个获取特征的卷积神经网络（CNN）和一个用于处理时间特征的长短期记忆网络（LSTM）组成。我们分析了模型的性能和复杂性，并验证了特征提取的有效性。大量实验表明，该方法在复杂场景中具有泛化能力、适应性和高鲁棒性。14种实验手势的识别准确率达到了97.28%。