Zhang Yongqiang, Peng Lixin, Ma Guilei, Man Menghua, Liu Shanghe
National Key Laboratory on Electromagnetic Environment Effects, Army Engineering University, Shijiazhuang, China.
School of Information Science and Engineering, Hebei University of Science and Technology, Shijiazhuang, China.
Front Neurorobot. 2022 Jun 7;16:903197. doi: 10.3389/fnbot.2022.903197. eCollection 2022.
In this article, a multi-layer convolutional neural network (ResNet-18) and Long Short-Term Memory Networks (LSTM) model is proposed for dynamic gesture recognition. The Soli dataset is based on the dynamic gesture signals collected by millimeter-wave radar. As a gesture sensor radar, Soli radar has high positional accuracy and can recognize small movements, to achieve the ultimate goal of Human-Computer Interaction (HCI). A set of velocity-range Doppler images transformed from the original signal is used as the input of the model. Especially, ResNet-18 is used to extract deeper spatial features and solve the problem of gradient extinction or gradient explosion. LSTM is used to extract temporal features and solve the problem of long-time dependence. The model was implemented on the Soli dataset for the dynamic gesture recognition experiment, where the accuracy of gesture recognition obtained 92.55%. Finally, compare the model with the traditional methods. The result shows that the model proposed in this paper achieves higher accuracy in dynamic gesture recognition. The validity of the model is verified by experiments.
本文提出了一种用于动态手势识别的多层卷积神经网络(ResNet-18)和长短期记忆网络(LSTM)模型。Soli数据集基于毫米波雷达收集的动态手势信号。作为一种手势传感器雷达,Soli雷达具有较高的位置精度,能够识别微小动作,以实现人机交互(HCI)的最终目标。从原始信号转换而来的一组速度-距离多普勒图像用作模型的输入。特别是,ResNet-18用于提取更深层次的空间特征并解决梯度消失或梯度爆炸问题。LSTM用于提取时间特征并解决长期依赖问题。该模型在Soli数据集上进行了动态手势识别实验,其中手势识别准确率达到了92.55%。最后,将该模型与传统方法进行比较。结果表明,本文提出的模型在动态手势识别中实现了更高的准确率。通过实验验证了该模型的有效性。