School of Software and Internet of Things Engineering, Jiangxi University of Finance and Economics, Nanchang, Jiangxi, China.
PLoS One. 2022 Nov 7;17(11):e0276436. doi: 10.1371/journal.pone.0276436. eCollection 2022.
In the field of surface electromyography (sEMG) gesture recognition, how to improve recognition accuracy has been a research hotspot. The rapid development of deep learning provides a new solution to this problem. At present, the main applications of deep learning for sEMG gesture feature extraction are based on convolutional neural network (CNN) structures to capture spatial morphological information of the multichannel sEMG or based on long short-term memory network (LSTM) to extract time-dependent information of the single-channel sEMG. However, there are few methods to comprehensively consider the distribution area of the sEMG signal acquisition electrode sensor and the arrangement of the sEMG signal morphological features and electrode spatial features. In this paper, a novel multi-stream feature fusion network (MSFF-Net) model is proposed for sEMG gesture recognition. The model adopts a divide-and-conquer strategy to learn the relationship between different muscle regions and specific gestures. Firstly, a multi-stream convolutional neural network (Multi-stream CNN) and a convolutional block attention module integrated with a resblock (ResCBAM) are used to extract multi-dimensional spatial features from signal morphology, electrode space, and feature map space. Then the learned multi-view depth features are fused by a view aggregation network consisting of an early fusion network and a late fusion network. The results of all subjects and gesture movement validation experiments in the sEMG signal acquired from 12 sensors provided by NinaPro's DB2 and DB4 sub-databases show that the proposed model in this paper has better performance in terms of gesture recognition accuracy compared with the existing models.
在表面肌电信号(sEMG)手势识别领域,如何提高识别精度一直是研究热点。深度学习的快速发展为此问题提供了新的解决方案。目前,深度学习在 sEMG 手势特征提取中的主要应用是基于卷积神经网络(CNN)结构来捕获多通道 sEMG 的空间形态信息,或者基于长短期记忆网络(LSTM)来提取单通道 sEMG 的时变信息。然而,很少有方法全面考虑 sEMG 信号采集电极传感器的分布区域和 sEMG 信号形态特征和电极空间特征的排列。本文提出了一种用于 sEMG 手势识别的新型多流特征融合网络(MSFF-Net)模型。该模型采用分而治之的策略来学习不同肌肉区域和特定手势之间的关系。首先,采用多流卷积神经网络(Multi-stream CNN)和集成 ResBlock 的卷积块注意力模块(ResCBAM)从信号形态、电极空间和特征图空间提取多维空间特征。然后,通过由早期融合网络和晚期融合网络组成的视图聚合网络融合学习到的多视角深度特征。来自 NinaPro 的 DB2 和 DB4 子数据库中 12 个传感器采集的 sEMG 信号的所有受试者和手势运动验证实验的结果表明,与现有模型相比,本文提出的模型在手势识别精度方面具有更好的性能。