Centre for Automation and Robotics (CAR) UPM-CSIC, Universidad Politécnica de Madrid (UPM), 28006 Madrid, Spain.
State Key Laboratory of Internet of Things for Smart City, University of Macao, Macao.
Sensors (Basel). 2024 Jun 4;24(11):3631. doi: 10.3390/s24113631.
Gesture recognition using electromyography (EMG) signals has prevailed recently in the field of human-computer interactions for controlling intelligent prosthetics. Currently, machine learning and deep learning are the two most commonly employed methods for classifying hand gestures. Despite traditional machine learning methods already achieving impressive performance, it is still a huge amount of work to carry out feature extraction manually. The existing deep learning methods utilize complex neural network architectures to achieve higher accuracy, which will suffer from overfitting, insufficient adaptability, and low recognition accuracy. To improve the existing phenomenon, a novel lightweight model named dual stream LSTM feature fusion classifier is proposed based on the concatenation of five time-domain features of EMG signals and raw data, which are both processed with one-dimensional convolutional neural networks and LSTM layers to carry out the classification. The proposed method can effectively capture global features of EMG signals using a simple architecture, which means less computational cost. An experiment is conducted on a public DB1 dataset with 52 gestures, and each of the 27 subjects repeats every gesture 10 times. The accuracy rate achieved by the model is 89.66%, which is comparable to that achieved by more complex deep learning neural networks, and the inference time for each gesture is 87.6 ms, which can also be implied in a real-time control system. The proposed model is validated using a subject-wise experiment on 10 out of the 40 subjects in the DB2 dataset, achieving a mean accuracy of 91.74%. This is illustrated by its ability to fuse time-domain features and raw data to extract more effective information from the sEMG signal and select an appropriate, efficient, lightweight network to enhance the recognition results.
基于肌电信号的手势识别在人机交互领域中得到了广泛应用,可用于控制智能假肢。目前,机器学习和深度学习是两种最常用的手势分类方法。尽管传统的机器学习方法已经取得了令人印象深刻的性能,但手动进行特征提取仍然是一项艰巨的工作。现有的深度学习方法利用复杂的神经网络架构来实现更高的准确性,但会面临过拟合、适应性不足和识别精度低等问题。为了改善现有现象,提出了一种新的轻量级模型,名为双流 LSTM 特征融合分类器,它基于肌电信号和原始数据的五个时域特征的串联,这两个特征都经过一维卷积神经网络和 LSTM 层进行分类。该方法可以使用简单的架构有效地捕捉肌电信号的全局特征,这意味着计算成本更低。在一个包含 52 个手势的公共 DB1 数据集上进行了实验,其中 27 名受试者每人重复每个手势 10 次。该模型的准确率达到 89.66%,与更复杂的深度学习神经网络相当,每个手势的推断时间为 87.6ms,也可以应用于实时控制系统。在 DB2 数据集中的 40 名受试者中的 10 名进行了受试者分类实验,模型的平均准确率为 91.74%,这表明它能够融合时域特征和原始数据,从 sEMG 信号中提取更有效的信息,并选择合适、高效、轻量级的网络来增强识别结果。