Khademi Zahra, Ebrahimi Farideh, Kordy Hussain Montazery
Faculty of Electrical and Computer Engineering, Babol Noshirvani University of Technology, Shariati Ave., Babol, Iran.
Comput Biol Med. 2022 Apr;143:105288. doi: 10.1016/j.compbiomed.2022.105288. Epub 2022 Feb 10.
In the Motor Imagery (MI)-based Brain Computer Interface (BCI), users' intention is converted into a control signal through processing a specific pattern in brain signals reflecting motor characteristics. There are such restrictions as the limited size of the existing datasets and low signal to noise ratio in the classification of MI Electroencephalogram (EEG) signals. Machine learning (ML) methods, particularly Deep Learning (DL), have overcome these limitations relatively. In this study, three hybrid models were proposed to classify the EEG signal in the MI-based BCI. The proposed hybrid models consist of the convolutional neural networks (CNN) and the Long-Short Term Memory (LSTM). In the first model, the CNN with different number of convolutional-pooling blocks (from shallow to deep CNN) was examined; a two-block CNN model not affected by the vanishing gradient descent and yet able to extract desirable features employed; the second and third models contained pre-trained CNNs conducing to the exploration of more complex features. The transfer learning strategy and data augmentation methods were applied to overcome the limited size of the datasets by transferring learning from one model to another. This was achieved by employing two powerful pre-trained convolutional neural networks namely ResNet-50 and Inception-v3. The continuous wavelet transform (CWT) was used to generate images for the CNN. The performance of the proposed models was evaluated on the BCI Competition IV dataset 2a. The mean accuracy vlaues of 86%, 90%, and 92%, and mean Kappa values of 81%, 86%, and 88% were obtained for the hybrid neural network with the customized CNN, the hybrid neural network with ResNet-50 and the hybrid neural network with Inception-v3, respectively. Despite the promising performance of the three proposed models, the hybrid neural network with Inception-v3 outperformed the two other models. The best obtained result in the present study improved the previous best result in the literature by 7% in terms of classification accuracy. From the findings, it can be concluded that transfer learning based on a pre-trained CNN in combination with LSTM is a novel method in MI-based BCI. The study also has implications for the discrimination of motor imagery tasks in each EEG recording channel and in different brain regions which can reduce computational time in future works by only selecting the most effective channels.
在基于运动想象(MI)的脑机接口(BCI)中,用户的意图通过处理反映运动特征的脑信号中的特定模式被转换为控制信号。在MI脑电图(EEG)信号分类中存在诸如现有数据集规模有限以及信噪比低等限制。机器学习(ML)方法,尤其是深度学习(DL),相对克服了这些限制。在本研究中,提出了三种混合模型来对基于MI的BCI中的EEG信号进行分类。所提出的混合模型由卷积神经网络(CNN)和长短期记忆网络(LSTM)组成。在第一个模型中,研究了具有不同数量卷积池化块的CNN(从浅层到深层CNN);采用了一个不受梯度消失影响且能够提取理想特征的双块CNN模型;第二个和第三个模型包含有助于探索更复杂特征的预训练CNN。应用迁移学习策略和数据增强方法,通过从一个模型向另一个模型迁移学习来克服数据集规模有限的问题。这是通过使用两个强大的预训练卷积神经网络即ResNet - 50和Inception - v3来实现的。连续小波变换(CWT)用于为CNN生成图像。在所提出模型的性能在BCI竞赛IV数据集2a上进行了评估。对于定制CNN的混合神经网络、具有ResNet - 50的混合神经网络和具有Inception - v3的混合神经网络,分别获得了86%、90%和92%的平均准确率值,以及81%、86%和88%的平均卡帕值。尽管所提出的三个模型都有不错的性能,但具有Inception - v3的混合神经网络优于其他两个模型。本研究中获得的最佳结果在分类准确率方面比文献中先前的最佳结果提高了7%。从研究结果可以得出结论,基于预训练CNN结合LSTM的迁移学习是基于MI的BCI中的一种新方法。该研究对于区分每个EEG记录通道和不同脑区中的运动想象任务也有启示,这可以通过仅选择最有效的通道在未来工作中减少计算时间。