Zhi Hongyi, Yu Zhuliang, Yu Tianyou, Gu Zhenghui, Yang Jian
IEEE Trans Neural Syst Rehabil Eng. 2023;31:3988-3998. doi: 10.1109/TNSRE.2023.3323325. Epub 2023 Oct 18.
Motor imagery (MI) decoding plays a crucial role in the advancement of electroencephalography (EEG)-based brain-computer interface (BCI) technology. Currently, most researches focus on complex deep learning structures for MI decoding. The growing complexity of networks may result in overfitting and lead to inaccurate decoding outcomes due to the redundant information. To address this limitation and make full use of the multi-domain EEG features, a multi-domain temporal-spatial-frequency convolutional neural network (TSFCNet) is proposed for MI decoding. The proposed network provides a novel mechanism that utilize the spatial and temporal EEG features combined with frequency and time-frequency characteristics. This network enables powerful feature extraction without complicated network structure. Specifically, the TSFCNet first employs the MixConv-Residual block to extract multiscale temporal features from multi-band filtered EEG data. Next, the temporal-spatial-frequency convolution block implements three shallow, parallel and independent convolutional operations in spatial, frequency and time-frequency domain, and captures high discriminative representations from these domains respectively. Finally, these features are effectively aggregated by average pooling layers and variance layers, and the network is trained with the joint supervision of the cross-entropy and the center loss. Our experimental results show that the TSFCNet outperforms the state-of-the-art models with superior classification accuracy and kappa values (82.72% and 0.7695 for dataset BCI competition IV 2a, 86.39% and 0.7324 for dataset BCI competition IV 2b). These competitive results demonstrate that the proposed network is promising for enhancing the decoding performance of MI BCIs.
运动想象(MI)解码在基于脑电图(EEG)的脑机接口(BCI)技术发展中起着至关重要的作用。目前,大多数研究集中于用于MI解码的复杂深度学习结构。由于冗余信息,网络复杂度的不断增加可能导致过拟合并产生不准确的解码结果。为了解决这一局限性并充分利用多域EEG特征,本文提出了一种用于MI解码的多域时空频率卷积神经网络(TSFCNet)。所提出的网络提供了一种新颖的机制,该机制利用空间和时间EEG特征并结合频率和时频特征。该网络无需复杂的网络结构就能实现强大的特征提取。具体而言,TSFCNet首先采用MixConv-Residual块从多频段滤波后的EEG数据中提取多尺度时间特征。接下来,时空频率卷积块在空间、频率和时频域中执行三个浅层、并行且独立的卷积操作,并分别从这些域中捕获高判别性表示。最后,这些特征通过平均池化层和方差层进行有效聚合,并在交叉熵和中心损失的联合监督下对网络进行训练。我们的实验结果表明,TSFCNet在分类准确率和kappa值方面优于现有模型(数据集BCI竞赛IV 2a分别为82.72%和0.7695,数据集BCI竞赛IV 2b分别为86.39%和0.7324)。这些具有竞争力的结果表明,所提出的网络在提高MI BCI的解码性能方面具有潜力。