Chen Jichi, Cui Yuguo, Qian Cheng, He Enqiu
School of Mechanical Engineering, Shenyang University of Technology, Shenyang, Liaoning, China.
School of Chemical Equipment, Shenyang University of Technology, Liaoyang, Liaoning, China.
Comput Methods Biomech Biomed Engin. 2025 Feb;28(3):303-313. doi: 10.1080/10255842.2023.2286918. Epub 2023 Nov 28.
Emotion recognition (ER) plays a crucial role in enabling machines to perceive human emotional and psychological states, thus enhancing human-machine interaction. Recently, there has been a growing interest in ER based on electroencephalogram (EEG) signals. However, due to the noisy, nonlinear, and nonstationary properties of electroencephalography signals, developing an automatic and high-accuracy ER system is still a challenging task. In this study, a pretrained deep residual convolutional neural network model, including 17 convolutional layers and one fully connected layer with transfer learning technique in combination frequency-channel matrices (FCM) of two-dimensional data based on Welch power spectral density estimate from the one-dimensional EEG data has been proposed for improving the ER by automatically learning the underlying intrinsic features of multi-channel EEG data. The experiment result shows a mean accuracy of 93.61 ± 0.84%, a mean precision of 94.70 ± 0.60%, a mean sensitivity of 95.13 ± 1.02%, a mean specificity of 91.04 ± 1.02%, and a mean F1-score of 94.91 ± 0.68%, respectively using 5-fold cross-validation on the DEAP dataset. Meanwhile, to better explore and understand how the proposed model works, we noted that the ranking of clustering effect of FCM for the same category by employing the -distributed stochastic neighbor embedding strategy is: softmax layer activation is the best, the middle convolutional layer activation is the second, and the early max pooling layer activation is the worst. These findings confirm the promising potential of combining deep learning approaches with transfer learning techniques and FCM for effective ER tasks.
情感识别(ER)在使机器能够感知人类情绪和心理状态从而增强人机交互方面发挥着关键作用。近年来,基于脑电图(EEG)信号的情感识别越来越受到关注。然而,由于脑电图信号具有噪声、非线性和非平稳特性,开发一个自动且高精度的情感识别系统仍然是一项具有挑战性的任务。在本研究中,提出了一种预训练的深度残差卷积神经网络模型,该模型包括17个卷积层和1个全连接层,并结合基于一维EEG数据的韦尔奇功率谱密度估计的二维数据的频率 - 通道矩阵(FCM)中的迁移学习技术,通过自动学习多通道EEG数据的潜在内在特征来改进情感识别。实验结果表明,在DEAP数据集上使用5折交叉验证时,平均准确率为93.61±0.84%,平均精确率为94.70±0.60%,平均灵敏度为95.13±1.02%,平均特异性为91.04±1.02%,平均F1分数为94.91±0.68%。同时,为了更好地探索和理解所提出的模型是如何工作的,我们注意到,采用t - 分布随机邻域嵌入策略时,FCM对同一类别的聚类效果排名为:softmax层激活最佳,中间卷积层激活次之,早期最大池化层激活最差。这些发现证实了将深度学习方法与迁移学习技术以及FCM相结合用于有效情感识别任务的潜在前景。