Oka Hayato, Ono Keiko, Panagiotis Adamidis
Master's Program in Information and Computer Science, Doshisha University, Kyoto 610-0394, Japan.
Department of Intelligent Information Engineering and Sciences, Doshisha University, Kyoto 610-0394, Japan.
Sensors (Basel). 2024 Dec 21;24(24):8174. doi: 10.3390/s24248174.
Recent advances in emotion recognition through Artificial Intelligence (AI) have demonstrated potential applications in various fields (e.g., healthcare, advertising, and driving technology), with electroencephalogram (EEG)-based approaches demonstrating superior accuracy compared to facial or vocal methods due to their resistance to intentional manipulation. This study presents a novel approach to enhance EEG-based emotion estimation accuracy by emphasizing temporal features and efficient parameter space exploration. We propose a model combining Long Short-Term Memory (LSTM) with an attention mechanism to highlight temporal features in EEG data while optimizing LSTM parameters through Particle Swarm Optimization (PSO). The attention mechanism assigned weights to LSTM hidden states, and PSO dynamically optimizes the vital parameters, including units, batch size, and dropout rate. Using the DEAP and SEED datasets, which serve as benchmark datasets for emotion estimation research using EEG, we evaluate the model's performance. For the DEAP dataset, we conduct a four-class classification of combinations of high and low valence and arousal states. We perform a three-class classification of negative, neutral, and positive emotions for the SEED dataset. The proposed model achieves an accuracy of 0.9409 on the DEAP dataset, surpassing the previous state-of-the-art accuracy of 0.9100 reported by Lin et al. The model attains an accuracy of 0.9732 on the SEED dataset, recording one of the highest accuracies among the related research. These results demonstrate that integrating the attention mechanism with PSO significantly improves the accuracy of EEG-based emotion estimation, contributing to the advancement of emotion recognition technology.
通过人工智能(AI)进行情感识别的最新进展已在各个领域(如医疗保健、广告和驾驶技术)展示了潜在应用,基于脑电图(EEG)的方法由于其抗故意操纵性,与面部或语音方法相比显示出更高的准确性。本研究提出了一种新颖的方法,通过强调时间特征和高效的参数空间探索来提高基于EEG的情感估计准确性。我们提出了一种将长短期记忆(LSTM)与注意力机制相结合的模型,以突出EEG数据中的时间特征,同时通过粒子群优化(PSO)优化LSTM参数。注意力机制为LSTM隐藏状态分配权重,PSO动态优化包括单元、批量大小和丢弃率在内的关键参数。使用作为基于EEG的情感估计研究基准数据集的DEAP和SEED数据集,我们评估了该模型的性能。对于DEAP数据集,我们对高、低价态和唤醒状态的组合进行四类分类。对于SEED数据集,我们对负面、中性和正面情绪进行三类分类。所提出的模型在DEAP数据集上实现了0.9409的准确率,超过了Lin等人报告的先前最高准确率0.9100。该模型在SEED数据集上达到了0.9732的准确率,是相关研究中最高的准确率之一。这些结果表明,将注意力机制与PSO相结合显著提高了基于EEG的情感估计的准确性,有助于情感识别技术的进步。