Emotion Recognition Using EEG Signals and Audiovisual Features with Contrastive Learning.

Authors

Lee Ju-Hwan, Kim Jin-Young, Kim Hyoung-Gook

Affiliations

Department of Intelligent Electronics and Computer Engineering, Chonnam National University, 77 Yongbong-ro, Buk-gu, Gwangju 61186, Republic of Korea.

Department of Electronic Convergence Engineering, Kwangwoon University, 20 Gwangun-ro, Nowon-gu, Seoul 01897, Republic of Korea.

Publication

Bioengineering (Basel). 2024 Oct 3;11(10):997. doi: 10.3390/bioengineering11100997.

Abstract

Multimodal emotion recognition has emerged as a promising approach to capture the complex nature of human emotions by integrating information from various sources such as physiological signals, visual behavioral cues, and audio-visual content. However, current methods often struggle with effectively processing redundant or conflicting information across modalities and may overlook implicit inter-modal correlations. To address these challenges, this paper presents a novel multimodal emotion recognition framework that integrates audio-visual features with viewers' EEG data to enhance emotion classification accuracy. The proposed approach employs modality-specific encoders to extract spatiotemporal features, which are then aligned through contrastive learning to capture inter-modal relationships. Additionally, cross-modal attention mechanisms are incorporated for effective feature fusion across modalities. The framework, comprising pre-training, fine-tuning, and testing phases, is evaluated on multiple datasets of emotional responses. The experimental results demonstrate that the proposed multimodal approach, which combines audio-visual features with EEG data, is highly effective in recognizing emotions, highlighting its potential for advancing emotion recognition systems.
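The abstract names two mechanisms worth making concrete: contrastive alignment of modality embeddings and cross-modal attention fusion. Below is a minimal PyTorch sketch of both. The paper's actual encoders, dimensions, and loss are not given in the abstract, so every module name, hyperparameter, and the symmetric CLIP-style InfoNCE objective used here are illustrative assumptions, not the authors' implementation.

```python
# Minimal sketch of (1) contrastive alignment of EEG and audio-visual
# embeddings and (2) cross-modal attention fusion, as described in the
# abstract. All names, dimensions, and the InfoNCE-style loss are
# assumptions for illustration only.
import torch
import torch.nn as nn
import torch.nn.functional as F


class ContrastiveAligner(nn.Module):
    """Projects each modality into a shared space and computes a
    symmetric InfoNCE loss over a batch of paired (EEG, AV) clips."""

    def __init__(self, eeg_dim: int, av_dim: int, shared_dim: int = 128,
                 temperature: float = 0.07):
        super().__init__()
        self.eeg_proj = nn.Linear(eeg_dim, shared_dim)
        self.av_proj = nn.Linear(av_dim, shared_dim)
        self.temperature = temperature

    def forward(self, eeg_feat: torch.Tensor, av_feat: torch.Tensor):
        # L2-normalize so the dot product is a cosine similarity.
        z_eeg = F.normalize(self.eeg_proj(eeg_feat), dim=-1)
        z_av = F.normalize(self.av_proj(av_feat), dim=-1)
        logits = z_eeg @ z_av.t() / self.temperature  # (B, B) similarities
        targets = torch.arange(logits.size(0), device=logits.device)
        # Matching (EEG_i, AV_i) pairs are positives; all others negatives.
        loss = (F.cross_entropy(logits, targets)
                + F.cross_entropy(logits.t(), targets)) / 2
        return loss, z_eeg, z_av


class CrossModalFusion(nn.Module):
    """EEG tokens attend to audio-visual tokens (and vice versa);
    the pooled result feeds an emotion classifier head."""

    def __init__(self, dim: int = 128, heads: int = 4, n_classes: int = 4):
        super().__init__()
        self.eeg_to_av = nn.MultiheadAttention(dim, heads, batch_first=True)
        self.av_to_eeg = nn.MultiheadAttention(dim, heads, batch_first=True)
        self.classifier = nn.Linear(2 * dim, n_classes)

    def forward(self, eeg_tokens: torch.Tensor, av_tokens: torch.Tensor):
        # Queries come from one modality, keys/values from the other.
        eeg_ctx, _ = self.eeg_to_av(eeg_tokens, av_tokens, av_tokens)
        av_ctx, _ = self.av_to_eeg(av_tokens, eeg_tokens, eeg_tokens)
        fused = torch.cat([eeg_ctx.mean(dim=1), av_ctx.mean(dim=1)], dim=-1)
        return self.classifier(fused)


if __name__ == "__main__":
    B, T = 8, 16
    eeg_feat = torch.randn(B, 256)        # pooled EEG encoder output
    av_feat = torch.randn(B, 512)         # pooled audio-visual output
    loss, _, _ = ContrastiveAligner(256, 512)(eeg_feat, av_feat)
    logits = CrossModalFusion()(torch.randn(B, T, 128), torch.randn(B, T, 128))
    print(loss.item(), logits.shape)      # scalar loss, (B, n_classes)
```

Mapped onto the phases the abstract describes, the contrastive loss would drive pre-training on paired clips, while the fusion module and classifier head would be fine-tuned on emotion labels; again, this mapping is inferred from the abstract rather than taken from the paper's code.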

Figure 1: https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9ced/11504283/5ee692447f88/bioengineering-11-00997-g001.jpg
