• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

HiMul-LGG:一种基于分层决策融合的局部-全局图神经网络,用于对话中的多模态情感识别。

HiMul-LGG: A hierarchical decision fusion-based local-global graph neural network for multimodal emotion recognition in conversation.

作者信息

Fu Changzeng, Qian Fengkui, Su Kaifeng, Su Yikai, Wang Ze, Shi Jiaqi, Liu Zhigang, Liu Chaoran, Ishi Carlos Toshinori

机构信息

Northeastern University, China; Osaka University, Japan; RIKEN, Japan; Hebei Key Laboratory of Marine Perception Network and Data Processing, China.

Northeastern University, China.

出版信息

Neural Netw. 2025 Jan;181:106764. doi: 10.1016/j.neunet.2024.106764. Epub 2024 Sep 28.

DOI:10.1016/j.neunet.2024.106764
PMID:39368277
Abstract

Emotion recognition in conversation (ERC) is a vital task that requires deciphering human emotions through analysis of contextual and multimodal information. However, extant research on ERC concentrates predominantly on investigating multimodal fusion while overlooking the model's constraints in dealing with unimodal representation discrepancy and speaker dependencies. To address the aforementioned problems, this paper proposes a Hierarchical decision fusion-based Local-Global Graph Neural Network for multimodal ERC (HiMul-LGG). HiMul-LGG employs a hierarchical decision fusion strategy to ensure feature alignment across modalities. Moreover, HiMul-LGG also adopts a local-global graph neural network architecture to reinforce inter-modality and intra-modality speaker dependency. Additionally, HiMul-LGG utilizes a cross-modal multi-head attention mechanism to promote interplay between modalities. We evaluate HiMul-LGG on two emotion recognition datasets, IEMOCAP and MELD, where HiMul-LGG outperforms existing methods. The results of the ablation study also imply the effectiveness of the proposed hierarchical decision fusion strategy and local-global structure of Graph construction.

摘要

对话中的情感识别(ERC)是一项至关重要的任务,需要通过分析上下文和多模态信息来解读人类情感。然而,现有的关于ERC的研究主要集中在调查多模态融合上,而忽略了模型在处理单模态表示差异和说话者依赖性方面的限制。为了解决上述问题,本文提出了一种基于分层决策融合的局部-全局图神经网络用于多模态ERC(HiMul-LGG)。HiMul-LGG采用分层决策融合策略来确保跨模态的特征对齐。此外,HiMul-LGG还采用局部-全局图神经网络架构来加强跨模态和模态内的说话者依赖性。此外,HiMul-LGG利用跨模态多头注意力机制来促进模态之间的相互作用。我们在两个情感识别数据集IEMOCAP和MELD上对HiMul-LGG进行了评估,HiMul-LGG的表现优于现有方法。消融研究的结果也表明了所提出的分层决策融合策略和图构建的局部-全局结构的有效性。

相似文献

1
HiMul-LGG: A hierarchical decision fusion-based local-global graph neural network for multimodal emotion recognition in conversation.HiMul-LGG:一种基于分层决策融合的局部-全局图神经网络,用于对话中的多模态情感识别。
Neural Netw. 2025 Jan;181:106764. doi: 10.1016/j.neunet.2024.106764. Epub 2024 Sep 28.
2
HGF-MiLaG: Hierarchical Graph Fusion for Emotion Recognition in Conversation with Mid-Late Gender-Aware Strategy.HGF-MiLaG:用于对话中情感识别的具有中后期性别感知策略的分层图融合
Sensors (Basel). 2025 Feb 14;25(4):1182. doi: 10.3390/s25041182.
3
Uncertainty-Aware Graph Contrastive Fusion Network for multimodal physiological signal emotion recognition.用于多模态生理信号情感识别的不确定性感知图对比融合网络
Neural Netw. 2025 Jul;187:107363. doi: 10.1016/j.neunet.2025.107363. Epub 2025 Mar 14.
4
DER-GCN: Dialog and Event Relation-Aware Graph Convolutional Neural Network for Multimodal Dialog Emotion Recognition.DER-GCN:用于多模态对话情感识别的对话与事件关系感知图卷积神经网络
IEEE Trans Neural Netw Learn Syst. 2025 Mar;36(3):4908-4921. doi: 10.1109/TNNLS.2024.3367940. Epub 2025 Feb 28.
5
Multimodal Emotion Recognition Based on Cascaded Multichannel and Hierarchical Fusion.基于级联多通道和分层融合的多模态情绪识别。
Comput Intell Neurosci. 2023 Jan 5;2023:9645611. doi: 10.1155/2023/9645611. eCollection 2023.
6
LGGNet: Learning From Local-Global-Graph Representations for Brain-Computer Interface.LGGNet:基于局部-全局-图表示的脑机接口学习。
IEEE Trans Neural Netw Learn Syst. 2024 Jul;35(7):9773-9786. doi: 10.1109/TNNLS.2023.3236635. Epub 2024 Jul 8.
7
A 3D hierarchical cross-modality interaction network using transformers and convolutions for brain glioma segmentation in MR images.一种使用变换和卷积的 3D 层次跨模态交互网络,用于磁共振图像中的脑胶质瘤分割。
Med Phys. 2024 Nov;51(11):8371-8389. doi: 10.1002/mp.17354. Epub 2024 Aug 13.
8
Hierarchical multimodal self-attention-based graph neural network for DTI prediction.基于分层多模态自注意力的图神经网络用于 DTI 预测。
Brief Bioinform. 2024 May 23;25(4). doi: 10.1093/bib/bbae293.
9
Residual-Based Graph Convolutional Network for Emotion Recognition in Conversation for Smart Internet of Things.基于残差的图卷积网络在智能物联网对话中的情感识别。
Big Data. 2021 Aug;9(4):279-288. doi: 10.1089/big.2020.0274. Epub 2021 Mar 2.
10
Attention-Based Temporal Graph Representation Learning for EEG-Based Emotion Recognition.基于注意力的时变图表示学习在基于 EEG 的情绪识别中的应用。
IEEE J Biomed Health Inform. 2024 Oct;28(10):5755-5767. doi: 10.1109/JBHI.2024.3395622. Epub 2024 Oct 3.

引用本文的文献

1
HGLER: A hierarchical heterogeneous graph networks for enhanced multimodal emotion recognition in conversations.HGLER:用于增强对话中多模态情感识别的分层异构图网络。
PLoS One. 2025 Sep 5;20(9):e0330632. doi: 10.1371/journal.pone.0330632. eCollection 2025.
2
HGF-MiLaG: Hierarchical Graph Fusion for Emotion Recognition in Conversation with Mid-Late Gender-Aware Strategy.HGF-MiLaG:用于对话中情感识别的具有中后期性别感知策略的分层图融合
Sensors (Basel). 2025 Feb 14;25(4):1182. doi: 10.3390/s25041182.