
Design of a 3D emotion mapping model for visual feature analysis using improved Gaussian mixture models.

Author information

Wang Enshi, Khan Fakhri Alam

Affiliations

School of Digital Art, Wuxi Vocational College of Science and Technology, Wuxi, Jiangsu, China.

Information and Computer Science Department, King Fahd University of Petroleum and Minerals, Dhahran, Saudi Arabia.

Publication information

PeerJ Comput Sci. 2024 Dec 20;10:e2596. doi: 10.7717/peerj-cs.2596. eCollection 2024.

Abstract

Multimodal recognition systems integrate color-emotion space information from multiple feature sources, and fusing this information effectively presents a significant challenge. This article proposes a three-dimensional (3D) color-emotion space visual feature extraction model for multimodal data integration, based on an improved Gaussian mixture model, to address this problem. Unlike traditional methods, which often struggle with redundant information and high model complexity, the approach optimizes feature fusion by employing entropy and visual feature sequences. By integrating machine vision with six activation functions and utilizing multiple aesthetic features, the proposed method achieves strong performance: an emotion mapping accuracy (EMA) of 92.4%, an emotion recognition precision (ERP) of 88.35%, and an emotion recognition F1 score (ERFS) of 96.22%. These improvements over traditional approaches highlight the model's effectiveness in reducing complexity while enhancing emotion recognition accuracy, positioning it as a more efficient solution for visual emotion analysis in multimedia applications. The findings indicate that the model significantly enhances emotional recognition accuracy.
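To make the abstract's pipeline concrete, the sketch below shows one way a Gaussian mixture model could be fitted to an image's color features, with an entropy-based weighting of components and a projection onto three emotion axes. This is a minimal illustration, not the paper's algorithm: the component count, the entropy weighting rule, the (valence, arousal, dominance) axes, and the fixed linear projection are all assumptions made for the example.

```python
# Illustrative sketch (not the published method): GMM over color features,
# entropy-weighted component fusion, and a placeholder 3D emotion projection.
import numpy as np
from sklearn.mixture import GaussianMixture

def extract_color_features(image):
    """Flatten an H x W x 3 image into an (N, 3) array of color samples."""
    return image.reshape(-1, 3).astype(np.float64)

def fit_color_gmm(features, n_components=6, seed=0):
    """Fit a GMM over color samples; each component summarizes one color mode."""
    gmm = GaussianMixture(n_components=n_components,
                          covariance_type="full",
                          random_state=seed)
    gmm.fit(features)
    return gmm

def entropy_weights(gmm, features):
    """Entropy-weighted component importances (an assumed fusion rule)."""
    resp = gmm.predict_proba(features)          # (N, K) responsibilities
    occupancy = resp.mean(axis=0)               # average mass per component
    q = resp / resp.sum(axis=0, keepdims=True)  # column-normalized responsibilities
    h = -(q * np.log(q + 1e-12)).sum(axis=0)    # entropy of each component over samples
    w = occupancy / (1.0 + h)                   # penalize diffuse components
    return w / w.sum()

def map_to_emotion_space(gmm, weights):
    """Project the weighted mean color onto three assumed emotion axes."""
    # Placeholder linear map from color means to (valence, arousal, dominance);
    # a real model would learn this mapping from annotated data.
    projection = np.array([[0.4, -0.2, 0.1],
                           [-0.1, 0.5, 0.2],
                           [0.2, 0.1, -0.3]])
    weighted_mean = (weights[:, None] * gmm.means_).sum(axis=0)  # (3,)
    return projection @ weighted_mean                            # 3D emotion coordinates

# Usage sketch on a synthetic image.
rng = np.random.default_rng(0)
image = rng.integers(0, 256, size=(64, 64, 3))
features = extract_color_features(image)
gmm = fit_color_gmm(features)
weights = entropy_weights(gmm, features)
print(map_to_emotion_space(gmm, weights))  # assumed (valence, arousal, dominance) vector
```

The design choice to down-weight high-entropy components reflects the abstract's emphasis on suppressing redundant information during fusion; the specific formula used here is only one plausible realization of that idea.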


[Figure] https://cdn.ncbi.nlm.nih.gov/pmc/blobs/234b/11753788/24ea9679bbd5/peerj-cs-10-2596-g002.jpg
