Multimodal Fusion of EEG and Audio Spectrogram for Major Depressive Disorder Recognition Using Modified DenseNet121.

Author information

Yousufi Musyyab, Damaševičius Robertas, Maskeliūnas Rytis

Affiliations

Centre of Real Time Computer Systems, Kaunas University of Technology, 51368 Kaunas, Lithuania.

Publication information

Brain Sci. 2024 Oct 15;14(10):1018. doi: 10.3390/brainsci14101018.

DOI:10.3390/brainsci14101018

PMID:39452032

Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC11505707/

Abstract

BACKGROUND/OBJECTIVES: This study investigates the classification of Major Depressive Disorder (MDD) using electroencephalography (EEG) Short-Time Fourier Transform (STFT) spectrograms and audio Mel-spectrograms from 52 subjects. The objective is to develop a multimodal classification model that integrates audio and EEG data to accurately identify depressive tendencies.

METHODS

We used the Multimodal Open Dataset for Mental-Disorder Analysis (MODMA) and fine-tuned a pre-trained DenseNet121 model via transfer learning. Features from the EEG and audio modalities were extracted and concatenated before being passed to the final classification layer. Additionally, an ablation study was conducted on the EEG and audio data separately.
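The feature-level fusion step described above can be sketched as follows. This is a minimal illustration, not the authors' implementation: the feature vectors are random stand-ins for the outputs of two DenseNet121 backbones, the 1024-dimensional size is that of the standard DenseNet121 pooled feature (the paper's modified network may differ), and `fuse_and_classify`, the binary class count, and the single linear layer are assumptions made for the example.

```python
import numpy as np

FEATURE_DIM = 1024  # pooled feature size of the standard DenseNet121 (assumed unchanged)
NUM_CLASSES = 2     # assumed binary task: MDD vs. healthy control


def fuse_and_classify(eeg_feat, audio_feat, weights, bias):
    """Concatenate per-modality features, then apply one linear + softmax layer."""
    fused = np.concatenate([eeg_feat, audio_feat], axis=1)  # (batch, 2 * FEATURE_DIM)
    logits = fused @ weights + bias                          # (batch, NUM_CLASSES)
    logits = logits - logits.max(axis=1, keepdims=True)      # subtract max for stability
    exp = np.exp(logits)
    return exp / exp.sum(axis=1, keepdims=True)              # class probabilities


# Random placeholders standing in for backbone outputs and trained parameters.
rng = np.random.default_rng(0)
eeg_feat = rng.standard_normal((4, FEATURE_DIM))
audio_feat = rng.standard_normal((4, FEATURE_DIM))
weights = rng.standard_normal((2 * FEATURE_DIM, NUM_CLASSES)) * 0.01
bias = np.zeros(NUM_CLASSES)

probs = fuse_and_classify(eeg_feat, audio_feat, weights, bias)
print(probs.shape)
```

Concatenation keeps each modality's features intact and leaves it to the final layer to weigh them, which is why the single-modality ablation amounts to dropping one of the two inputs.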

RESULTS

The proposed multimodal classification model outperformed existing methods, achieving an accuracy of 97.53%, precision of 98.20%, F1 score of 97.76%, and recall of 97.32%. A confusion matrix was also used to evaluate the model's effectiveness.
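For reference, the four reported metrics all derive from the entries of a binary confusion matrix. The sketch below uses hypothetical counts (they are not taken from the paper) and a helper name, `binary_metrics`, chosen for the example.

```python
def binary_metrics(tp, fp, fn, tn):
    """Compute accuracy, precision, recall and F1 from confusion-matrix counts."""
    accuracy = (tp + tn) / (tp + fp + fn + tn)
    precision = tp / (tp + fp)          # share of predicted positives that are correct
    recall = tp / (tp + fn)             # share of actual positives that are found
    f1 = 2 * precision * recall / (precision + recall)  # harmonic mean of the two
    return {"accuracy": accuracy, "precision": precision, "recall": recall, "f1": f1}


# Hypothetical counts for illustration only.
metrics = binary_metrics(tp=45, fp=5, fn=10, tn=40)
print(metrics)

# Harmonic mean of the precision and recall reported in the abstract.
reported_f1 = 2 * 0.9820 * 0.9732 / (0.9820 + 0.9732)
print(round(reported_f1, 4))
```

The harmonic mean of the reported precision and recall comes to about 97.76%, which agrees with the F1 score stated in the abstract.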

CONCLUSIONS

The paper presents a robust multimodal classification approach that outperforms state-of-the-art methods with potential application in clinical diagnostics for depression assessment.
