• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于面部多模态数据的抑郁症诊断。

Diagnosis of depression based on facial multimodal data.

作者信息

Jin Nani, Ye Renjia, Li Peng

机构信息

Materdicine Lab, School of Life Sciences, Shanghai University, Shanghai, China.

Research Department, Third Xiangya Hospital of Central South University, Changsha, China.

出版信息

Front Psychiatry. 2025 Jan 28;16:1508772. doi: 10.3389/fpsyt.2025.1508772. eCollection 2025.

DOI:10.3389/fpsyt.2025.1508772
PMID:39935533
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11811426/
Abstract

INTRODUCTION

Depression is a serious mental health disease. Traditional scale-based depression diagnosis methods often have problems of strong subjectivity and high misdiagnosis rate, so it is particularly important to develop automatic diagnostic tools based on objective indicators.

METHODS

This study proposes a deep learning method that fuses multimodal data to automatically diagnose depression using facial video and audio data. We use spatiotemporal attention module to enhance the extraction of visual features and combine the Graph Convolutional Network (GCN) and the Long and Short Term Memory (LSTM) to analyze the audio features. Through the multi-modal feature fusion, the model can effectively capture different feature patterns related to depression.

RESULTS

We conduct extensive experiments on the publicly available clinical dataset, the Extended Distress Analysis Interview Corpus (E-DAIC). The experimental results show that we achieve robust accuracy on the E-DAIC dataset, with a Mean Absolute Error (MAE) of 3.51 in estimating PHQ-8 scores from recorded interviews.

DISCUSSION

Compared with existing methods, our model shows excellent performance in multi-modal information fusion, which is suitable for early evaluation of depression.

摘要

引言

抑郁症是一种严重的心理健康疾病。传统的基于量表的抑郁症诊断方法往往存在主观性强和误诊率高的问题,因此开发基于客观指标的自动诊断工具尤为重要。

方法

本研究提出了一种融合多模态数据的深度学习方法,利用面部视频和音频数据自动诊断抑郁症。我们使用时空注意力模块来增强视觉特征的提取,并结合图卷积网络(GCN)和长短时记忆网络(LSTM)来分析音频特征。通过多模态特征融合,该模型能够有效捕捉与抑郁症相关的不同特征模式。

结果

我们在公开可用的临床数据集——扩展痛苦分析访谈语料库(E-DAIC)上进行了广泛的实验。实验结果表明,我们在E-DAIC数据集上取得了稳健的准确率,从录制的访谈中估计PHQ-8分数时的平均绝对误差(MAE)为3.51。

讨论

与现有方法相比,我们的模型在多模态信息融合方面表现出优异的性能,适用于抑郁症的早期评估。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ecbd/11811426/2221c13c1423/fpsyt-16-1508772-g013.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ecbd/11811426/61d6ce9120bc/fpsyt-16-1508772-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ecbd/11811426/f47892821d50/fpsyt-16-1508772-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ecbd/11811426/94c31515a904/fpsyt-16-1508772-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ecbd/11811426/a1d04169c57b/fpsyt-16-1508772-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ecbd/11811426/5b35ace9846e/fpsyt-16-1508772-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ecbd/11811426/51005ec4cd81/fpsyt-16-1508772-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ecbd/11811426/700f43eedc03/fpsyt-16-1508772-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ecbd/11811426/c8efe5be65f5/fpsyt-16-1508772-g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ecbd/11811426/049a3872d77b/fpsyt-16-1508772-g009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ecbd/11811426/eb0b32f1aaa6/fpsyt-16-1508772-g010.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ecbd/11811426/e479b88b8976/fpsyt-16-1508772-g011.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ecbd/11811426/d52e19510696/fpsyt-16-1508772-g012.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ecbd/11811426/2221c13c1423/fpsyt-16-1508772-g013.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ecbd/11811426/61d6ce9120bc/fpsyt-16-1508772-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ecbd/11811426/f47892821d50/fpsyt-16-1508772-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ecbd/11811426/94c31515a904/fpsyt-16-1508772-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ecbd/11811426/a1d04169c57b/fpsyt-16-1508772-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ecbd/11811426/5b35ace9846e/fpsyt-16-1508772-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ecbd/11811426/51005ec4cd81/fpsyt-16-1508772-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ecbd/11811426/700f43eedc03/fpsyt-16-1508772-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ecbd/11811426/c8efe5be65f5/fpsyt-16-1508772-g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ecbd/11811426/049a3872d77b/fpsyt-16-1508772-g009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ecbd/11811426/eb0b32f1aaa6/fpsyt-16-1508772-g010.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ecbd/11811426/e479b88b8976/fpsyt-16-1508772-g011.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ecbd/11811426/d52e19510696/fpsyt-16-1508772-g012.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ecbd/11811426/2221c13c1423/fpsyt-16-1508772-g013.jpg

相似文献

1
Diagnosis of depression based on facial multimodal data.基于面部多模态数据的抑郁症诊断。
Front Psychiatry. 2025 Jan 28;16:1508772. doi: 10.3389/fpsyt.2025.1508772. eCollection 2025.
2
End-to-end multimodal clinical depression recognition using deep neural networks: A comparative analysis.端到端使用深度神经网络进行多模态临床抑郁症识别:比较分析。
Comput Methods Programs Biomed. 2021 Nov;211:106433. doi: 10.1016/j.cmpb.2021.106433. Epub 2021 Sep 28.
3
AMGCN-L: an adaptive multi-time-window graph convolutional network with long-short-term memory for depression detection.AMGCN-L:一种具有长短时记忆的自适应多时间窗图卷积网络,用于抑郁检测。
J Neural Eng. 2023 Oct 27;20(5). doi: 10.1088/1741-2552/ad038b.
4
MAMF-GCN: Multi-scale adaptive multi-channel fusion deep graph convolutional network for predicting mental disorder.MAMF-GCN:用于预测精神障碍的多尺度自适应多通道融合深度图卷积网络。
Comput Biol Med. 2022 Sep;148:105823. doi: 10.1016/j.compbiomed.2022.105823. Epub 2022 Jul 6.
5
A Multimodal Approach for Detection and Assessment of Depression Using Text, Audio and Video.一种使用文本、音频和视频检测与评估抑郁症的多模态方法。
Phenomics. 2024 May 3;4(3):234-249. doi: 10.1007/s43657-023-00152-8. eCollection 2024 Jun.
6
Multi-Head Attention-Based Long Short-Term Memory for Depression Detection From Speech.基于多头注意力机制的长短期记忆网络用于从语音中检测抑郁症
Front Neurorobot. 2021 Aug 26;15:684037. doi: 10.3389/fnbot.2021.684037. eCollection 2021.
7
Multimodal depression detection based on an attention graph convolution and transformer.基于注意力图卷积和变换器的多模态抑郁症检测
Math Biosci Eng. 2025 Feb 27;22(3):652-676. doi: 10.3934/mbe.2025024.
8
DepITCM: an audio-visual method for detecting depression.DepITCM:一种检测抑郁症的视听方法。
Front Psychiatry. 2025 Jan 23;15:1466507. doi: 10.3389/fpsyt.2024.1466507. eCollection 2024.
9
Sentence-level multi-modal feature learning for depression recognition.用于抑郁症识别的句子级多模态特征学习
Front Psychiatry. 2025 Mar 21;16:1439577. doi: 10.3389/fpsyt.2025.1439577. eCollection 2025.
10
Multi-Modal Adaptive Fusion Transformer Network for the Estimation of Depression Level.多模态自适应融合 Transformer 网络用于抑郁水平估计。
Sensors (Basel). 2021 Jul 12;21(14):4764. doi: 10.3390/s21144764.

本文引用的文献

1
A New Regression Model for Depression Severity Prediction Based on Correlation among Audio Features Using a Graph Convolutional Neural Network.一种基于图卷积神经网络利用音频特征间相关性进行抑郁严重程度预测的新型回归模型。
Diagnostics (Basel). 2023 Feb 14;13(4):727. doi: 10.3390/diagnostics13040727.
2
Voice Acoustic Parameters as Predictors of Depression.作为抑郁症预测指标的语音声学参数
J Voice. 2024 Jan;38(1):77-85. doi: 10.1016/j.jvoice.2021.06.018. Epub 2021 Aug 2.
3
Machine learning in major depression: From classification to treatment outcome prediction.
机器学习在重度抑郁症中的应用:从分类到治疗结局预测。
CNS Neurosci Ther. 2018 Nov;24(11):1037-1052. doi: 10.1111/cns.13048. Epub 2018 Aug 23.
4
Risk Factors for Depression: Differential Across Age?抑郁的风险因素:是否因年龄而异?
Am J Geriatr Psychiatry. 2017 Sep;25(9):966-977. doi: 10.1016/j.jagp.2017.04.004. Epub 2017 Apr 7.
5
Depression sum-scores don't add up: why analyzing specific depression symptoms is essential.抑郁总分并不等同于各症状得分总和:为何分析特定抑郁症状至关重要。
BMC Med. 2015 Apr 6;13:72. doi: 10.1186/s12916-015-0325-4.
6
The PHQ-8 as a measure of current depression in the general population.PHQ-8作为一般人群当前抑郁状况的一种测量工具。
J Affect Disord. 2009 Apr;114(1-3):163-73. doi: 10.1016/j.jad.2008.06.026. Epub 2008 Aug 27.