Suppr超能文献

基于注意力机制和特征金字塔模型的抑郁症自动诊断

Automatic diagnosis of depression based on attention mechanism and feature pyramid model.

作者信息

Xu Ningya, Huo Hua, Xu Jiaxin, Ma Lan, Wang Jinxuan

机构信息

Information Engineering College, Henan University of Science and Technology, Luoyang, Henan, China.

Engineering Technology Research Center of Big Data and Computational Intelligence, Henan University of Science and Technology, Luoyang, Henan, China.

出版信息

PLoS One. 2024 Mar 12;19(3):e0295051. doi: 10.1371/journal.pone.0295051. eCollection 2024.

Abstract

Currently, most diagnoses of depression are evaluated by medical professionals, with the results of these evaluations influenced by the subjective judgment of physicians. Physiological studies have shown that depressed patients display facial movements, head posture, and gaze direction disorders. To accurately diagnose the degree of depression of patients, this paper proposes a comprehensive framework, Cross-Channel Attentional Depression Detection Network, which can automatically diagnose the degree of depression of patients by inputting information from the facial images of depressed patients. Specifically, the comprehensive framework is composed of three main modules: (1) Face key point detection and cropping for video images based on Multi-Task Convolutional Neural Network. (2) The improved Feature Pyramid Networks model can fuse shallow features and deep features in video images and reduce the loss of miniscule features. (3) A proposed Cross-Channel Attention Convolutional Neural Network can enhance the interaction between tensor channel layers. Compared to other methods for automatic depression identification, a superior method was obtained by conducting extensive experiments on the depression dataset AVEC 2014, where the Root Mean Square Error and the Mean Absolute Error were 8.65 and 6.66, respectively.

摘要

目前,大多数抑郁症诊断由医学专业人员进行评估,这些评估结果受医生主观判断的影响。生理学研究表明,抑郁症患者存在面部运动、头部姿势和注视方向障碍。为了准确诊断患者的抑郁程度,本文提出了一个综合框架——跨通道注意力抑郁检测网络,该网络可以通过输入抑郁症患者面部图像的信息来自动诊断患者的抑郁程度。具体来说,该综合框架由三个主要模块组成:(1)基于多任务卷积神经网络的视频图像面部关键点检测与裁剪。(2)改进的特征金字塔网络模型可以融合视频图像中的浅层特征和深层特征,并减少微小特征的损失。(3)提出的跨通道注意力卷积神经网络可以增强张量通道层之间的交互。与其他自动抑郁识别方法相比,通过在抑郁数据集AVEC 2014上进行大量实验,获得了一种 superior 方法,其均方根误差和平均绝对误差分别为8.65和6.66。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4892/10931460/c5905cb5d91b/pone.0295051.g001.jpg

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验