从面部图像中减少用于抑郁估计的有噪声标注。

Reducing noisy annotations for depression estimation from facial images.

机构信息

School of Computer Science and Technology, Xi'an University of Posts and Telecommunications, Xi'an, 710121, Shaanxi, China; Shaanxi Key Laboratory of Network Data Analysis and Intelligent Processing, Xi'an University of Posts and Telecommunications, Xi'an, 710121, Shaanxi, China; Xi'an Key Laboratory of Big Data and Intelligent Computing, Xi'an University of Posts and Telecommunications, Xi'an, 710121, Shaanxi, China.

Department of Computer Science, Aalto University, Espoo, Finland.

出版信息

Neural Netw. 2022 Sep;153:120-129. doi: 10.1016/j.neunet.2022.05.025. Epub 2022 Jun 3.

DOI:10.1016/j.neunet.2022.05.025

PMID:35717754

Abstract

Depression has been considered the most dominant mental disorder over the past few years. To help clinicians effectively and efficiently estimate the severity scale of depression, various automated systems based on deep learning have been proposed. To estimate the severity of depression, i.e., the depression severity score (Beck Depression Inventory-II), various deep architectures have been designed to perform regression using the Euclidean loss. However, they do not consider the label distribution, and they do not learn the relationships between the facial images and BDI-II scores, which can be resulting in the noisy labeling for automatic depression estimation (ADE). To mitigate this problem, we propose an automated deep architecture, namely the self-adaptation network (SAN), to improve this uncertain labeling for ADE. Specifically, the architecture consists of four modules: (1) ResNet-18 and ResNet-50 are adopted in the deep feature extraction module (DFEM) to extract informative deep features; (2) a self-attention module (SAM) is adopted to learn the weights from the mini-batch; (3) a square ranking regularization module (SRRM) to create high partitions and low partitions is proposed; and (4) a re-label module (RM) is used to re-label the uncertain annotations for ADE in the low partitions. We conduct extensive experiments on depression databases (i.e., AVEC2013 and AVEC2014) and obtain a performance comparable to the performances of other ADE methods in assessing the severity of depression. More importantly, the proposed method can learn valuable depression patterns from facial videos and obtain a performance comparable to the performances of other methods for depression recognition.

摘要

在过去的几年中，抑郁症一直被认为是最主要的精神障碍。为了帮助临床医生有效地评估抑郁症的严重程度，已经提出了各种基于深度学习的自动化系统。为了评估抑郁症的严重程度，即抑郁严重程度评分（贝克抑郁量表-II），已经设计了各种深度架构来使用欧几里得损失进行回归。然而，它们没有考虑标签分布，也没有学习面部图像和 BDI-II 分数之间的关系，这可能导致自动抑郁评估（ADE）的噪声标记。为了解决这个问题，我们提出了一种自动化的深度架构，即自适应网络（SAN），以改善 ADE 的这种不确定标记。具体来说，该架构由四个模块组成：（1）深度特征提取模块（DFEM）采用 ResNet-18 和 ResNet-50 提取信息丰富的深度特征；（2）采用自注意力模块（SAM）学习来自小批量的权重；（3）提出了一个平方排序正则化模块（SRRM）来创建高分区和低分区；（4）使用再标记模块（RM）在低分区中对 ADE 的不确定注释进行再标记。我们在抑郁症数据库（即 AVEC2013 和 AVEC2014）上进行了广泛的实验，并获得了与其他 ADE 方法评估抑郁症严重程度的性能相当的性能。更重要的是，该方法可以从面部视频中学习有价值的抑郁模式，并获得与其他抑郁识别方法相当的性能。

相似文献

Reducing noisy annotations for depression estimation from facial images.从面部图像中减少用于抑郁估计的有噪声标注。

Neural Netw. 2022 Sep;153:120-129. doi: 10.1016/j.neunet.2022.05.025. Epub 2022 Jun 3.

Leveraging ResNet and label distribution in advanced intelligent systems for facial expression recognition.利用 ResNet 和标签分布在高级智能系统中进行面部表情识别。

Math Biosci Eng. 2023 Apr 24;20(6):11101-11115. doi: 10.3934/mbe.2023491.

A facial depression recognition method based on hybrid multi-head cross attention network.一种基于混合多头交叉注意力网络的面部凹陷识别方法。

Front Neurosci. 2023 May 24;17:1188434. doi: 10.3389/fnins.2023.1188434. eCollection 2023.

Automated depression analysis using convolutional neural networks from speech.基于语音的卷积神经网络进行自动抑郁分析。

J Biomed Inform. 2018 Jul;83:103-111. doi: 10.1016/j.jbi.2018.05.007. Epub 2018 May 29.

PRA-Net: Part-and-Relation Attention Network for depression recognition from facial expression.PRA-Net：基于部件和关系注意的面部表情识别抑郁状态网络

Comput Biol Med. 2023 May;157:106589. doi: 10.1016/j.compbiomed.2023.106589. Epub 2023 Jan 24.

Speech depression recognition based on attentional residual network.基于注意力残差网络的语音抑郁识别。

Front Biosci (Landmark Ed). 2021 Dec 30;26(12):1746-1759. doi: 10.52586/5066.

S-CUDA: Self-cleansing unsupervised domain adaptation for medical image segmentation.S-CUDA：用于医学图像分割的自清洁无监督域适应

Med Image Anal. 2021 Dec;74:102214. doi: 10.1016/j.media.2021.102214. Epub 2021 Aug 12.

Retrieval-based face annotation by weak label regularized local coordinate coding.基于弱标签正则化局部坐标编码的检索式人脸标注。

IEEE Trans Pattern Anal Mach Intell. 2014 Mar;36(3):550-63. doi: 10.1109/TPAMI.2013.145.

Structure-Coherent Deep Feature Learning for Robust Face Alignment.结构一致的深度特征学习用于鲁棒人脸对齐。

IEEE Trans Image Process. 2021;30:5313-5326. doi: 10.1109/TIP.2021.3082319. Epub 2021 Jun 2.

Hybrid Attention Cascade Network for Facial Expression Recognition.用于面部表情识别的混合注意力级联网络。

Sensors (Basel). 2021 Mar 12;21(6):2003. doi: 10.3390/s21062003.

引用本文的文献

Automatic recognition of depression based on audio and video: A review.基于音频和视频的抑郁症自动识别：综述

World J Psychiatry. 2024 Feb 19;14(2):225-233. doi: 10.5498/wjp.v14.i2.225.

A facial depression recognition method based on hybrid multi-head cross attention network.一种基于混合多头交叉注意力网络的面部凹陷识别方法。

Front Neurosci. 2023 May 24;17:1188434. doi: 10.3389/fnins.2023.1188434. eCollection 2023.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验

从面部图像中减少用于抑郁估计的有噪声标注。

Reducing noisy annotations for depression estimation from facial images.

机构信息

出版信息

相似文献

引用本文的文献

文献检索

文件翻译

深度研究

Suppr 超能文献

相似文献

引用本文的文献