• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于图形卷积神经网络的多音频特征对抑郁的分类及其严重程度

Classification of Depression and Its Severity Based on Multiple Audio Features Using a Graphical Convolutional Neural Network.

机构信息

Division of Information and Electronic Engineering, Muroran Institute of Technology, 27-1, Mizumoto-cho, Muroran 050-8585, Hokkaido, Japan.

College of Information and Systems, Muroran Institute of Technology, 27-1, Mizumoto-cho, Muroran 050-8585, Hokkaido, Japan.

出版信息

Int J Environ Res Public Health. 2023 Jan 15;20(2):1588. doi: 10.3390/ijerph20021588.

DOI:10.3390/ijerph20021588
PMID:36674342
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC9864471/
Abstract

Audio features are physical features that reflect single or complex coordinated movements in the vocal organs. Hence, in speech-based automatic depression classification, it is critical to consider the relationship among audio features. Here, we propose a deep learning-based classification model for discriminating depression and its severity using correlation among audio features. This model represents the correlation between audio features as graph structures and learns speech characteristics using a graph convolutional neural network. We conducted classification experiments in which the same subjects were allowed to be included in both the training and test data (Setting 1) and the subjects in the training and test data were completely separated (Setting 2). The results showed that the classification accuracy in Setting 1 significantly outperformed existing state-of-the-art methods, whereas that in Setting 2, which has not been presented in existing studies, was much lower than in Setting 1. We conclude that the proposed model is an effective tool for discriminating recurring patients and their severities, but it is difficult to detect new depressed patients. For practical application of the model, depression-specific speech regions appearing locally rather than the entire speech of depressed patients should be detected and assigned the appropriate class labels.

摘要

音频特征是反映发声器官单一或复杂协调运动的物理特征。因此,在基于语音的自动抑郁分类中,考虑音频特征之间的关系至关重要。在这里,我们提出了一种基于深度学习的分类模型,用于使用音频特征之间的相关性来区分抑郁及其严重程度。该模型将音频特征之间的相关性表示为图结构,并使用图卷积神经网络学习语音特征。我们进行了分类实验,其中允许相同的受试者同时包含在训练和测试数据中(设置 1),并且训练和测试数据中的受试者完全分开(设置 2)。结果表明,设置 1 中的分类准确率明显优于现有最先进的方法,而在现有研究中尚未提出的设置 2 中的准确率要低得多。我们得出结论,所提出的模型是区分复发性患者及其严重程度的有效工具,但很难检测到新的抑郁患者。对于模型的实际应用,应该检测出现局部而非抑郁患者整个语音的抑郁特异性语音区域,并为其分配适当的类别标签。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7651/9864471/bcb2246b2855/ijerph-20-01588-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7651/9864471/e5a0cdb29b8b/ijerph-20-01588-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7651/9864471/6ed476319265/ijerph-20-01588-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7651/9864471/731cae507f50/ijerph-20-01588-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7651/9864471/bcb2246b2855/ijerph-20-01588-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7651/9864471/e5a0cdb29b8b/ijerph-20-01588-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7651/9864471/6ed476319265/ijerph-20-01588-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7651/9864471/731cae507f50/ijerph-20-01588-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7651/9864471/bcb2246b2855/ijerph-20-01588-g004.jpg

相似文献

1
Classification of Depression and Its Severity Based on Multiple Audio Features Using a Graphical Convolutional Neural Network.基于图形卷积神经网络的多音频特征对抑郁的分类及其严重程度
Int J Environ Res Public Health. 2023 Jan 15;20(2):1588. doi: 10.3390/ijerph20021588.
2
A New Regression Model for Depression Severity Prediction Based on Correlation among Audio Features Using a Graph Convolutional Neural Network.一种基于图卷积神经网络利用音频特征间相关性进行抑郁严重程度预测的新型回归模型。
Diagnostics (Basel). 2023 Feb 14;13(4):727. doi: 10.3390/diagnostics13040727.
3
End-to-end multimodal clinical depression recognition using deep neural networks: A comparative analysis.端到端使用深度神经网络进行多模态临床抑郁症识别:比较分析。
Comput Methods Programs Biomed. 2021 Nov;211:106433. doi: 10.1016/j.cmpb.2021.106433. Epub 2021 Sep 28.
4
Depression recognition using a proposed speech chain model fusing speech production and perception features.基于语音产生和感知特征融合的语音链模型进行抑郁识别。
J Affect Disord. 2023 Feb 15;323:299-308. doi: 10.1016/j.jad.2022.11.060. Epub 2022 Nov 30.
5
Automatic Assessment of Aphasic Speech Sensed by Audio Sensors for Classification into Aphasia Severity Levels to Recommend Speech Therapies.利用音频传感器自动评估失语症患者的语音,以对失语症严重程度进行分类,从而推荐相应的语言治疗方法。
Sensors (Basel). 2022 Sep 14;22(18):6966. doi: 10.3390/s22186966.
6
Impact of Feature Selection Algorithm on Speech Emotion Recognition Using Deep Convolutional Neural Network.基于深度卷积神经网络的特征选择算法对语音情感识别的影响。
Sensors (Basel). 2020 Oct 23;20(21):6008. doi: 10.3390/s20216008.
7
Neuronetwork Approach in the Early Diagnosis of Depression.神经网络方法在抑郁症早期诊断中的应用。
Psychiatr Danub. 2023 Oct;35(Suppl 2):77-85.
8
Automated depression analysis using convolutional neural networks from speech.基于语音的卷积神经网络进行自动抑郁分析。
J Biomed Inform. 2018 Jul;83:103-111. doi: 10.1016/j.jbi.2018.05.007. Epub 2018 May 29.
9
Depression Severity Classification from Speech Emotion.基于语音情感的抑郁严重程度分类
Annu Int Conf IEEE Eng Med Biol Soc. 2018 Jul;2018:5763-5766. doi: 10.1109/EMBC.2018.8513610.
10
Automatic Depression Detection Using Smartphone-Based Text-Dependent Speech Signals: Deep Convolutional Neural Network Approach.使用基于智能手机的文本相关语音信号进行自动抑郁检测:深度学习卷积神经网络方法。
J Med Internet Res. 2023 Jan 25;25:e34474. doi: 10.2196/34474.

引用本文的文献

1
Diagnostic accuracy of deep learning using speech samples in depression: a systematic review and meta-analysis.使用语音样本进行深度学习对抑郁症的诊断准确性:系统评价和荟萃分析。
J Am Med Inform Assoc. 2024 Oct 1;31(10):2394-2404. doi: 10.1093/jamia/ocae189.
2
Diagnosis of mental disorders using machine learning: Literature review and bibliometric mapping from 2012 to 2023.使用机器学习诊断精神障碍:2012年至2023年的文献综述与文献计量映射
Heliyon. 2024 Jun 8;10(12):e32548. doi: 10.1016/j.heliyon.2024.e32548. eCollection 2024 Jun 30.

本文引用的文献

1
A Weakly Supervised Learning Framework for Detecting Social Anxiety and Depression.一种用于检测社交焦虑和抑郁的弱监督学习框架。
Proc ACM Interact Mob Wearable Ubiquitous Technol. 2018 Jun;2(2). doi: 10.1145/3214284.
2
Focal Loss for Dense Object Detection.用于密集目标检测的焦散损失
IEEE Trans Pattern Anal Mach Intell. 2020 Feb;42(2):318-327. doi: 10.1109/TPAMI.2018.2858826. Epub 2018 Jul 23.
3
Clinical diagnosis of depression in primary care: a meta-analysis.基层医疗中抑郁症的临床诊断:一项荟萃分析。
Lancet. 2009 Aug 22;374(9690):609-19. doi: 10.1016/S0140-6736(09)60879-5. Epub 2009 Jul 27.
4
The PHQ-8 as a measure of current depression in the general population.PHQ-8作为一般人群当前抑郁状况的一种测量工具。
J Affect Disord. 2009 Apr;114(1-3):163-73. doi: 10.1016/j.jad.2008.06.026. Epub 2008 Aug 27.
5
Motor and cognitive aspects of motor retardation in depression.抑郁症中运动迟缓的运动和认知方面
J Affect Disord. 2000 Jan-Mar;57(1-3):83-93. doi: 10.1016/s0165-0327(99)00068-3.
6
Adequacy and duration of antidepressant treatment in primary care.初级保健中抗抑郁药治疗的充分性和疗程
Med Care. 1992 Jan;30(1):67-76. doi: 10.1097/00005650-199201000-00007.