• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于深度张量的语音自动抑郁识别方法。

A deep tensor-based approach for automatic depression recognition from speech utterances.

机构信息

Electronics and Electrical Engineering Dept, Indian Institute of Technology Guwahati, Assam, India.

Electrical Engineering Dept, Indian Institute of Technology Dharwad, Dharwad, Karnataka, India.

出版信息

PLoS One. 2022 Aug 11;17(8):e0272659. doi: 10.1371/journal.pone.0272659. eCollection 2022.

DOI:10.1371/journal.pone.0272659
PMID:35951508
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC9371305/
Abstract

Depression is one of the significant mental health issues affecting all age groups globally. While it has been widely recognized to be one of the major disease burdens in populations, complexities in definitive diagnosis present a major challenge. Usually, trained psychologists utilize conventional methods including individualized interview assessment and manually administered PHQ-8 scoring. However, heterogeneity in symptomatic presentations, which span somatic to affective complaints, impart substantial subjectivity in its diagnosis. Diagnostic accuracy is further compounded by the cross-sectional nature of sporadic assessment methods during physician-office visits, especially since depressive symptoms/severity may evolve over time. With widespread acceptance of smart wearable devices and smartphones, passive monitoring of depression traits using behavioral signals such as speech presents a unique opportunity as companion diagnostics to assist the trained clinicians in objective assessment over time. Therefore, we propose a framework for automated depression classification leveraging alterations in speech patterns in the well documented and extensively studied DAIC-WOZ depression dataset. This novel tensor-based approach requires a substantially simpler implementation architecture and extracts discriminative features for depression recognition with high f1 score and accuracy. We posit that such algorithms, which use significantly less compute load would allow effective onboard deployment in wearables for improve diagnostics accuracy and real-time monitoring of depressive disorders.

摘要

抑郁症是影响全球各年龄段人群的重大心理健康问题之一。尽管它已被广泛认为是人群中主要疾病负担之一,但明确诊断的复杂性带来了重大挑战。通常,经过培训的心理学家会利用包括个体访谈评估和手动 PHQ-8 评分在内的传统方法。然而,症状表现的异质性,从躯体到情感抱怨都存在很大的主观性,这使得其诊断变得更加复杂。由于在医生办公室就诊期间,偶发性评估方法具有横断面性质,因此诊断准确性进一步复杂化,尤其是因为抑郁症状/严重程度可能随时间而演变。随着智能可穿戴设备和智能手机的广泛接受,使用行为信号(如语音)被动监测抑郁特征为伴随诊断提供了一个独特的机会,以帮助受过训练的临床医生进行随时间的客观评估。因此,我们提出了一种利用在经过充分记录和广泛研究的 DAIC-WOZ 抑郁数据集中文本模式变化的自动化抑郁分类框架。这种新颖的基于张量的方法需要一个结构简单得多的实现架构,并利用具有高 f1 分数和准确性的判别特征来识别抑郁。我们假设,这种使用计算负载显著较少的算法将允许在可穿戴设备中进行有效部署,以提高诊断准确性并实时监测抑郁障碍。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1cab/9371305/30afc4cef016/pone.0272659.g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1cab/9371305/7a55f322aacd/pone.0272659.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1cab/9371305/b7a3320b5a3a/pone.0272659.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1cab/9371305/2e1575662e7d/pone.0272659.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1cab/9371305/30afc4cef016/pone.0272659.g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1cab/9371305/7a55f322aacd/pone.0272659.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1cab/9371305/b7a3320b5a3a/pone.0272659.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1cab/9371305/2e1575662e7d/pone.0272659.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1cab/9371305/30afc4cef016/pone.0272659.g004.jpg

相似文献

1
A deep tensor-based approach for automatic depression recognition from speech utterances.基于深度张量的语音自动抑郁识别方法。
PLoS One. 2022 Aug 11;17(8):e0272659. doi: 10.1371/journal.pone.0272659. eCollection 2022.
2
Speechformer-CTC: Sequential Modeling of Depression Detection with Speech Temporal Classification.语音former-CTC:基于语音时间分类的抑郁症检测序列建模
Speech Commun. 2024 Sep;163. doi: 10.1016/j.specom.2024.103106. Epub 2024 Jul 18.
3
Challenges in Using mHealth Data From Smartphones and Wearable Devices to Predict Depression Symptom Severity: Retrospective Analysis.使用智能手机和可穿戴设备的移动医疗数据预测抑郁症状严重程度的挑战:回顾性分析。
J Med Internet Res. 2023 Aug 14;25:e45233. doi: 10.2196/45233.
4
End-to-end multimodal clinical depression recognition using deep neural networks: A comparative analysis.端到端使用深度神经网络进行多模态临床抑郁症识别:比较分析。
Comput Methods Programs Biomed. 2021 Nov;211:106433. doi: 10.1016/j.cmpb.2021.106433. Epub 2021 Sep 28.
5
[A research on depression recognition based on voice pre-training model].基于语音预训练模型的抑郁症识别研究
Sheng Wu Yi Xue Gong Cheng Xue Za Zhi. 2024 Feb 25;41(1):9-16. doi: 10.7507/1001-5515.202304008.
6
Relationship Between Major Depression Symptom Severity and Sleep Collected Using a Wristband Wearable Device: Multicenter Longitudinal Observational Study.腕带可穿戴设备采集的重度抑郁症症状严重程度与睡眠的关系:多中心纵向观察性研究。
JMIR Mhealth Uhealth. 2021 Apr 12;9(4):e24604. doi: 10.2196/24604.
7
A Machine Learning Approach for Detecting Digital Behavioral Patterns of Depression Using Nonintrusive Smartphone Data (Complementary Path to Patient Health Questionnaire-9 Assessment): Prospective Observational Study.一种使用非侵入性智能手机数据检测抑郁症数字行为模式的机器学习方法(患者健康问卷-9评估的补充途径):前瞻性观察研究。
JMIR Form Res. 2022 May 16;6(5):e37736. doi: 10.2196/37736.
8
Attention guided learnable time-domain filterbanks for speech depression detection.注意力引导可学习时域滤波器组用于语音抑郁检测。
Neural Netw. 2023 Aug;165:135-149. doi: 10.1016/j.neunet.2023.05.041. Epub 2023 May 26.
9
Manifestation of depression in speech overlaps with characteristics used to represent and recognize speaker identity.抑郁在言语中的表现与用于代表和识别说话者身份的特征重叠。
Sci Rep. 2023 Jul 10;13(1):11155. doi: 10.1038/s41598-023-35184-7.
10
A Model of Normality Inspired Deep Learning Framework for Depression Relapse Prediction Using Audiovisual Data.基于深度学习的视听数据预测抑郁症复发的正常模式模型。
Comput Methods Programs Biomed. 2022 Nov;226:107132. doi: 10.1016/j.cmpb.2022.107132. Epub 2022 Sep 20.

本文引用的文献

1
End-to-end multimodal clinical depression recognition using deep neural networks: A comparative analysis.端到端使用深度神经网络进行多模态临床抑郁症识别:比较分析。
Comput Methods Programs Biomed. 2021 Nov;211:106433. doi: 10.1016/j.cmpb.2021.106433. Epub 2021 Sep 28.
2
Cough Recognition Based on Mel-Spectrogram and Convolutional Neural Network.基于梅尔频谱图和卷积神经网络的咳嗽识别
Front Robot AI. 2021 May 7;8:580080. doi: 10.3389/frobt.2021.580080. eCollection 2021.
3
Automatic Detection of Depression in Speech Using Ensemble Convolutional Neural Networks.
使用集成卷积神经网络自动检测语音中的抑郁症
Entropy (Basel). 2020 Jun 20;22(6):688. doi: 10.3390/e22060688.
4
Understanding the Complex of Suicide in Depression: from Research to Clinics.理解抑郁症中的自杀情结:从研究到临床
Psychiatry Investig. 2020 Mar;17(3):207-221. doi: 10.30773/pi.2019.0171. Epub 2020 Mar 23.
5
Modeling Depression Symptoms from Social Network Data through Multiple Instance Learning.通过多实例学习从社交网络数据中建模抑郁症状
AMIA Jt Summits Transl Sci Proc. 2019 May 6;2019:44-53. eCollection 2019.
6
A Weakly Supervised Learning Framework for Detecting Social Anxiety and Depression.一种用于检测社交焦虑和抑郁的弱监督学习框架。
Proc ACM Interact Mob Wearable Ubiquitous Technol. 2018 Jun;2(2). doi: 10.1145/3214284.
7
Critical appraisal of major depression with suicidal ideation.对伴有自杀意念的重度抑郁症的批判性评价。
Ann Gen Psychiatry. 2019 May 31;18:7. doi: 10.1186/s12991-019-0232-8. eCollection 2019.
8
Tensor-Factorized Neural Networks.张量分解神经网络。
IEEE Trans Neural Netw Learn Syst. 2018 May;29(5):1998-2011. doi: 10.1109/TNNLS.2017.2690379. Epub 2017 Apr 17.
9
Studying depression using imaging and machine learning methods.使用成像和机器学习方法研究抑郁症。
Neuroimage Clin. 2015 Nov 10;10:115-23. doi: 10.1016/j.nicl.2015.11.003. eCollection 2016.
10
pyAudioAnalysis: An Open-Source Python Library for Audio Signal Analysis.pyAudioAnalysis:一个用于音频信号分析的开源Python库。
PLoS One. 2015 Dec 11;10(12):e0144610. doi: 10.1371/journal.pone.0144610. eCollection 2015.