• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

用于基于语音的抑郁症分类和严重程度评估的暹罗神经网络。

Siamese Neural Network for Speech-Based Depression Classification and Severity Assessment.

作者信息

Ntalampiras Stavros, Qi Wen

机构信息

Department of Computer Science, University of Milan, 20135 via Celoria 18, Milan, Italy.

School of Future Technology, South China University of Technology, 510641 Wushan Road 381, Guangzhou, China.

出版信息

J Healthc Inform Res. 2024 Oct 3;8(4):577-593. doi: 10.1007/s41666-024-00175-4. eCollection 2024 Dec.

DOI:10.1007/s41666-024-00175-4
PMID:39463856
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11499503/
Abstract

The evaluation of an individual's mental health and behavioral functioning, known as psychological assessment, is generally conducted by a mental health professional. This process aids in diagnosing mental health conditions, identifying suitable treatment options, and assessing progress during treatment. Currently, national health systems are unable to cope with the constantly growing demand for such services. To address and expedite the diagnosis process, this study suggests an AI-powered tool capable of delivering understandable predictions through the automated processing of the captured speech signals. To this end, we employed a Siamese neural network (SNN) elaborating on standardized speech representations free of domain expert knowledge. Such an SNN-based framework is able to address multiple downstream tasks using the same latent representation. Interestingly, it has been applied both for classifying speech depression as well as assessing its severity. After extensive experiments on a publicly available dataset following a standardized protocol, it is shown to significantly outperform the state of the art with respect to both tasks. Last but not least, the present solution offers interpretable predictions, while being able to meaningfully interact with the medical experts.

摘要

对个人心理健康和行为功能的评估,即心理评估,通常由心理健康专业人员进行。这一过程有助于诊断心理健康状况、确定合适的治疗方案以及评估治疗过程中的进展。目前,国家卫生系统无法应对对此类服务不断增长的需求。为了应对并加快诊断过程,本研究提出了一种人工智能驱动的工具,该工具能够通过对捕获的语音信号进行自动处理来提供可理解的预测。为此,我们采用了一种暹罗神经网络(SNN),它基于标准化的语音表示,无需领域专家知识。这种基于SNN的框架能够使用相同的潜在表示来处理多个下游任务。有趣的是,它已被应用于对语音抑郁进行分类以及评估其严重程度。在遵循标准化协议对一个公开可用的数据集进行广泛实验后,结果表明,在这两项任务上,该方法均显著优于现有技术。最后但同样重要的是,本解决方案提供可解释的预测,同时能够与医学专家进行有意义的互动。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1591/11499503/2df2bfd1c2d7/41666_2024_175_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1591/11499503/fb2d726004ee/41666_2024_175_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1591/11499503/37637343313a/41666_2024_175_Figa_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1591/11499503/1c2cd782b2e3/41666_2024_175_Figb_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1591/11499503/2a52d1bd4c18/41666_2024_175_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1591/11499503/4460c5e40d5e/41666_2024_175_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1591/11499503/2df2bfd1c2d7/41666_2024_175_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1591/11499503/fb2d726004ee/41666_2024_175_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1591/11499503/37637343313a/41666_2024_175_Figa_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1591/11499503/1c2cd782b2e3/41666_2024_175_Figb_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1591/11499503/2a52d1bd4c18/41666_2024_175_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1591/11499503/4460c5e40d5e/41666_2024_175_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1591/11499503/2df2bfd1c2d7/41666_2024_175_Fig4_HTML.jpg

相似文献

1
Siamese Neural Network for Speech-Based Depression Classification and Severity Assessment.用于基于语音的抑郁症分类和严重程度评估的暹罗神经网络。
J Healthc Inform Res. 2024 Oct 3;8(4):577-593. doi: 10.1007/s41666-024-00175-4. eCollection 2024 Dec.
2
Explainable Siamese Neural Network for Classifying Pediatric Respiratory Sounds.用于儿科呼吸音分类的可解释暹罗神经网络
IEEE J Biomed Health Inform. 2023 Oct;27(10):4728-4735. doi: 10.1109/JBHI.2023.3299341. Epub 2023 Oct 5.
3
Few-Shot Learning for Clinical Natural Language Processing Using Siamese Neural Networks: Algorithm Development and Validation Study.使用暹罗神经网络的临床自然语言处理少样本学习:算法开发与验证研究
JMIR AI. 2023 May 4;2:e44293. doi: 10.2196/44293.
4
Attention guided learnable time-domain filterbanks for speech depression detection.注意力引导可学习时域滤波器组用于语音抑郁检测。
Neural Netw. 2023 Aug;165:135-149. doi: 10.1016/j.neunet.2023.05.041. Epub 2023 May 26.
5
Letter to the Editor: CONVERGENCES AND DIVERGENCES IN THE ICD-11 VS. DSM-5 CLASSIFICATION OF MOOD DISORDERS.给编辑的信:《ICD-11 与 DSM-5 心境障碍分类的趋同与分歧》
Turk Psikiyatri Derg. 2021;32(4):293-295. doi: 10.5080/u26899.
6
Automated depression analysis using convolutional neural networks from speech.基于语音的卷积神经网络进行自动抑郁分析。
J Biomed Inform. 2018 Jul;83:103-111. doi: 10.1016/j.jbi.2018.05.007. Epub 2018 May 29.
7
Folic acid supplementation and malaria susceptibility and severity among people taking antifolate antimalarial drugs in endemic areas.在流行地区,服用抗叶酸抗疟药物的人群中,叶酸补充剂与疟疾易感性和严重程度的关系。
Cochrane Database Syst Rev. 2022 Feb 1;2(2022):CD014217. doi: 10.1002/14651858.CD014217.
8
A Siamese deep learning framework for efficient hardware Trojan detection using power side-channel data.一种用于使用功耗侧信道数据进行高效硬件木马检测的暹罗深度学习框架。
Sci Rep. 2024 Jun 6;14(1):13013. doi: 10.1038/s41598-024-62744-2.
9
CiwGAN and fiwGAN: Encoding information in acoustic data to model lexical learning with Generative Adversarial Networks.CiwGAN 和 fiwGAN:利用生成对抗网络将声学数据中的信息编码,以建模词汇学习。
Neural Netw. 2021 Jul;139:305-325. doi: 10.1016/j.neunet.2021.03.017. Epub 2021 Mar 19.
10
A Machine Learning Approach with Human-AI Collaboration for Automated Classification of Patient Safety Event Reports: Algorithm Development and Validation Study.一种人机协作的机器学习方法用于患者安全事件报告的自动分类:算法开发与验证研究
JMIR Hum Factors. 2024 Jan 25;11:e53378. doi: 10.2196/53378.

引用本文的文献

1
Species-independent analysis and identification of emotional animal vocalizations.与物种无关的动物情感发声分析与识别
Sci Rep. 2025 Aug 6;15(1):28828. doi: 10.1038/s41598-025-14323-2.

本文引用的文献

1
Explainable Siamese Neural Network for Classifying Pediatric Respiratory Sounds.用于儿科呼吸音分类的可解释暹罗神经网络
IEEE J Biomed Health Inform. 2023 Oct;27(10):4728-4735. doi: 10.1109/JBHI.2023.3299341. Epub 2023 Oct 5.
2
The applicability of the Beck Depression Inventory and Hamilton Depression Scale in the automatic recognition of depression based on speech signal processing.贝克抑郁量表和汉密尔顿抑郁量表在基于语音信号处理的抑郁症自动识别中的适用性。
Front Psychiatry. 2022 Aug 4;13:879896. doi: 10.3389/fpsyt.2022.879896. eCollection 2022.
3
The self on its axis: a framework for understanding depression.
自身轴心:理解抑郁的框架。
Transl Psychiatry. 2022 Jan 18;12(1):23. doi: 10.1038/s41398-022-01790-8.
4
End-to-end multimodal clinical depression recognition using deep neural networks: A comparative analysis.端到端使用深度神经网络进行多模态临床抑郁症识别:比较分析。
Comput Methods Programs Biomed. 2021 Nov;211:106433. doi: 10.1016/j.cmpb.2021.106433. Epub 2021 Sep 28.
5
Detection of Minor and Major Depression through Voice as a Biomarker Using Machine Learning.通过语音作为生物标志物利用机器学习检测轻度和重度抑郁症。
J Clin Med. 2021 Jul 8;10(14):3046. doi: 10.3390/jcm10143046.
6
A Survey on Explainable Artificial Intelligence (XAI): Toward Medical XAI.可解释人工智能(XAI)研究综述:迈向医学 XAI
IEEE Trans Neural Netw Learn Syst. 2021 Nov;32(11):4793-4813. doi: 10.1109/TNNLS.2020.3027314. Epub 2021 Oct 27.
7
Using speech recognition technology to investigate the association between timing-related speech features and depression severity.利用语音识别技术研究与时间相关的语音特征与抑郁严重程度之间的关联。
PLoS One. 2020 Sep 11;15(9):e0238726. doi: 10.1371/journal.pone.0238726. eCollection 2020.
8
Automated assessment of psychiatric disorders using speech: A systematic review.使用语音对精神疾病进行自动评估:一项系统综述。
Laryngoscope Investig Otolaryngol. 2020 Jan 31;5(1):96-116. doi: 10.1002/lio2.354. eCollection 2020 Feb.
9
Multimodal Depression Detection: Fusion of Electroencephalography and Paralinguistic Behaviors Using a Novel Strategy for Classifier Ensemble.多模态抑郁检测:使用分类器集成的新策略融合脑电图和副语言行为。
IEEE J Biomed Health Inform. 2019 Nov;23(6):2265-2275. doi: 10.1109/JBHI.2019.2938247. Epub 2019 Aug 29.
10
Speech databases for mental disorders: A systematic review.精神障碍语音数据库:一项系统综述。
Gen Psychiatr. 2019 Jul 22;32(3):e100022. doi: 10.1136/gpsych-2018-100022. eCollection 2019.