• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

使用语音样本进行深度学习对抑郁症的诊断准确性:系统评价和荟萃分析。

Diagnostic accuracy of deep learning using speech samples in depression: a systematic review and meta-analysis.

机构信息

Department of Population Health Sciences, School of Life Course and Population Sciences, Faculty of Life Sciences & Medicine, King's College London, London, SE1 1UL, United Kingdom.

Department of Intelligent Medical Engineering, School of Biomedical Engineering, Anhui Medical University, Hefei, 230032, China.

出版信息

J Am Med Inform Assoc. 2024 Oct 1;31(10):2394-2404. doi: 10.1093/jamia/ocae189.

DOI:10.1093/jamia/ocae189
PMID:39013193
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11413444/
Abstract

OBJECTIVE

This study aims to conduct a systematic review and meta-analysis of the diagnostic accuracy of deep learning (DL) using speech samples in depression.

MATERIALS AND METHODS

This review included studies reporting diagnostic results of DL algorithms in depression using speech data, published from inception to January 31, 2024, on PubMed, Medline, Embase, PsycINFO, Scopus, IEEE, and Web of Science databases. Pooled accuracy, sensitivity, and specificity were obtained by random-effect models. The diagnostic Precision Study Quality Assessment Tool (QUADAS-2) was used to assess the risk of bias.

RESULTS

A total of 25 studies met the inclusion criteria and 8 of them were used in the meta-analysis. The pooled estimates of accuracy, specificity, and sensitivity for depression detection models were 0.87 (95% CI, 0.81-0.93), 0.85 (95% CI, 0.78-0.91), and 0.82 (95% CI, 0.71-0.94), respectively. When stratified by model structure, the highest pooled diagnostic accuracy was 0.89 (95% CI, 0.81-0.97) in the handcrafted group.

DISCUSSION

To our knowledge, our study is the first meta-analysis on the diagnostic performance of DL for depression detection from speech samples. All studies included in the meta-analysis used convolutional neural network (CNN) models, posing problems in deciphering the performance of other DL algorithms. The handcrafted model performed better than the end-to-end model in speech depression detection.

CONCLUSIONS

The application of DL in speech provided a useful tool for depression detection. CNN models with handcrafted acoustic features could help to improve the diagnostic performance.

PROTOCOL REGISTRATION

The study protocol was registered on PROSPERO (CRD42023423603).

摘要

目的

本研究旨在对使用语音样本进行抑郁症诊断的深度学习(DL)的诊断准确性进行系统评价和荟萃分析。

材料与方法

本综述纳入了自 2024 年 1 月 31 日起,在 PubMed、Medline、Embase、PsycINFO、Scopus、IEEE 和 Web of Science 数据库中发表的使用语音数据报告 DL 算法在抑郁症中诊断结果的研究。使用随机效应模型获取汇总准确性、敏感性和特异性。采用诊断性 Precision 研究质量评估工具(QUADAS-2)评估偏倚风险。

结果

共有 25 项研究符合纳入标准,其中 8 项研究纳入荟萃分析。抑郁检测模型的汇总准确性、特异性和敏感性估计值分别为 0.87(95%CI,0.81-0.93)、0.85(95%CI,0.78-0.91)和 0.82(95%CI,0.71-0.94)。按模型结构分层时,手工组的最高汇总诊断准确性为 0.89(95%CI,0.81-0.97)。

讨论

据我们所知,本研究是关于使用语音样本进行抑郁症检测的 DL 诊断性能的首次荟萃分析。荟萃分析中纳入的所有研究均使用卷积神经网络(CNN)模型,这给其他 DL 算法的性能解读带来了问题。在语音抑郁检测中,手工模型的性能优于端到端模型。

结论

DL 在语音中的应用为抑郁检测提供了一种有用的工具。具有手工声学特征的 CNN 模型可以帮助提高诊断性能。

注册协议

本研究方案已在 PROSPERO(CRD42023423603)上注册。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/aa87/11413444/f0a2b47ae7fe/ocae189f6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/aa87/11413444/e4a70eefeefc/ocae189f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/aa87/11413444/d44a45d2f5ea/ocae189f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/aa87/11413444/e15cd1b7658b/ocae189f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/aa87/11413444/fb49397b37f0/ocae189f4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/aa87/11413444/232560331479/ocae189f5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/aa87/11413444/f0a2b47ae7fe/ocae189f6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/aa87/11413444/e4a70eefeefc/ocae189f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/aa87/11413444/d44a45d2f5ea/ocae189f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/aa87/11413444/e15cd1b7658b/ocae189f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/aa87/11413444/fb49397b37f0/ocae189f4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/aa87/11413444/232560331479/ocae189f5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/aa87/11413444/f0a2b47ae7fe/ocae189f6.jpg

相似文献

1
Diagnostic accuracy of deep learning using speech samples in depression: a systematic review and meta-analysis.使用语音样本进行深度学习对抑郁症的诊断准确性:系统评价和荟萃分析。
J Am Med Inform Assoc. 2024 Oct 1;31(10):2394-2404. doi: 10.1093/jamia/ocae189.
2
Machine and Deep Learning for Tuberculosis Detection on Chest X-Rays: Systematic Literature Review.基于 X 光片的结核病检测中的机器和深度学习:系统文献综述。
J Med Internet Res. 2023 Jul 3;25:e43154. doi: 10.2196/43154.
3
Deep learning algorithms for detection of diabetic retinopathy in retinal fundus photographs: A systematic review and meta-analysis.深度学习算法在眼底视网膜照片糖尿病性视网膜病变检测中的应用:系统评价和荟萃分析。
Comput Methods Programs Biomed. 2020 Jul;191:105320. doi: 10.1016/j.cmpb.2020.105320. Epub 2020 Jan 16.
4
Evaluation of accuracy of deep learning and conventional neural network algorithms in detection of dental implant type using intraoral radiographic images: A systematic review and meta-analysis.使用口腔内放射影像评估深度学习和传统神经网络算法在检测牙种植体类型中的准确性:一项系统评价和荟萃分析。
J Prosthet Dent. 2025 Jan;133(1):137-146. doi: 10.1016/j.prosdent.2023.11.030. Epub 2024 Jan 4.
5
Deep learning performance compared to healthcare experts in detecting wrist fractures from radiographs: A systematic review and meta-analysis.深度学习与医疗专家在从X光片中检测腕部骨折方面的性能比较:一项系统综述和荟萃分析。
Eur J Radiol. 2024 May;174:111399. doi: 10.1016/j.ejrad.2024.111399. Epub 2024 Feb 27.
6
Folic acid supplementation and malaria susceptibility and severity among people taking antifolate antimalarial drugs in endemic areas.在流行地区,服用抗叶酸抗疟药物的人群中,叶酸补充剂与疟疾易感性和严重程度的关系。
Cochrane Database Syst Rev. 2022 Feb 1;2(2022):CD014217. doi: 10.1002/14651858.CD014217.
7
Deep learning for detection of age-related macular degeneration: A systematic review and meta-analysis of diagnostic test accuracy studies.深度学习在年龄相关性黄斑变性检测中的应用:诊断性试验准确性研究的系统评价和荟萃分析。
PLoS One. 2023 Apr 6;18(4):e0284060. doi: 10.1371/journal.pone.0284060. eCollection 2023.
8
Diagnostic accuracy of depression questionnaires in adult patients with diabetes: A systematic review and meta-analysis.糖尿病成年患者抑郁问卷的诊断准确性:系统评价和荟萃分析。
PLoS One. 2019 Jun 20;14(6):e0218512. doi: 10.1371/journal.pone.0218512. eCollection 2019.
9
Artificial Intelligence in Ophthalmology: A Meta-Analysis of Deep Learning Models for Retinal Vessels Segmentation.眼科中的人工智能:视网膜血管分割深度学习模型的荟萃分析。
J Clin Med. 2020 Apr 3;9(4):1018. doi: 10.3390/jcm9041018.
10
Deep learning for classifying the stages of periodontitis on dental images: a systematic review and meta-analysis.深度学习在牙片牙周炎分期分类中的应用:系统评价与荟萃分析。
BMC Oral Health. 2023 Dec 19;23(1):1017. doi: 10.1186/s12903-023-03751-z.

本文引用的文献

1
Spectro-temporal acoustical markers differentiate speech from song across cultures.声谱-时间声学标记可跨文化区分言语和歌曲。
Nat Commun. 2024 Jun 6;15(1):4835. doi: 10.1038/s41467-024-49040-3.
2
Enhancing accuracy and privacy in speech-based depression detection through speaker disentanglement.通过说话人解缠提高基于语音的抑郁症检测的准确性和隐私性。
Comput Speech Lang. 2024 Jun;86. doi: 10.1016/j.csl.2023.101605. Epub 2023 Dec 26.
3
Attention guided learnable time-domain filterbanks for speech depression detection.注意力引导可学习时域滤波器组用于语音抑郁检测。
Neural Netw. 2023 Aug;165:135-149. doi: 10.1016/j.neunet.2023.05.041. Epub 2023 May 26.
4
Classification of Depression and Its Severity Based on Multiple Audio Features Using a Graphical Convolutional Neural Network.基于图形卷积神经网络的多音频特征对抑郁的分类及其严重程度
Int J Environ Res Public Health. 2023 Jan 15;20(2):1588. doi: 10.3390/ijerph20021588.
5
Depression recognition using a proposed speech chain model fusing speech production and perception features.基于语音产生和感知特征融合的语音链模型进行抑郁识别。
J Affect Disord. 2023 Feb 15;323:299-308. doi: 10.1016/j.jad.2022.11.060. Epub 2022 Nov 30.
6
A Step Towards Preserving Speakers' Identity While Detecting Depression Via Speaker Disentanglement.通过说话人解缠在检测抑郁症时迈向保留说话人身份的一步。
Interspeech. 2022 Sep;2022:3338-3342. doi: 10.21437/interspeech.2022-10798.
7
Privacy-preserving Speech-based Depression Diagnosis via Federated Learning.基于联邦学习的隐私保护语音抑郁诊断。
Annu Int Conf IEEE Eng Med Biol Soc. 2022 Jul;2022:1371-1374. doi: 10.1109/EMBC48229.2022.9871861.
8
Vocal Acoustic Features as Potential Biomarkers for Identifying/Diagnosing Depression: A Cross-Sectional Study.嗓音声学特征作为识别/诊断抑郁症的潜在生物标志物:一项横断面研究。
Front Psychiatry. 2022 Apr 28;13:815678. doi: 10.3389/fpsyt.2022.815678. eCollection 2022.
9
A multi-modal open dataset for mental-disorder analysis.多模态开放精神障碍分析数据集。
Sci Data. 2022 Apr 19;9(1):178. doi: 10.1038/s41597-022-01211-x.
10
An insight into diagnosis of depression using machine learning techniques: a systematic review.利用机器学习技术进行抑郁症诊断的研究进展:系统综述。
Curr Med Res Opin. 2022 May;38(5):749-771. doi: 10.1080/03007995.2022.2038487. Epub 2022 Feb 17.