• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

通过5分钟在线收集的语音检测精神分裂症、双相情感障碍、精神病易感性和重度抑郁症。

Detecting schizophrenia, bipolar disorder, psychosis vulnerability and major depressive disorder from 5 minutes of online-collected speech.

作者信息

Olah Julianna, Wong Win Lee Edwin, Chaudhry Atta-Ul Raheem Rana, Mena Omar, Tang Sunny X

机构信息

Psyrin Ltd. London, UK.

Psychiatry Research, Feinstein Institutes for Medical Research.

出版信息

medRxiv. 2024 Sep 4:2024.09.03.24313020. doi: 10.1101/2024.09.03.24313020.

DOI:10.1101/2024.09.03.24313020
PMID:39281747
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11398428/
Abstract

BACKGROUND

Psychosis poses substantial social and healthcare burdens. The analysis of speech is a promising approach for the diagnosis and monitoring of psychosis, capturing symptoms like thought disorder and flattened affect. Recent advancements in Natural Language Processing (NLP) methodologies enable the automated extraction of informative speech features, which has been leveraged for early psychosis detection and assessment of symptomology. However, critical gaps persist, including the absence of standardized sample collection protocols, small sample sizes, and a lack of multi-illness classification, limiting clinical applicability. Our study aimed to (1) identify an optimal assessment approach for the online and remote collection of speech, in the context of assessing the psychosis spectrum and evaluate whether a fully automated, speech-based machine learning (ML) pipeline can discriminate among different conditions on the schizophrenia-bipolar spectrum (SSD-BD-SPE), help-seeking comparison subjects (MDD), and healthy controls (HC) at varying layers of analysis and diagnostic complexity.

METHODS

We adopted online data collection methods to collect 20 minutes of speech and demographic information from individuals. Participants were categorized as "healthy" help-seekers (HC), having a schizophrenia-spectrum disorder (SSD), bipolar disorder (BD), major depressive disorder (MDD), or being on the psychosis spectrum with sub-clinical psychotic experiences (SPE). SPE status was determined based on self-reported clinical diagnosis and responses to the PHQ-8 and PQ-16 screening questionnaires, while other diagnoses were determined based on self-report from participants. Linguistic and paralinguistic features were extracted and ensemble learning algorithms (e.g., XGBoost) were used to train models. A 70%-30% train-test split and 30-fold cross-validation was used to validate the model performance.

RESULTS

The final analysis sample included 1140 individuals and 22,650 minutes of speech. Using 5-minutes of speech, our model could discriminate between HC and those with a serious mental illness (SSD or BD) with 86% accuracy (AUC = 0.91, Recall = 0.7, Precision = 0.98). Furthermore, our model could discern among HC, SPE, BD and SSD groups with 86% accuracy (F1 macro = 0.855, Recall Macro = 0.86, Precision Macro = 0.86). Finally, in a 5-class discrimination task including individuals with MDD, our model had 76% accuracy (F1 macro = 0.757, Recall Macro = 0.758, Precision Macro = 0.766).

CONCLUSION

Our ML pipeline demonstrated disorder-specific learning, achieving excellent or good accuracy across several classification tasks. We demonstrated that the screening of mental disorders is possible via a fully automated, remote speech assessment pipeline. We tested our model on relatively high number conditions (5 classes) in the literature and in a stratified sample of psychosis spectrum, including HC, SPE, SSD and BD (4 classes). We tested our model on a large sample (N = 1150) and demonstrated best-in-class accuracy with remotely collected speech data in the psychosis spectrum, however, further clinical validation is needed to test the reliability of model performance.

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8eb1/11398428/0134438d82c9/nihpp-2024.09.03.24313020v1-f0004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8eb1/11398428/f14f51f15056/nihpp-2024.09.03.24313020v1-f0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8eb1/11398428/8cd583943f8c/nihpp-2024.09.03.24313020v1-f0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8eb1/11398428/fdd951123a07/nihpp-2024.09.03.24313020v1-f0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8eb1/11398428/0134438d82c9/nihpp-2024.09.03.24313020v1-f0004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8eb1/11398428/f14f51f15056/nihpp-2024.09.03.24313020v1-f0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8eb1/11398428/8cd583943f8c/nihpp-2024.09.03.24313020v1-f0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8eb1/11398428/fdd951123a07/nihpp-2024.09.03.24313020v1-f0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8eb1/11398428/0134438d82c9/nihpp-2024.09.03.24313020v1-f0004.jpg
摘要

背景

精神病带来了巨大的社会和医疗负担。言语分析是一种很有前景的精神病诊断和监测方法,能够捕捉思维紊乱和平淡情感等症状。自然语言处理(NLP)方法的最新进展使得能够自动提取信息性言语特征,这已被用于早期精神病检测和症状学评估。然而,关键差距仍然存在,包括缺乏标准化的样本收集方案、样本量小以及缺乏多病种分类,限制了临床适用性。我们的研究旨在:(1)在评估精神病谱系的背景下,确定一种用于在线和远程收集言语的最佳评估方法,并评估基于言语的全自动机器学习(ML)流程能否在不同分析层和诊断复杂性水平上区分精神分裂症 - 双相谱系(SSD - BD - SPE)中的不同病症、寻求帮助的对照受试者(MDD)和健康对照(HC)。

方法

我们采用在线数据收集方法,从个体收集20分钟的言语和人口统计学信息。参与者被分类为“健康”寻求帮助者(HC)、患有精神分裂症谱系障碍(SSD)、双相情感障碍(BD)、重度抑郁症(MDD)或处于伴有亚临床精神病体验(SPE)的精神病谱系中。SPE状态根据自我报告的临床诊断以及对PHQ - 8和PQ - 16筛查问卷的回答来确定,而其他诊断则根据参与者的自我报告来确定。提取语言和副语言特征,并使用集成学习算法(如XGBoost)训练模型。采用70% - 30%的训练 - 测试分割和30折交叉验证来验证模型性能。

结果

最终分析样本包括1140名个体和22650分钟的言语。使用5分钟的言语记录,我们的模型能够以86%的准确率区分HC和患有严重精神疾病(SSD或BD)的个体(AUC = 0.91,召回率 = 0.7,精确率 = 0.98)。此外,我们的模型能够以86%的准确率区分HC、SPE、BD和SSD组(F1宏 = 0.855,召回率宏 = 0.86,精确率宏 = 0.86)。最后,在包括MDD个体在内的5类判别任务中,我们的模型准确率为76%(F1宏 = 0.757,召回率宏 = 0.758,精确率宏 = 0.766)。

结论

我们的ML流程展示了针对特定病症的学习能力,在多个分类任务中实现了优异或良好的准确率。我们证明了通过全自动远程言语评估流程进行精神障碍筛查是可行的。我们在文献中相对较多的病症情况(5类)以及包括HC、SPE、SSD和BD(4类)的精神病谱系分层样本上测试了我们的模型。我们在大样本(N = 1150)上测试了我们的模型,并在精神病谱系中通过远程收集的言语数据展示了一流的准确率,然而,需要进一步的临床验证来测试模型性能的可靠性。

相似文献

1
Detecting schizophrenia, bipolar disorder, psychosis vulnerability and major depressive disorder from 5 minutes of online-collected speech.通过5分钟在线收集的语音检测精神分裂症、双相情感障碍、精神病易感性和重度抑郁症。
medRxiv. 2024 Sep 4:2024.09.03.24313020. doi: 10.1101/2024.09.03.24313020.
2
Detecting schizophrenia, bipolar disorder, psychosis vulnerability and major depressive disorder from 5 minutes of online-collected speech.通过5分钟在线收集的语音检测精神分裂症、双相情感障碍、精神病易感性和重度抑郁症。
Transl Psychiatry. 2025 Jul 12;15(1):241. doi: 10.1038/s41398-025-03433-0.
3
Signs and symptoms to determine if a patient presenting in primary care or hospital outpatient settings has COVID-19.在基层医疗机构或医院门诊环境中,如果患者出现以下症状和体征,可判断其是否患有 COVID-19。
Cochrane Database Syst Rev. 2022 May 20;5(5):CD013665. doi: 10.1002/14651858.CD013665.pub3.
4
A New Measure of Quantified Social Health Is Associated With Levels of Discomfort, Capability, and Mental and General Health Among Patients Seeking Musculoskeletal Specialty Care.一种新的量化社会健康指标与寻求肌肉骨骼专科护理的患者的不适程度、能力以及心理和总体健康水平相关。
Clin Orthop Relat Res. 2025 Apr 1;483(4):647-663. doi: 10.1097/CORR.0000000000003394. Epub 2025 Feb 5.
5
Omega-3 fatty acids for depression in adults.成人抑郁症的ω-3脂肪酸治疗
Cochrane Database Syst Rev. 2015 Nov 5;2015(11):CD004692. doi: 10.1002/14651858.CD004692.pub4.
6
Sertindole for schizophrenia.用于治疗精神分裂症的舍吲哚。
Cochrane Database Syst Rev. 2005 Jul 20;2005(3):CD001715. doi: 10.1002/14651858.CD001715.pub2.
7
The effect of sample site and collection procedure on identification of SARS-CoV-2 infection.样本采集部位和采集程序对严重急性呼吸综合征冠状病毒2(SARS-CoV-2)感染鉴定的影响。
Cochrane Database Syst Rev. 2024 Dec 16;12(12):CD014780. doi: 10.1002/14651858.CD014780.
8
The Black Book of Psychotropic Dosing and Monitoring.《精神药物剂量与监测黑皮书》
Psychopharmacol Bull. 2024 Jul 8;54(3):8-59.
9
Survivor, family and professional experiences of psychosocial interventions for sexual abuse and violence: a qualitative evidence synthesis.性虐待和暴力的心理社会干预的幸存者、家庭和专业人员的经验:定性证据综合。
Cochrane Database Syst Rev. 2022 Oct 4;10(10):CD013648. doi: 10.1002/14651858.CD013648.pub2.
10
Pramipexole in addition to mood stabilisers for treatment-resistant bipolar depression: the PAX-BD randomised double-blind placebo-controlled trial.普拉克索联合心境稳定剂治疗难治性双相抑郁:PAX - BD随机双盲安慰剂对照试验
Health Technol Assess. 2025 May;29(21):1-216. doi: 10.3310/HBFC1953.

本文引用的文献

1
Linguistic findings in persons with schizophrenia-a review of the current literature.精神分裂症患者的语言研究结果——当前文献综述
Front Psychol. 2023 Nov 21;14:1287706. doi: 10.3389/fpsyg.2023.1287706. eCollection 2023.
2
High Predictive Accuracy of Negative Schizotypy With Acoustic Measures.阴性分裂型人格特质与声学测量的高预测准确性。
Clin Psychol Sci. 2022 Mar;10(2):310-323. doi: 10.1177/21677026211017835. Epub 2021 Jun 23.
3
Relative importance of speech and voice features in the classification of schizophrenia and depression.
言语和嗓音特征在精神分裂症和抑郁症分类中的相对重要性。
Transl Psychiatry. 2023 Sep 19;13(1):298. doi: 10.1038/s41398-023-02594-0.
4
Multilingual markers of depression in remotely collected speech samples: A preliminary analysis.远程采集语音样本中抑郁的多语言标志物:初步分析。
J Affect Disord. 2023 Nov 15;341:128-136. doi: 10.1016/j.jad.2023.08.097. Epub 2023 Aug 18.
5
Exploring the ability of vocal biomarkers in distinguishing depression from bipolar disorder, schizophrenia, and healthy controls.探索嗓音生物标志物在区分抑郁症与双相情感障碍、精神分裂症及健康对照方面的能力。
Front Psychiatry. 2023 Jul 20;14:1079448. doi: 10.3389/fpsyt.2023.1079448. eCollection 2023.
6
Syntactic complexity and diversity of spontaneous speech production in schizophrenia spectrum and major depressive disorders.精神分裂症谱系障碍和重度抑郁症患者自发言语产生的句法复杂性和多样性。
Schizophrenia (Heidelb). 2023 May 29;9(1):35. doi: 10.1038/s41537-023-00359-8.
7
Brain Structural Network Connectivity of Formal Thought Disorder Dimensions in Affective and Psychotic Disorders.情感障碍和精神障碍中形式思维障碍维度的脑结构网络连通性
Biol Psychiatry. 2024 Apr 1;95(7):629-638. doi: 10.1016/j.biopsych.2023.05.010. Epub 2023 May 18.
8
Candidate biomarkers in psychiatric disorders: state of the field.精神疾病中的候选生物标志物:该领域的现状。
World Psychiatry. 2023 Jun;22(2):236-262. doi: 10.1002/wps.21078.
9
Speech as a Graph: Developmental Perspectives on the Organization of Spoken Language.言语图谱:口语语言组织的发展视角。
Biol Psychiatry Cogn Neurosci Neuroimaging. 2023 Oct;8(10):985-993. doi: 10.1016/j.bpsc.2023.04.004. Epub 2023 Apr 20.
10
Online speech assessment of the psychotic spectrum: Exploring the relationship between overlapping acoustic markers of schizotypy, depression and anxiety.在线精神病谱的言语评估:探索分裂型特质、抑郁和焦虑重叠声学标记之间的关系。
Schizophr Res. 2023 Sep;259:11-19. doi: 10.1016/j.schres.2023.03.044. Epub 2023 Apr 18.