Tabesh Mona, Mirström Mariam, Böhme Rebecca Astrid, Lasota Marta, Javaherian Yousef, Agbotsoka-Guiter Thibaud, Sikström Sverker
University of Milan-Bicocca, Italy.
Lund University, Sweden.
J Anxiety Disord. 2025 Jun;112:103020. doi: 10.1016/j.janxdis.2025.103020. Epub 2025 Apr 16.
Major Depression (MD) and General Anxiety Disorder (GAD) are the most common mental health disorders, which typically are assessed quantitatively by rating scales such as PHQ-9 and GAD-7. However, recent advances in natural language processing (NLP) and machine learning (ML) have opened up the possibility of question-based computational language assessment (QCLA). Here we investigate how accurate open-ended questions, using descriptive keywords or autobiographical narratives, can discriminate between participants that self-reported diagnosis of depression and anxiety, or health control. The results show that both language and rating scale measures can discriminate well, however, autobiographical narratives discriminate best between healthy and anxiety (ϕ = 1.58), as well as healthy and depression (ϕ = 1.38). Descriptive keywords, and to a certain extent autobiographical narratives, also discriminate better than summed scores of GAD-7 and PHQ-9 (ϕ=0.80 in discrimination between anxiety and depression), but not when individual items of these scales were analyzed by ML (ϕ=0.86 and ϕ=0.91 in item-level analysis of PHQ-9 and GAD-7, respectively). Combining the scales consistently elevated the discrimination even more (ϕ=1.39 in comparison between depression and anxiety), both in item-level and sum-scores analyses. These results indicate that QCLA measures often, but not in all cases, are better than standardized rating scales for assessment of depression and anxiety. Implication of these findings for mental health assessments are discussed.
重度抑郁症(MD)和广泛性焦虑症(GAD)是最常见的心理健康障碍,通常通过PHQ - 9和GAD - 7等评定量表进行定量评估。然而,自然语言处理(NLP)和机器学习(ML)的最新进展开启了基于问题的计算语言评估(QCLA)的可能性。在此,我们研究使用描述性关键词或自传体叙述的开放式问题在区分自我报告患有抑郁症和焦虑症的参与者与健康对照者方面的准确性如何。结果表明,语言和评定量表测量都能很好地进行区分,然而,自传体叙述在区分健康与焦虑(ϕ = 1.58)以及健康与抑郁(ϕ = 1.38)方面表现最佳。描述性关键词以及在一定程度上自传体叙述,在区分焦虑和抑郁方面也比GAD - 7和PHQ - 9的总分表现更好(焦虑与抑郁区分时ϕ = 0.80),但当通过机器学习分析这些量表的单个项目时则不然(PHQ - 9和GAD - 7的项目水平分析中ϕ分别为0.86和0.91)。在项目水平和总分分析中,将这些量表结合起来能进一步提高区分度(抑郁与焦虑比较时ϕ = 1.39)。这些结果表明,QCLA测量在评估抑郁和焦虑时通常(但并非在所有情况下)比标准化评定量表更好。本文讨论了这些发现对心理健康评估的意义。