Hur Jihyun K, Heffner Joseph, Feng Gloria W, Joormann Jutta, Rutledge Robb B
Department of Psychology, Yale University, New Haven, CT 06510.
Department of Psychiatry, Yale University, New Haven, CT 06511.
Proc Natl Acad Sci U S A. 2024 Sep 24;121(39):e2321321121. doi: 10.1073/pnas.2321321121. Epub 2024 Sep 16.
The prevalence of depression is a major societal health concern, and there is an ongoing need to develop tools that predict who will become depressed. Past research suggests that depression changes the language we use, but it is unclear whether language is predictive of worsening symptoms. Here, we test whether the sentiment of brief written linguistic responses predicts changes in depression. Across two studies ( = 467), participants provided responses to neutral open-ended questions, narrating aspects of their lives relevant to depression (e.g., mood, motivation, sleep). Participants also completed the Patient Health Questionnaire (PHQ-9) to assess depressive symptoms and a risky decision-making task with periodic measurements of momentary happiness to quantify mood dynamics. The sentiment of written responses was evaluated by human raters ( = 470), Large Language Models (LLMs; ChatGPT 3.5 and 4.0), and the Linguistic Inquiry and Word Count (LIWC) tool. We found that language sentiment evaluated by human raters and LLMs, but not LIWC, predicted changes in depressive symptoms at a three-week follow-up. Using computational modeling, we found that language sentiment was associated with current mood, but language sentiment predicted symptom changes even after controlling for current mood. In summary, we demonstrate a scalable tool that combines brief written responses with sentiment analysis by AI tools that matches human performance in the prediction of future psychiatric symptoms.
抑郁症的患病率是一个重大的社会健康问题,因此持续需要开发能够预测谁会患上抑郁症的工具。过去的研究表明,抑郁症会改变我们使用的语言,但尚不清楚语言是否能预测症状的恶化。在此,我们测试简短书面语言回应的情感是否能预测抑郁症的变化。在两项研究(N = 467)中,参与者对中性开放式问题做出回应,叙述其生活中与抑郁症相关的方面(如情绪、动机、睡眠)。参与者还完成了患者健康问卷(PHQ - 9)以评估抑郁症状,并完成一项风险决策任务,定期测量瞬间幸福感以量化情绪动态。书面回应的情感由人工评分者(N = 470)、大语言模型(LLMs;ChatGPT 3.5和4.0)以及语言调查与字数统计(LIWC)工具进行评估。我们发现,人工评分者和大语言模型评估的语言情感而非LIWC能预测三周随访时抑郁症状的变化。通过计算建模,我们发现语言情感与当前情绪相关,但即使在控制了当前情绪之后,语言情感仍能预测症状变化。总之,我们展示了一种可扩展的工具,该工具将简短书面回应与人工智能工具的情感分析相结合,在预测未来精神症状方面与人类表现相当。