Shin Daun, Kim Kyungdo, Lee Seung-Bo, Lee Changwoo, Bae Ye Seul, Cho Won Ik, Kim Min Ji, Hyung Keun Park C, Chie Eui Kyu, Kim Nam Soo, Ahn Yong Min
Department of Neuropsychiatry, Seoul National University Hospital, Seoul, South Korea.
Transdisciplinary Department of Medicine & Advanced Technology, Seoul National University Hospital, Seoul, South Korea.
Front Psychiatry. 2022 May 24;13:801301. doi: 10.3389/fpsyt.2022.801301. eCollection 2022.
Depression and suicide are critical social problems worldwide, but tools to objectively diagnose them are lacking. Therefore, this study aimed to diagnose depression through machine learning and determine whether it is possible to identify groups at high risk of suicide through words spoken by the participants in a semi-structured interview.
A total of 83 healthy and 83 depressed patients were recruited. All participants were recorded during the Mini-International Neuropsychiatric Interview. Through the suicide risk assessment from the interview items, participants with depression were classified into high-suicide-risk (31 participants) and low-suicide-risk (52 participants) groups. The recording was transcribed into text after only the words uttered by the participant were extracted. In addition, all participants were evaluated for depression, anxiety, suicidal ideation, and impulsivity. The chi-square test and student's -test were used to compare clinical variables, and the Naive Bayes classifier was used for the machine learning text model.
A total of 21,376 words were extracted from all participants and the model for diagnosing patients with depression based on this text confirmed an area under the curve (AUC) of 0.905, a sensitivity of 0.699, and a specificity of 0.964. In the model that distinguished the two groups using statistically significant demographic variables, the AUC was only 0.761. The DeLong test result (-value 0.001) confirmed that the text-based classification was superior to the demographic model. When predicting the high-suicide-risk group, the demographics-based AUC was 0.499, while the text-based one was 0.632. However, the AUC of the ensemble model incorporating demographic variables was 0.800.
The possibility of diagnosing depression using interview text was confirmed; regarding suicide risk, the diagnosis accuracy increased when demographic variables were incorporated. Therefore, participants' words during an interview show significant potential as an objective and diagnostic marker through machine learning.
抑郁症和自杀是全球范围内严峻的社会问题,但缺乏客观诊断它们的工具。因此,本研究旨在通过机器学习诊断抑郁症,并确定是否有可能通过参与者在半结构化访谈中所说的话来识别自杀高风险群体。
共招募了83名健康参与者和83名抑郁症患者。在进行迷你国际神经精神病学访谈期间,对所有参与者进行了录音。通过访谈项目中的自杀风险评估,将抑郁症患者分为高自杀风险组(31名参与者)和低自杀风险组(52名参与者)。仅提取参与者说出的话后,将录音转录为文本。此外,对所有参与者进行了抑郁、焦虑、自杀意念和冲动性评估。使用卡方检验和学生t检验比较临床变量,并将朴素贝叶斯分类器用于机器学习文本模型。
从所有参与者中总共提取了21376个单词,基于此文本诊断抑郁症患者的模型确认曲线下面积(AUC)为0.905,敏感性为0.699,特异性为0.964。在使用具有统计学意义的人口统计学变量区分两组的模型中,AUC仅为0.761。德龙检验结果(p值0.001)证实基于文本的分类优于人口统计学模型。在预测高自杀风险组时,基于人口统计学的AUC为0.499,而基于文本的AUC为0.632。然而,纳入人口统计学变量的集成模型的AUC为0.800。
证实了使用访谈文本诊断抑郁症的可能性;关于自杀风险,纳入人口统计学变量时诊断准确性提高。因此,参与者在访谈中的话语通过机器学习显示出作为客观诊断标志物的巨大潜力。