Pérez Anxo, Parapar Javier, Barreiro Álvaro
Information Retrieval Lab, CITIC, Universidade da Coruña, Campus de Elviña s/n, 15071 A Coruña, Spain.
Artif Intell Med. 2022 Oct;132:102380. doi: 10.1016/j.artmed.2022.102380. Epub 2022 Aug 24.
Depression is one of the most common mental health illnesses. The biggest obstacle lies in an efficient and early detection of the disorder. Self-report questionnaires are the instruments used by medical experts to elaborate a diagnosis. These questionnaires were designed by analyzing different depressive symptoms. However, factors such as social stigmas negatively affect the success of traditional methods. This paper presents a novel approach for automatically estimating the degree of depression in social media users. In this regard, we addressed the task Measuring the Severity of the Signs of Depression of eRisk 2020, an initiative in the CLEF Conference. We aimed to explore neural language models to exploit different aspects of the subject's writings depending on the symptom to capture. We devised two distinct methods based on the symptoms' sensitivity in terms of willingness on commenting about them publicly. The first exploits users' general language based on their publications. The second seeks more direct evidence from publications that specifically mention the symptoms concerns. Both methods automatically estimate the Beck Depression Inventory (BDI-II) total score. For evaluating our proposals, we used benchmark Reddit data for depression severity estimation. Our findings showed that approaches based on neural language models are a feasible alternative for estimating depression rating scales, even when small amounts of training data are available.
抑郁症是最常见的心理健康疾病之一。最大的障碍在于对该疾病进行有效且早期的检测。自我报告问卷是医学专家用于做出诊断的工具。这些问卷是通过分析不同的抑郁症状而设计的。然而,诸如社会 stigma 等因素会对传统方法的成功产生负面影响。本文提出了一种自动估计社交媒体用户抑郁程度的新方法。在这方面,我们解决了“测量 eRisk 2020 抑郁症状严重程度”这一任务,这是 CLEF 会议中的一项倡议。我们旨在探索神经语言模型,以便根据要捕捉的症状来利用受试者写作的不同方面。我们根据症状在公开谈论它们的意愿方面的敏感性设计了两种不同的方法。第一种方法基于用户的出版物来利用其通用语言。第二种方法从专门提及症状相关内容的出版物中寻找更直接的证据。两种方法都能自动估计贝克抑郁量表(BDI-II)的总分。为了评估我们的提议,我们使用了用于抑郁严重程度估计的基准 Reddit 数据。我们的研究结果表明,基于神经语言模型的方法是估计抑郁评定量表的一种可行替代方法,即使只有少量的训练数据可用。