Leis Angela, Mayer Miguel-Angel, Ronzano Francesco, Torrens Marta, Castillo Claudio, Furlong Laura I, Sanz Ferran
Research Programme on Biomedical Informatics (GRIB), Hospital del Mar Medical Research Institute (IMIM) and Universitat Pompeu Fabra, Barcelona, Spain.
Institute of Neuropsychiatry and Addictions, IMIM, Barcelona, Spain.
Stud Health Technol Inform. 2020 Jun 16;270:921-925. doi: 10.3233/SHTI200296.
People use language to express their thoughts and feelings, unveiling important aspects of their psychological traits and social interactions. Although there are several studies describing methodologies to create a collection of words in English related to depression and other conditions, in most of them the selection of words is not clinical or expert based. The objective of this study is twofold: firstly, to introduce a comprehensive collection of Spanish words commonly used by patients suffering from depression, which will be available as a free open source for research purposes (GitHub), and secondly, to study the usefulness of this collection of words in identifying social media posts that could be indicative of patients suffering from depression. The level of agreement among medical doctors to determine the best words that should be used to select tweets related to depression was low. This finding may be due to the complexity of depression and the extraordinary diversity in the way people express themselves when describing their illness. It is critical to perform a thorough analysis of the specific language used in each condition, before deciding the best words to be used for filtering the tweets in each disease. As our study shows, the words supposedly more linked to depression are very common words used in other contexts, and consequently less specific for detecting depressive users. In addition, grammatical gender forms should be considered when analysing some languages such as Spanish.
人们使用语言来表达自己的思想和情感,揭示其心理特征和社会互动的重要方面。尽管有几项研究描述了创建与抑郁症及其他病症相关的英语词汇集的方法,但在大多数研究中,词汇的选择并非基于临床或专家意见。本研究的目的有两个:第一,引入抑郁症患者常用的西班牙语词汇综合集,该词汇集将作为免费开源资源供研究使用(GitHub);第二,研究该词汇集在识别可能表明抑郁症患者的社交媒体帖子方面的有用性。医生们在确定用于筛选与抑郁症相关推文的最佳词汇时,达成的共识程度较低。这一发现可能是由于抑郁症的复杂性以及人们在描述病情时表达自身的方式极为多样。在决定用于筛选每种疾病推文的最佳词汇之前,对每种病症中使用的特定语言进行全面分析至关重要。正如我们的研究所表明的,那些据称与抑郁症关联度更高的词汇是在其他语境中非常常见的词汇,因此在检测抑郁症患者时特异性较低。此外,在分析像西班牙语这样的一些语言时,应考虑语法性形式。