Arioz Umut, Smrke Urška, Plohl Nejc, Mlakar Izidor
Faculty of Electrical Engineering and Computer Science, The University of Maribor, 2000 Maribor, Slovenia.
Department of Psychology, Faculty of Arts, The University of Maribor, 2000 Maribor, Slovenia.
Diagnostics (Basel). 2022 Nov 3;12(11):2683. doi: 10.3390/diagnostics12112683.
Depression is a prevalent comorbidity in patients with severe physical disorders, such as cancer, stroke, and coronary diseases. Although it can significantly impact the course of the primary disease, the signs of depression are often underestimated and overlooked. The aim of this paper was to review algorithms for the automatic, uniform, and multimodal classification of signs of depression from human conversations and to evaluate their accuracy. For the scoping review, the PRISMA guidelines for scoping reviews were followed. In the scoping review, the search yielded 1095 papers, out of which 20 papers (8.26%) included more than two modalities, and 3 of those papers provided codes. Within the scope of this review, supported vector machine (SVM), random forest (RF), and long short-term memory network (LSTM; with gated and non-gated recurrent units) models, as well as different combinations of features, were identified as the most widely researched techniques. We tested the models using the DAIC-WOZ dataset (original training dataset) and using the SymptomMedia dataset to further assess their reliability and dependency on the nature of the training datasets. The best performance was obtained by the LSTM with gated recurrent units (F1-score of 0.64 for the DAIC-WOZ dataset). However, with a drop to an F1-score of 0.56 for the SymptomMedia dataset, the method also appears to be the most data-dependent.
抑郁症是癌症、中风和冠心病等严重躯体疾病患者中普遍存在的一种共病。尽管它会对原发性疾病的病程产生重大影响,但抑郁症的症状往往被低估和忽视。本文的目的是回顾用于从人类对话中自动、统一且多模态分类抑郁症症状的算法,并评估其准确性。对于范围综述,遵循了范围综述的PRISMA指南。在范围综述中,检索到1095篇论文,其中20篇(8.26%)包含两种以上模态,且其中3篇论文提供了代码。在本综述范围内,支持向量机(SVM)、随机森林(RF)和长短期记忆网络(LSTM;有门控和无门控循环单元)模型以及不同的特征组合被确定为研究最广泛的技术。我们使用DAIC-WOZ数据集(原始训练数据集)并使用SymptomMedia数据集对模型进行测试,以进一步评估它们的可靠性以及对训练数据集性质的依赖性。具有门控循环单元的LSTM取得了最佳性能(DAIC-WOZ数据集的F1分数为0.64)。然而,在SymptomMedia数据集上F1分数降至0.56,该方法似乎也是最依赖数据的。