Hornstein Silvan, Forman-Hoffman Valerie, Nazander Albert, Ranta Kristian, Hilbert Kevin
Meru Health Inc, Palo Alto, CA, USA.
Department of Psychology, Humboldt-Universität zu Berlin, Berlin, Germany.
Digit Health. 2021 Nov 29;7:20552076211060659. doi: 10.1177/20552076211060659. eCollection 2021 Jan-Dec.
Predicting the outcomes of individual participants for treatment interventions appears central to making mental healthcare more tailored and effective. However, little work has been done to investigate the performance of machine learning-based predictions within digital mental health interventions. Therefore, this study evaluates the performance of machine learning in predicting treatment response in a digital mental health intervention designed for treating depression and anxiety.
Several algorithms were trained based on the data of 970 participants to predict a significant reduction in depression and anxiety symptoms using clinical and sociodemographic variables. As a random forest classifier performed best over cross-validation, it was used to predict the outcomes of 279 new participants.
The random forest achieved an accuracy of 0.71 for the test set (base rate: 0.67, area under curve (AUC): 0.60, = 0.001, balanced accuracy: 0.60). Additionally, predicted non-responders showed less average reduction of their Patient Health Questionnaire-9 (PHQ-9) (-2.7, = 0.004) and General Anxiety Disorder Screener-7 values (-3.7, < 0.001) compared to responders. Besides pre-treatment Patient Health Questionnaire-9 and General Anxiety Disorder Screener-7 values, the self-reported motivation, type of referral into the programme (self vs. healthcare provider) as well as Work Productivity and Activity Impairment Questionnaire items contributed most to the predictions.
This study provides evidence that social-demographic and clinical variables can be used for machine learning to predict therapy outcomes within the context of a therapist-supported digital mental health intervention. Despite the overall moderate performance, this appears promising as these predictions can potentially improve the outcomes of non-responders by monitoring their progress or by offering alternative or additional treatment.
预测个体参与者的治疗干预结果似乎是使心理保健更具针对性和有效性的核心。然而,在数字心理健康干预中,针对基于机器学习的预测性能的研究却很少。因此,本研究评估了机器学习在预测一种旨在治疗抑郁和焦虑的数字心理健康干预中的治疗反应方面的性能。
基于970名参与者的数据训练了几种算法,以使用临床和社会人口统计学变量来预测抑郁和焦虑症状的显著减轻。由于随机森林分类器在交叉验证中表现最佳,因此被用于预测279名新参与者的结果。
随机森林在测试集上的准确率为0.71(基线率:0.67,曲线下面积(AUC):0.60,P = 0.001,平衡准确率:0.60)。此外,与有反应者相比,预测的无反应者在患者健康问卷-9(PHQ-9)(-2.7,P = 0.004)和广泛性焦虑障碍筛查量表-7值(-3.7,P < 0.001)上的平均减轻幅度较小。除了治疗前的患者健康问卷-9和广泛性焦虑障碍筛查量表-7值外,自我报告的动机、进入该项目的转诊类型(自我转诊与医疗保健提供者转诊)以及工作效率和活动障碍问卷项目对预测的贡献最大。
本研究提供了证据,表明社会人口统计学和临床变量可用于机器学习,以在治疗师支持的数字心理健康干预背景下预测治疗结果。尽管总体表现中等,但这似乎很有前景,因为这些预测有可能通过监测无反应者的进展或提供替代或额外治疗来改善其治疗结果。