Department of Psychology, Humboldt-Universität zu Berlin, Berlin, Germany.
Department of Psychology, Humboldt-Universität zu Berlin, Berlin, Germany; Zentrum für Psychotherapie, Humboldt-Universität zu Berlin, Berlin, Germany.
Behav Res Ther. 2020 Jan;124:103530. doi: 10.1016/j.brat.2019.103530. Epub 2019 Dec 16.
The availability of large-scale datasets and sophisticated machine learning tools enables developing models that predict treatment outcomes for individual patients. However, few studies used routinely available sociodemographic and clinical data for this task, and many previous investigations used highly selected samples. This study aimed to investigate cognitive behavioral therapy (CBT) outcomes in a large, naturalistic and longitudinal dataset. Routine data from a university-based outpatient center with n = 2.147 patients was analyzed. Only baseline data including sociodemographics, symptom measures and functional impairment ratings was used for prediction. Different competing classification and regression models were compared to each other; the best models were then applied to previously unseen validation data. Applied on the validation set, the best performing classification model for remission achieved a balanced accuracy of 59% (p < 0.001) and the best performing regression model for dimensional change achieved r = 0.27 (p < 0.001). Age, sex, functional impairment, symptom severity, and axis II comorbidity were among the most important features. Predictor performances significantly exceeded chance level but were far from clinical utility. Neither applying more sophisticated approaches nor restricting the sample to homogeneous subgroups resulted in considerable performance gains. Adding hypotheses-based, more specific clinical constructs and deep (e.g. neurobiological) to digital phenotypes may increase prediction performance.
大型数据集和复杂的机器学习工具的可用性使开发能够预测个体患者治疗结果的模型成为可能。然而,很少有研究使用常规可用的社会人口统计学和临床数据来完成这项任务,并且许多先前的研究使用了高度精选的样本。本研究旨在使用大型、自然和纵向数据集来调查认知行为疗法(CBT)的结果。分析了来自一个以大学为基础的门诊中心的常规数据,n=2147 名患者。仅使用包括社会人口统计学、症状测量和功能障碍评分在内的基线数据进行预测。将不同的竞争分类和回归模型相互比较;然后将最佳模型应用于以前未见的验证数据。在验证集上应用时,用于缓解的最佳分类模型的平衡准确性为 59%(p<0.001),用于维度变化的最佳回归模型的 r 值为 0.27(p<0.001)。年龄、性别、功能障碍、症状严重程度和轴 II 共病是最重要的特征之一。预测器的性能明显优于随机水平,但远未达到临床实用性。无论是应用更复杂的方法还是将样本限制在同质亚组,都不会导致性能显著提高。添加基于假设的、更具体的临床结构和数字表型的深度(例如神经生物学)可能会提高预测性能。