Departments of Radiology and Psychiatry, University of Colorado School of Medicine, Aurora, CO, United States.
Section on Neurobiology of Fear and Anxiety, National Institute of Mental Health, Bethesda, MD, United States.
Neuroimage Clin. 2019;21:101676. doi: 10.1016/j.nicl.2019.101676. Epub 2019 Jan 11.
Relapse rates are consistently high for stimulant user disorders. In order to obtain prognostic information about individuals in treatment, machine learning models have been applied to neuroimaging and clinical data. Yet few efforts have been made to test these models in independent samples or show that they can outperform linear models. In this exploratory study, we examine whether machine learning models relative to linear models provide greater predictive accuracy and less overfitting.
This longitudinal study included 63 methamphetamine-dependent (training sample) and 29 cocaine-dependent (test sample) individuals who completed an MRI scan during residential treatment. Linear and machine learning models predicting relapse at a one-year follow up that were previously developed in the methamphetamine-dependent sample using neuroimaging and clinical variables were applied to the cocaine-dependent sample. Receiver operating characteristic analysis was used to assess performance using area under the curve (AUC) as the primary outcome.
Twelve individuals in the cocaine-dependent sample remained abstinent, and 17 relapsed. The linear models produced more accurate prediction in the training sample than the machine learning models but showed reduced performance in the testing sample, with AUC decreasing by 0.18. The machine learning models produced similar predictive performance in the training and test samples, with AUC changing by 0.03. In the test sample, neither the linear nor the machine learning model predicted relapse at rates above chance.
Although machine learning algorithms may have advantages, in this study neither model's performance was sufficient to be clinically useful. In order to improve predictive models, stronger predictor variables and larger samples are needed.
兴奋剂使用者障碍的复发率一直很高。为了获得治疗中个体的预后信息,机器学习模型已被应用于神经影像学和临床数据。然而,很少有人努力在独立样本中测试这些模型,也没有证明它们可以优于线性模型。在这项探索性研究中,我们研究了机器学习模型相对于线性模型是否能提供更高的预测准确性和更少的过拟合。
这项纵向研究包括 63 名甲基苯丙胺依赖者(训练样本)和 29 名可卡因依赖者(测试样本),他们在住院治疗期间完成了磁共振成像扫描。先前在甲基苯丙胺依赖者样本中使用神经影像学和临床变量开发的预测一年后复发的线性和机器学习模型被应用于可卡因依赖者样本。使用曲线下面积(AUC)作为主要结果,通过接收者操作特征分析评估性能。
可卡因依赖者样本中有 12 人保持戒断,17 人复发。线性模型在训练样本中的预测比机器学习模型更准确,但在测试样本中的表现却有所下降,AUC 下降了 0.18。机器学习模型在训练和测试样本中产生了相似的预测性能,AUC 变化了 0.03。在测试样本中,无论是线性模型还是机器学习模型,都不能以高于机会的比率预测复发。
尽管机器学习算法可能具有优势,但在这项研究中,没有一种模型的性能足以具有临床意义。为了提高预测模型的性能,需要更强的预测变量和更大的样本。