Department of Pharmaceutical Outcomes & Policy, College of Pharmacy, University of Florida, Gainesville.
Department of Mathematics, University of Arizona, Tucson.
JAMA Netw Open. 2019 Mar 1;2(3):e190968. doi: 10.1001/jamanetworkopen.2019.0968.
Current approaches to identifying individuals at high risk for opioid overdose target many patients who are not truly at high risk.
To develop and validate a machine-learning algorithm to predict opioid overdose risk among Medicare beneficiaries with at least 1 opioid prescription.
DESIGN, SETTING, AND PARTICIPANTS: A prognostic study was conducted between September 1, 2017, and December 31, 2018. Participants (n = 560 057) included fee-for-service Medicare beneficiaries without cancer who filled 1 or more opioid prescriptions from January 1, 2011, to December 31, 2015. Beneficiaries were randomly and equally divided into training, testing, and validation samples.
Potential predictors (n = 268), including sociodemographics, health status, patterns of opioid use, and practitioner-level and regional-level factors, were measured in 3-month windows, starting 3 months before initiating opioids until loss of follow-up or the end of observation.
Opioid overdose episodes from inpatient and emergency department claims were identified. Multivariate logistic regression (MLR), least absolute shrinkage and selection operator-type regression (LASSO), random forest (RF), gradient boosting machine (GBM), and deep neural network (DNN) were applied to predict overdose risk in the subsequent 3 months after initiation of treatment with prescription opioids. Prediction performance was assessed using the C statistic and other metrics (eg, sensitivity, specificity, and number needed to evaluate [NNE] to identify one overdose). The Youden index was used to identify the optimized threshold of predicted score that balanced sensitivity and specificity.
Beneficiaries in the training (n = 186 686), testing (n = 186 685), and validation (n = 186 686) samples had similar characteristics (mean [SD] age of 68.0 [14.5] years, and approximately 63% were female, 82% were white, 35% had disabilities, 41% were dual eligible, and 0.60% had at least 1 overdose episode). In the validation sample, the DNN (C statistic = 0.91; 95% CI, 0.88-0.93) and GBM (C statistic = 0.90; 95% CI, 0.87-0.94) algorithms outperformed the LASSO (C statistic = 0.84; 95% CI, 0.80-0.89), RF (C statistic = 0.80; 95% CI, 0.75-0.84), and MLR (C statistic = 0.75; 95% CI, 0.69-0.80) methods for predicting opioid overdose. At the optimized sensitivity and specificity, DNN had a sensitivity of 92.3%, specificity of 75.7%, NNE of 542, positive predictive value of 0.18%, and negative predictive value of 99.9%. The DNN classified patients into low-risk (76.2% [142 180] of the cohort), medium-risk (18.6% [34 579] of the cohort), and high-risk (5.2% [9747] of the cohort) subgroups, with only 1 in 10 000 in the low-risk subgroup having an overdose episode. More than 90% of overdose episodes occurred in the high-risk and medium-risk subgroups, although positive predictive values were low, given the rare overdose outcome.
Machine-learning algorithms appear to perform well for risk prediction and stratification of opioid overdose, especially in identifying low-risk subgroups that have minimal risk of overdose.
目前用于识别阿片类药物过量高危人群的方法主要针对许多并非真正处于高危状态的患者。
开发和验证一种机器学习算法,以预测至少有 1 个阿片类药物处方的医疗保险受益人的阿片类药物过量风险。
设计、地点和参与者:这是一项预后研究,于 2017 年 9 月 1 日至 2018 年 12 月 31 日进行。参与者(n=560077)包括没有癌症的按服务付费医疗保险受益人,他们在 2011 年 1 月 1 日至 2015 年 12 月 31 日期间至少服用了 1 种阿片类药物处方。受益人被随机且平均分为训练、测试和验证样本。
从 3 个月前开始服用阿片类药物直到随访结束或观察结束,共测量了 268 个潜在预测因子(n=560077),包括社会人口统计学、健康状况、阿片类药物使用模式以及从业者和地区水平因素。
从住院和急诊部门的索赔中确定阿片类药物过量事件。采用多变量逻辑回归(MLR)、最小绝对收缩和选择运算符型回归(LASSO)、随机森林(RF)、梯度提升机(GBM)和深度神经网络(DNN)预测随后 3 个月内开始使用处方阿片类药物后的过量风险。使用 C 统计量和其他指标(如敏感性、特异性和识别 1 例过量所需的评估人数[NNE])评估预测性能。使用约登指数确定平衡敏感性和特异性的最佳预测评分阈值。
在训练(n=186686)、测试(n=186685)和验证(n=186686)样本中,受益人的特征相似(平均[标准差]年龄为 68.0[14.5]岁,约 63%为女性,82%为白人,35%有残疾,41%为双重资格,0.60%至少有 1 次过量事件)。在验证样本中,DNN(C 统计量=0.91;95%CI,0.88-0.93)和 GBM(C 统计量=0.90;95%CI,0.87-0.94)算法优于 LASSO(C 统计量=0.84;95%CI,0.80-0.89)、RF(C 统计量=0.80;95%CI,0.75-0.84)和 MLR(C 统计量=0.75;95%CI,0.69-0.80)方法,用于预测阿片类药物过量。在优化的灵敏度和特异性下,DNN 的灵敏度为 92.3%,特异性为 75.7%,NNE 为 542,阳性预测值为 0.18%,阴性预测值为 99.9%。DNN 将患者分为低风险(队列的 76.2%[142180])、中风险(队列的 18.6%[34579])和高风险(队列的 5.2%[9747])亚组,低风险亚组中每 10000 人中只有 1 人发生过量事件。尽管阳性预测值较低,但大多数过量事件发生在高风险和中风险亚组中,因为过量结局罕见。
机器学习算法似乎在阿片类药物过量风险预测和分层方面表现良好,尤其是在识别风险最小的低风险亚组方面。