Department of Health Policy and Management, Harvard T.H. Chan School of Public Health, Boston, Massachusetts, USA.
Wellframe, Inc., Boston, Massachusetts, USA.
Popul Health Manag. 2020 Aug;23(4):319-325. doi: 10.1089/pop.2019.0132. Epub 2019 Nov 25.
Digital care management programs can reduce health care costs and improve quality of care. However, it is unclear how to target patients who are most likely to benefit from these programs ex ante, a shortcoming of current "risk score"-based approaches across many interventions. This study explores a framework to define impactability by using machine learning (ML) models to identify those patients most likely to benefit from a digital health intervention for care management. Anonymized insurance claims data were used from a commercially insured population across several US states and combined with inferred sociodemographic data. The approach involves creating 2 models and the comparative analysis of the methodologies and performances therein. The authors first train a cost prediction model to calculate the differences in predicted (without intervention) versus actual (with onboarding onto digital health platform) health care expenditures for patients (N 5600). This enables classification impactability if differences in predicted versus actual costs meet a predetermined threshold. Several random forest and logistic regression machine learning models were then trained to accurately categorize new patients as impactable versus not impactable. These parameters are modified through grid search to define the parameters that deliver optimal performance, reaching an overall sensitivity of 0.77 and specificity of 0.65 among all models. This approach shows that impactability for a digital health intervention can be successfully defined using ML methods, thus enabling efficient allocation of resources. This framework is generalizable to analyzing impactability of any intervention and can contribute to realizing closed-loop feedback systems for continuous improvement in health care.
数字护理管理程序可以降低医疗保健成本并提高护理质量。然而,目前许多干预措施都基于“风险评分”,不清楚如何针对最有可能从这些程序中受益的患者进行预先定位,这是一个缺点。本研究探讨了一种通过使用机器学习 (ML) 模型来确定最有可能从数字健康干预措施中受益于护理管理的患者的定义可影响性的框架。使用来自美国多个州的商业保险人群的匿名保险索赔数据,并结合推断的社会人口统计学数据。该方法涉及创建 2 个模型,并对其中的方法和性能进行比较分析。作者首先训练一个成本预测模型,以计算患者(N=5600)的预测(无干预)与实际(加入数字健康平台)医疗支出之间的差异。如果预测成本与实际成本之间的差异达到预定阈值,则可以进行分类可影响性。然后,训练了几个随机森林和逻辑回归机器学习模型,以准确地将新患者分类为可影响性与不可影响性。通过网格搜索修改这些参数,以定义提供最佳性能的参数,所有模型的总体灵敏度为 0.77,特异性为 0.65。该方法表明,使用 ML 方法可以成功定义数字健康干预措施的可影响性,从而能够有效地分配资源。该框架可推广到分析任何干预措施的可影响性,并有助于实现闭环反馈系统,以持续改进医疗保健。