Suppr超能文献

预测医院收取的再入院费用:机器学习方法。

Predicting Readmission Charges Billed by Hospitals: Machine Learning Approach.

作者信息

Gopukumar Deepika, Ghoshal Abhijeet, Zhao Huimin

机构信息

Department of Health and Clinical Outcomes Research, School of Medicine, Saint Louis University, St.Louis, MO, United States.

Department of Business Administration, Gies College of Business, University of Illinois Urbana-Champaign, Champaign, IL, United States.

出版信息

JMIR Med Inform. 2022 Aug 30;10(8):e37578. doi: 10.2196/37578.

Abstract

BACKGROUND

The Centers for Medicare and Medicaid Services projects that health care costs will continue to grow over the next few years. Rising readmission costs contribute significantly to increasing health care costs. Multiple areas of health care, including readmissions, have benefited from the application of various machine learning algorithms in several ways.

OBJECTIVE

We aimed to identify suitable models for predicting readmission charges billed by hospitals. Our literature review revealed that this application of machine learning is underexplored. We used various predictive methods, ranging from glass-box models (such as regularization techniques) to black-box models (such as deep learning-based models).

METHODS

We defined readmissions as readmission with the same major diagnostic category (RSDC) and all-cause readmission category (RADC). For these readmission categories, 576,701 and 1,091,580 individuals, respectively, were identified from the Nationwide Readmission Database of the Healthcare Cost and Utilization Project by the Agency for Healthcare Research and Quality for 2013. Linear regression, lasso regression, elastic net, ridge regression, eXtreme gradient boosting (XGBoost), and a deep learning model based on multilayer perceptron (MLP) were the 6 machine learning algorithms we tested for RSDC and RADC through 10-fold cross-validation.

RESULTS

Our preliminary analysis using a data-driven approach revealed that within RADC, the subsequent readmission charge billed per patient was higher than the previous charge for 541,090 individuals, and this number was 319,233 for RSDC. The top 3 major diagnostic categories (MDCs) for such instances were the same for RADC and RSDC. The average readmission charge billed was higher than the previous charge for 21 of the MDCs in the case of RSDC, whereas it was only for 13 of the MDCs in RADC. We recommend XGBoost and the deep learning model based on MLP for predicting readmission charges. The following performance metrics were obtained for XGBoost: (1) RADC (mean absolute percentage error [MAPE]=3.121%; root mean squared error [RMSE]=0.414; mean absolute error [MAE]=0.317; root relative squared error [RRSE]=0.410; relative absolute error [RAE]=0.399; normalized RMSE [NRMSE]=0.040; mean absolute deviation [MAD]=0.031) and (2) RSDC (MAPE=3.171%; RMSE=0.421; MAE=0.321; RRSE=0.407; RAE=0.393; NRMSE=0.041; MAD=0.031). The performance obtained for MLP-based deep neural networks are as follows: (1) RADC (MAPE=3.103%; RMSE=0.413; MAE=0.316; RRSE=0.410; RAE=0.397; NRMSE=0.040; MAD=0.031) and (2) RSDC (MAPE=3.202%; RMSE=0.427; MAE=0.326; RRSE=0.413; RAE=0.399; NRMSE=0.041; MAD=0.032). Repeated measures ANOVA revealed that the mean RMSE differed significantly across models with P<.001. Post hoc tests using the Bonferroni correction method indicated that the mean RMSE of the deep learning/XGBoost models was statistically significantly (P<.001) lower than that of all other models, namely linear regression/elastic net/lasso/ridge regression.

CONCLUSIONS

Models built using XGBoost and MLP are suitable for predicting readmission charges billed by hospitals. The MDCs allow models to accurately predict hospital readmission charges.

摘要

背景

医疗保险和医疗补助服务中心预计,未来几年医疗保健成本将持续增长。再入院成本的上升是医疗保健成本增加的重要因素。医疗保健的多个领域,包括再入院,都以多种方式从各种机器学习算法的应用中受益。

目的

我们旨在确定适用于预测医院再入院费用的模型。我们的文献综述表明,机器学习在这方面的应用尚未得到充分探索。我们使用了各种预测方法,从白盒模型(如正则化技术)到黑盒模型(如基于深度学习的模型)。

方法

我们将再入院定义为同一主要诊断类别(RSDC)的再入院和全因再入院类别(RADC)。对于这些再入院类别,医疗保健研究与质量局通过2013年医疗保健成本和利用项目的全国再入院数据库分别识别出576,701人和1,091,580人。线性回归、套索回归、弹性网络、岭回归、极端梯度提升(XGBoost)和基于多层感知器(MLP)的深度学习模型是我们通过10折交叉验证针对RSDC和RADC测试的6种机器学习算法。

结果

我们使用数据驱动方法进行的初步分析表明,在RADC中,541,090名患者的后续再入院费用高于前一次费用,而在RSDC中这一数字为319,233人。RADC和RSDC此类情况的前三大主要诊断类别(MDC)相同。在RSDC的情况下,21个MDC的平均再入院费用高于前一次费用,而在RADC中仅13个MDC如此。我们推荐使用XGBoost和基于MLP的深度学习模型来预测再入院费用。XGBoost获得的以下性能指标为:(1)RADC(平均绝对百分比误差[MAPE]=3.121%;均方根误差[RMSE]=0.414;平均绝对误差[MAE]=0.317;根相对平方误差[RRSE]=0.410;相对绝对误差[RAE]=0.399;归一化RMSE[NRMSE]=0.040;平均绝对偏差[MAD]=0.031)和(2)RSDC(MAPE=3.171%;RMSE=0.421;MAE=0.321;RRSE=0.407;RAE=0.393;NRMSE=0.041;MAD=0.031)。基于MLP的深度神经网络获得的性能如下:(1)RADC(MAPE=3.103%;RMSE=0.413;MAE=0.316;RRSE=0.410;RAE=0.397;NRMSE=0.040;MAD=0.031)和(2)RSDC(MAPE=3.202%;RMSE=0.427;MAE=0.326;RRSE=0.413;RAE=0.399;NRMSE=0.041;MAD=0.032)。重复测量方差分析显示,各模型的平均RMSE差异显著,P<0.001。使用Bonferroni校正方法的事后检验表明,深度学习/XGBoost模型的平均RMSE在统计学上显著低于所有其他模型,即线性回归/弹性网络/套索/岭回归(P<0.001)。

结论

使用XGBoost和MLP构建的模型适用于预测医院的再入院费用。MDC有助于模型准确预测医院再入院费用。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b2ba/9472041/d298e53d3e76/medinform_v10i8e37578_fig1.jpg

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验