Porto Bruno Matos, Fogliatto Flavio Sanson
Industrial Engineering Department, Federal University of Rio Grande do Sul, Av. Osvaldo Aranha, 99, 5th floor, Porto Alegre, RS, 90020-035, Brazil.
Industrial Engineering Department, Federal University of Rio Grande do Sul, Av. Osvaldo Aranha, 99, 5th floor, Porto Alegre, 90035-190, Brazil.
BMC Med Inform Decis Mak. 2024 Dec 18;24(1):377. doi: 10.1186/s12911-024-02788-6.
Emergency department (ED) overcrowding is an important problem in many countries. Accurate predictions of ED patient arrivals can help management to better allocate staff and medical resources. In this study, we investigate the use of calendar and meteorological predictors, as well as feature-engineered variables, to predict daily patient arrivals using datasets from eleven different EDs across three countries.
Six machine learning (ML) algorithms were tested on forecasting horizons of 7 and 45 days. Three of them - Light Gradient Boosting Machine (LightGBM), Support Vector Machine with Radial Basis Function (SVM-RBF), and Neural Network Autoregression (NNAR) - were never before reported for predicting ED patient arrivals. Algorithms' hyperparameters were tuned through a grid-search with cross-validation. Prediction performance was assessed using fivefold cross-validation and four performance metrics.
The eXtreme Gradient Boosting (XGBoost) was the best-performing model on both prediction horizons, also outperforming results reported in past studies on ED arrival prediction. XGBoost and NNAR achieved the best performance in nine out of the eleven analyzed datasets, with MAPE values ranging from 5.03% to 14.1%. Feature engineering (FE) improved the performance of the ML algorithms.
Accuracy in predicting ED arrivals, achieved through the FE approach, is key for managing human and material resources, as well as reducing patient waiting times and lengths of stay.
急诊科过度拥挤是许多国家面临的一个重要问题。准确预测急诊科患者 arrivals 有助于管理部门更好地分配人员和医疗资源。在本研究中,我们调查了使用日历和气象预测指标以及特征工程变量,利用来自三个国家 11 个不同急诊科的数据集来预测每日患者 arrivals。
对六种机器学习(ML)算法在 7 天和 45 天的预测期上进行了测试。其中三种算法——轻量级梯度提升机(LightGBM)、径向基函数支持向量机(SVM-RBF)和神经网络自回归(NNAR)——此前从未被报道用于预测急诊科患者 arrivals。通过带有交叉验证的网格搜索对算法的超参数进行了调整。使用五折交叉验证和四个性能指标评估预测性能。
极端梯度提升(XGBoost)在两个预测期上都是表现最佳的模型,其表现也优于过去关于急诊科 arrivals 预测的研究中所报告的结果。XGBoost 和 NNAR 在 11 个分析数据集中的 9 个中取得了最佳性能,平均绝对百分比误差(MAPE)值在 5.03%至 14.1%之间。特征工程(FE)提高了 ML 算法的性能。
通过 FE 方法实现的急诊科 arrivals 预测准确性,对于管理人力和物力资源以及减少患者等待时间和住院时间至关重要。