McMahon Mark, Plate Sylvie, Herz Tobias, Brenner Gabi, Kleinknecht-Dolf Michael, Krauthammer Michael
Department of Quantitative Biomedicine, University of Zürich, Schmelzbergstrasse 26, Zürich, 8006, Switzerland, 41 446356631.
Biomedical Informatics DFL, University Hospital Zürich, Schmelzbergstrasse 26, Zürich, 8006, Switzerland.
J Med Internet Res. 2025 Sep 16;27:e66667. doi: 10.2196/66667.
Determining effective nurse staffing levels is crucial for ensuring quality patient care and operational efficiency within hospitals. Traditional workload prediction methods often rely on professional judgment or simple volume-based approaches, which can be inaccurate. Machine learning offers a promising avenue for more data-driven and precise predictions, by using historical nursing workload data to forecast future patient care requirements, which could help with staff planning while also improving patient outcomes and nurse well-being.
This methodological study aimed to use nursing activity data, specifically LEP (Leistungserfassung in der Pflege; "documentation of nursing activities"), to predict the future workload requirements using machine learning techniques.
We conducted a retrospective observational study at the University Hospital of Zürich, using nursing workload data for inpatients across eight wards, collected between 2017 and 2021. Data were transformed to represent nursing workload per ward and shift, with 3 shifts per day. Variables used in modeling included historical workload trends, patient characteristics, and upcoming operations. Machine learning models, including linear regression variants and tree-based methods (Random Forest and XGBoost), were trained and tested on this dataset to predict workload 72 hours in advance, on a shift-by-shift basis. Model performance was assessed using mean absolute error and mean absolute percentage error, and results were compared against a baseline of assuming no change in workload from the time of prediction. Prediction accuracy was further evaluated by categorizing future workload changes into decreased, similar, or increased workload relative to current shift levels.
Our findings demonstrate that machine learning models consistently outperform the baseline across all wards. The best-performing model was the lasso regression model, which achieved an average improvement in accuracy of 25.0% compared to the baseline. When used to predict upcoming changes in workload levels, the model achieved strong classification performance, giving an average area under the receiver operating characteristic curve of 0.79 and precision values between 66.2% and 75.3%. Crucially, the model severely misclassified-predicting an upcoming increase as a decrease, and vice versa-in just 0.17% of cases, highlighting potential reliability for using the model in practice. Key variables identified as important for predictions include historical shift workload averages and overall ward workload trends.
This study suggests the potential of machine learning to enhance nurse workload prediction, while highlighting the need for refinement. Limitations due to potential discrepancies between recorded nursing activities and the actual workload highlight the need for further investigation into data quality. To maximize impact, future research should focus on: (1) using more diverse data, (2) more advanced machine learning architecture that performs time-series modeling, (3) addressing data quality concerns, and (4) conducting controlled trials for real-world evaluation.
确定有效的护士人员配置水平对于确保医院内的优质患者护理和运营效率至关重要。传统的工作量预测方法通常依赖专业判断或简单的基于数量的方法,这些方法可能不准确。机器学习通过使用历史护理工作量数据来预测未来患者护理需求,为更数据驱动和精确的预测提供了一条有前景的途径,这有助于人员规划,同时改善患者结局和护士的福祉。
这项方法学研究旨在使用护理活动数据,特别是LEP(护理活动记录),通过机器学习技术预测未来的工作量需求。
我们在苏黎世大学医院进行了一项回顾性观察研究,使用了2017年至2021年期间收集的八个病房住院患者的护理工作量数据。数据被转换以表示每个病房和班次的护理工作量,每天有3个班次。建模中使用的变量包括历史工作量趋势、患者特征和即将进行的手术。机器学习模型,包括线性回归变体和基于树的方法(随机森林和XGBoost),在该数据集上进行训练和测试,以提前72小时逐班次预测工作量。使用平均绝对误差和平均绝对百分比误差评估模型性能,并将结果与预测时假设工作量不变的基线进行比较。通过将未来工作量变化相对于当前班次水平分类为工作量减少、相似或增加,进一步评估预测准确性。
我们的研究结果表明,机器学习模型在所有病房中始终优于基线。表现最佳的模型是套索回归模型,与基线相比,其准确率平均提高了25.0%。当用于预测即将到来的工作量水平变化时,该模型具有很强的分类性能,接收器操作特征曲线下的平均面积为0.79,精确值在66.2%至75.3%之间。至关重要的是,该模型在仅0.17%的病例中出现严重误分类——将即将到来的增加预测为减少,反之亦然——突出了在实践中使用该模型的潜在可靠性。被确定为对预测重要的关键变量包括历史班次工作量平均值和整个病房的工作量趋势。
这项研究表明了机器学习在增强护士工作量预测方面的潜力,同时强调了改进的必要性。由于记录的护理活动与实际工作量之间可能存在差异而导致的局限性突出了对数据质量进行进一步调查的必要性。为了最大化影响,未来的研究应侧重于:(1)使用更多样化的数据,(2)执行时间序列建模的更先进的机器学习架构,(3)解决数据质量问题,以及(4)进行实际评估的对照试验。