Yang Yi, Madanian Samaneh, Parry David
Auckland University of Technology, Auckland, New Zealand.
Murdoch University, Perth, Australia.
JMIR Med Inform. 2024 Jan 12;12:e48273. doi: 10.2196/48273.
The phenomenon of patients missing booked appointments without canceling them-known as Did Not Show (DNS), Did Not Attend (DNA), or Failed To Attend (FTA)-has a detrimental effect on patients' health and results in massive health care resource wastage.
Our objective was to develop machine learning (ML) models and evaluate their performance in predicting the likelihood of DNS for hospital outpatient appointments at the MidCentral District Health Board (MDHB) in New Zealand.
We sourced 5 years of MDHB outpatient records (a total of 1,080,566 outpatient visits) to build the ML prediction models. We developed 3 ML models using logistic regression, random forest, and Extreme Gradient Boosting (XGBoost). Subsequently, 10-fold cross-validation and hyperparameter tuning were deployed to minimize model bias and boost the algorithms' prediction strength. All models were evaluated against accuracy, sensitivity, specificity, and area under the receiver operating characteristic (AUROC) curve metrics.
Based on 5 years of MDHB data, the best prediction classifier was XGBoost, with an area under the curve (AUC) of 0.92, sensitivity of 0.83, and specificity of 0.85. The patients' DNS history, age, ethnicity, and appointment lead time significantly contributed to DNS prediction. An ML system trained on a large data set can produce useful levels of DNS prediction.
This research is one of the very first published studies that use ML technologies to assist with DNS management in New Zealand. It is a proof of concept and could be used to benchmark DNS predictions for the MDHB and other district health boards. We encourage conducting additional qualitative research to investigate the root cause of DNS issues and potential solutions. Addressing DNS using better strategies potentially can result in better utilization of health care resources and improve health equity.
患者未取消已预约的就诊,即爽约现象(又称未到诊、未就诊或未出席),对患者健康有不利影响,并导致大量医疗资源浪费。
我们的目的是开发机器学习(ML)模型,并评估其在预测新西兰中中央地区卫生委员会(MDHB)医院门诊预约爽约可能性方面的性能。
我们获取了5年的MDHB门诊记录(共1,080,566次门诊就诊)来构建ML预测模型。我们使用逻辑回归、随机森林和极端梯度提升(XGBoost)开发了3种ML模型。随后,采用10折交叉验证和超参数调整来最小化模型偏差并增强算法的预测能力。所有模型均根据准确率、灵敏度、特异性和受试者工作特征曲线下面积(AUROC)指标进行评估。
基于5年的MDHB数据,最佳预测分类器是XGBoost,曲线下面积(AUC)为0.92,灵敏度为0.83,特异性为0.85。患者的爽约历史、年龄、种族和预约提前期对爽约预测有显著贡献。在大数据集上训练的ML系统可以产生有用的爽约预测水平。
本研究是新西兰最早发表的使用ML技术辅助爽约管理的研究之一。这是一个概念验证,可用于为MDHB和其他地区卫生委员会的爽约预测设定基准。我们鼓励开展更多定性研究,以调查爽约问题的根本原因和潜在解决方案。采用更好的策略解决爽约问题可能会提高医疗资源的利用率并改善健康公平性。