College of Dental Medicine-Illinois, Midwestern University, Downers Grove, Illinois, USA.
Midwestern University, Downers Grove, Illinois, USA.
J Dent Educ. 2023 Dec;87(12):1735-1745. doi: 10.1002/jdd.13375. Epub 2023 Oct 2.
PURPOSE/OBJECTIVES: This study had a twofold outcome. The first aim was to develop an efficient, machine learning (ML) model using data from a dental school clinic (DSC) electronic health record (EHR). This model identified patients with a high likelihood of failing an appointment and provided a user-friendly system with a rating score that would alert clinicians and administrators of patients at high risk of no-show appointments. The second aim was to identify key factors with ML modeling that contributed to patient no-show appointments.
Using de-identified data from a DSC EHR, eight ML algorithms were evaluated: simple decision tree, bagging regressor classifier, random forest classifier, gradient boosted regression, AdaBoost regression, XGBoost regression, neural network, and logistic regression classifier. The performance of each model was assessed using a confusion matrix with different threshold level of probability; precision, recall and predicted accuracy on each threshold; receiver-operating characteristic curve (ROC) and area under curve (AUC); as well as F1 score.
The ML models agreed on the threshold of probability score at 0.20-0.25 with Bagging classifier as the model that performed best with a F1 score of 0.41 and AUC of 0.76. Results showed a strong correlation between appointment failure and appointment confirmation, patient's age, number of visits before the appointment, total number of prior failed appointments, appointment lead time, as well as the patient's total number of medical alerts.
Altogether, the implementation of this user-friendly ML model can improve DSC workflow, benefiting dental students learning outcomes and optimizing personalized patient care.
目的/目标:本研究有两个结果。第一个目的是使用牙科学院诊所(DSC)电子健康记录(EHR)中的数据开发一个高效的机器学习(ML)模型。该模型确定了预约失败可能性高的患者,并提供了一个用户友好的系统,具有评分,可提醒临床医生和管理员注意高风险不出现预约的患者。第二个目的是使用 ML 建模确定导致患者不出现预约的关键因素。
使用 DSC EHR 的去识别数据,评估了八种 ML 算法:简单决策树、袋装回归分类器、随机森林分类器、梯度提升回归、AdaBoost 回归、XGBoost 回归、神经网络和逻辑回归分类器。使用不同概率阈值水平的混淆矩阵评估每个模型的性能;在每个阈值上的精度、召回率和预测准确性;接收者操作特征曲线(ROC)和曲线下面积(AUC);以及 F1 分数。
ML 模型在概率评分阈值为 0.20-0.25 时达成一致,Bagging 分类器作为表现最佳的模型,F1 得分为 0.41,AUC 为 0.76。结果表明,预约失败与预约确认、患者年龄、预约前就诊次数、总预约失败次数、预约提前时间以及患者的总医疗警报次数之间存在很强的相关性。
总之,该用户友好的 ML 模型的实施可以改善 DSC 的工作流程,使牙科学生的学习成果受益,并优化个性化患者护理。