Department of Anesthesiology, School of Medicine, Washington University in St Louis, St Louis, MO, United States.
Department of Computer Science, McKelvey School of Engineering, Washington University in St Louis, St Louis, MO, United States.
J Biomed Inform. 2022 Mar;127:104015. doi: 10.1016/j.jbi.2022.104015. Epub 2022 Feb 5.
Burnout is a significant public health concern affecting more than half of the healthcare workforce; however, passive screening tools to detect burnout are lacking. We investigated the ability of machine learning (ML) techniques to identify burnout using passively collected electronic health record (EHR)-based audit log data.
Physician trainees participated in a longitudinal study where they completed monthly burnout surveys and provided access to their EHR-based audit logs. Using the monthly burnout scores as the target outcome, we trained ML models using combinations of features derived from audit log data-aggregate measures of clinical workload, time series-based temporal measures of EHR use, and the baseline burnout score. Five ML models were constructed to predict burnout as a continuous score: penalized linear regression, support vector machine, neural network, random forest, and gradient boosting machine.
88 trainee physicians participated and completed 416 surveys; greater than10 million audit log actions were collected (Mean [Standard Deviation] = 25,691 [14,331] actions per month, per physician). The workload feature set predicted burnout score with a mean absolute error (MAE) of 0.602 (95% Confidence Interval (CI), 0.412-0.826), and was able to predict burnout status with an average AUROC of 0.595 (95% CI 0.355-0.808) and average accuracy 0.567 (95% CI 0.393-0.742). The temporal feature set had a similar performance, with MAE 0.596 (95% CI 0.391-0.826), and AUROC 0.581 (95% CI 0.343-0.790). The addition of the baseline burnout score to the workload features improved the model performance to a mean AUROC of 0.829 (95% CI 0.607-0.996) and mean accuracy of 0.781 (95% CI 0.587-0.936); however, this performance was not meaningfully different than using the baseline burnout score alone.
Current findings illustrate the complexities of predicting burnout exclusively based on clinical work activities as captured in the EHR, highlighting its multi-factorial and individualized nature. Future prediction studies of burnout should account for individual factors (e.g., resilience, physiological measurements such as sleep) and associated system-level factors (e.g., leadership).
burnout 是一个严重的公共卫生问题,影响了超过一半的医疗保健劳动力;然而,缺乏被动的筛查工具来检测 burnout。我们研究了使用被动收集的电子健康记录 (EHR) 审计日志数据的机器学习 (ML) 技术来识别 burnout 的能力。
医师受训者参加了一项纵向研究,他们每月完成 burnout 调查,并提供对其基于 EHR 的审计日志的访问。使用每月 burnout 得分作为目标结果,我们使用从审计日志数据中提取的特征组合来训练 ML 模型-临床工作量的聚合度量、基于时间序列的 EHR 使用时间度量以及基线 burnout 得分。构建了五个用于预测 burnout 连续评分的 ML 模型:惩罚线性回归、支持向量机、神经网络、随机森林和梯度提升机。
88 名受训医师参与并完成了 416 项调查;收集了超过 1000 万条审计日志操作(Mean [SD] = 25691 [14331] 每个医师每月的操作)。工作负载特征集预测 burnout 评分的平均绝对误差 (MAE) 为 0.602(95%置信区间 [CI],0.412-0.826),并且能够以平均 AUROC 0.595(95%CI 0.355-0.808)和平均准确率 0.567(95%CI 0.393-0.742)预测 burnout 状态。时间特征集的性能相似,MAE 为 0.596(95%CI 0.391-0.826),AUROC 为 0.581(95%CI 0.343-0.790)。将基线 burnout 评分添加到工作负载特征中可将模型性能提高到平均 AUROC 0.829(95%CI 0.607-0.996)和平均准确率 0.781(95%CI 0.587-0.936);然而,这与单独使用基线 burnout 评分相比并没有明显的不同。
当前的研究结果说明了仅根据 EHR 中捕获的临床工作活动预测 burnout 的复杂性,突出了其多因素和个体化的性质。未来的 burnout 预测研究应考虑个体因素(例如,弹性、睡眠等生理测量)和相关的系统因素(例如,领导力)。