Rockenschaub Patrick, Gill Martin J, McNulty Dave, Carroll Orlagh, Freemantle Nick, Shallcross Laura
Institute of Health Informatics, University College London, London, United Kingdom.
Department of Clinical Microbiology, Queen Elizabeth Hospital Birmingham, University Hospitals Birmingham NHS Foundation Trust, Birmingham, United Kingdom.
PLOS Digit Health. 2023 Jun 13;2(6):e0000261. doi: 10.1371/journal.pdig.0000261. eCollection 2023 Jun.
Urinary tract infections (UTIs) are a major cause of emergency hospital admissions, but it remains challenging to diagnose them reliably. Application of machine learning (ML) to routine patient data could support clinical decision-making. We developed a ML model predicting bacteriuria in the ED and evaluated its performance in key patient groups to determine scope for its future use to improve UTI diagnosis and thus guide antibiotic prescribing decisions in clinical practice. We used retrospective electronic health records from a large UK hospital (2011-2019). Non-pregnant adults who attended the ED and had a urine sample cultured were eligible for inclusion. The primary outcome was predominant bacterial growth ≥104 cfu/mL in urine. Predictors included demography, medical history, ED diagnoses, blood tests, and urine flow cytometry. Linear and tree-based models were trained via repeated cross-validation, re-calibrated, and validated on data from 2018/19. Changes in performance were investigated by age, sex, ethnicity, and suspected ED diagnosis, and compared to clinical judgement. Among 12,680 included samples, 4,677 (36.9%) showed bacterial growth. Relying primarily on flow cytometry parameters, our best model achieved an area under the ROC curve (AUC) of 0.813 (95% CI 0.792-0.834) in the test data, and achieved both higher sensitivity and specificity compared to proxies of clinician's judgement. Performance remained stable for white and non-white patients but was lower during a period of laboratory procedure change in 2015, in patients ≥65 years (AUC 0.783, 95% CI 0.752-0.815), and in men (AUC 0.758, 95% CI 0.717-0.798). Performance was also slightly reduced in patients with recorded suspicion of UTI (AUC 0.797, 95% CI 0.765-0.828). Our results suggest scope for use of ML to inform antibiotic prescribing decisions by improving diagnosis of suspected UTI in the ED, but performance varied with patient characteristics. Clinical utility of predictive models for UTI is therefore likely to differ for important patient subgroups including women <65 years, women ≥65 years, and men. Tailored models and decision thresholds may be required that account for differences in achievable performance, background incidence, and risks of infectious complications in these groups.
尿路感染(UTIs)是急诊住院的主要原因,但可靠诊断仍具有挑战性。将机器学习(ML)应用于常规患者数据可以支持临床决策。我们开发了一种预测急诊科菌尿症的ML模型,并在关键患者群体中评估其性能,以确定其未来用于改善UTI诊断从而指导临床实践中抗生素处方决策的范围。我们使用了来自一家大型英国医院(2011 - 2019年)的回顾性电子健康记录。到急诊科就诊并进行了尿样培养的非妊娠成年人符合纳入条件。主要结局是尿液中主要细菌生长≥104 cfu/mL。预测因素包括人口统计学、病史、急诊科诊断、血液检查和尿流式细胞术。基于线性和树的模型通过重复交叉验证进行训练、重新校准,并在2018/19年的数据上进行验证。通过年龄、性别、种族和疑似急诊科诊断调查性能变化,并与临床判断进行比较。在纳入的12,680个样本中,4,677个(36.9%)显示有细菌生长。我们的最佳模型主要依靠流式细胞术参数,在测试数据中的ROC曲线下面积(AUC)为0.813(95% CI 0.792 - 0.834),与临床医生判断的指标相比,具有更高的敏感性和特异性。白人和非白人患者的性能保持稳定,但在2015年实验室程序变化期间、≥65岁患者(AUC 0.783,95% CI 0.752 - 0.815)和男性(AUC 0.758,95% CI 0.717 - 0.798)中性能较低。在记录有UTI疑似的患者中性能也略有降低(AUC 0.797,95% CI 0.765 - 0.828)。我们的结果表明,ML有潜力通过改善急诊科疑似UTI的诊断来为抗生素处方决策提供信息,但性能因患者特征而异。因此,UTI预测模型在包括<65岁女性、≥65岁女性和男性在内的重要患者亚组中的临床效用可能不同。可能需要针对这些组中可实现的性能差异、背景发病率和感染并发症风险制定量身定制的模型和决策阈值。