Cohen Scott A, Chen Ziyi, Bian Jiang, Boucher Christina, Wu Yonghui, Prosperi Mattia
Department of Epidemiology, University of Florida, Gainesville, FL, USA.
Department of Health Outcomes and Biomedical Informatics, College of Medicine, University of Florida, Gainesville, Florida, USA.
Artif Intell Med Conf Artif Intell Med (2005-). 2025 Jun;15734:65-76. doi: 10.1007/978-3-031-95838-0_7. Epub 2025 Jun 23.
Approaches to guide empiric antimicrobial therapy are needed, especially in critically ill populations with prevalent antimicrobial resistance (AMR). While artificial intelligence shows promise in predicting AMR, scalable and generalizable prediction models are essential for broad clinical adoption. We utilized a publicly available clinical large language model (LLM), Gatortron, in comparison to traditional machine learning, to predict AMR and methicillin-resistant (MRSA)-specific patterns within a hospital-onset sepsis cohort using electronic health record (EHR) data available at time of illness onset. EHR data from approximately 150,000 hospitalizations with a documented bacterial infection at a large tertiary care healthcare system between 2010 and 2023 were examined. Among 2,019 eligible hospital-onset sepsis encounters, an AMR pathogen was identified in 911 (45%) and MRSA was isolated in 234 (26%). LLMs outperformed traditional models in predicting MRSA, achieving an AUC of 0.73 compared to 0.66 for the best traditional ML model, with superior F1 scores (0.43 vs. 0.16 for ML). Negative predictive value for MRSA prediction using LLM was at least 90% across majority of infection presentations. The LLM's superior prediction using a relatively simplified feature set demonstrates the potential of leveraging EHR data for early resistance prediction, though further refinement is needed to enhance sensitivity and clinical applicability.
需要有指导经验性抗菌治疗的方法,尤其是在抗菌药物耐药性(AMR)普遍存在的危重症人群中。虽然人工智能在预测AMR方面显示出前景,但可扩展且可推广的预测模型对于广泛的临床应用至关重要。我们使用了一个公开可用的临床大语言模型(LLM)Gatortron,并与传统机器学习方法进行比较,利用疾病发作时可用的电子健康记录(EHR)数据,来预测医院获得性脓毒症队列中的AMR和耐甲氧西林金黄色葡萄球菌(MRSA)特异性模式。我们检查了2010年至2023年间在一个大型三级医疗保健系统中约15万例有细菌感染记录的住院患者的EHR数据。在2019例符合条件的医院获得性脓毒症病例中,911例(45%)鉴定出AMR病原体,234例(26%)分离出MRSA。在预测MRSA方面,大语言模型优于传统模型,曲线下面积(AUC)达到0.73,而最佳传统机器学习模型为0.66,大语言模型的F1分数更高(机器学习模型为0.16,大语言模型为0.43)。在大多数感染表现中,使用大语言模型预测MRSA的阴性预测值至少为90%。大语言模型使用相对简化的特征集进行的卓越预测表明,利用EHR数据进行早期耐药性预测具有潜力,不过仍需要进一步优化以提高敏感性和临床适用性。