机器学习算法与自然语言处理相结合可能会提高急诊科检测菌血症的概率：对94482名患者的回顾性大数据分析

Combination of machine learning algorithms with natural language processing may increase the probability of bacteremia detection in the emergency department: A retrospective, big-data analysis of 94,482 patients.

作者信息

Ben-Haim Gal, Yosef Mika, Rowand Eyade, Ben-Yosef Jonathan, Berman Aya, Sina Sigal, Halabi Nitsan, Grossbard Eitan, Marziano Yehonatan, Segal Gad

机构信息

Emergency Department, Chaim Sheba Medical Center, Ramat-Gan, Israel.

The Faculty of Medicine, Tel-Aviv University, Tel-Aviv, Israel.

出版信息

Digit Health. 2024 Sep 12;10:20552076241277673. doi: 10.1177/20552076241277673. eCollection 2024 Jan-Dec.

DOI:10.1177/20552076241277673

PMID:39291149

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11406632/

Abstract

BACKGROUND

Prompt diagnosis of bacteremia in the emergency department (ED) is of utmost importance. Nevertheless, the average time to first clinical laboratory finding range from 1 to 3 days. Alongside a myriad of scoring systems for occult bacteremia prediction, efforts for applying artificial intelligence (AI) in this realm are still preliminary. In the current study we combined an AI algorithm with a Natural Language Processing (NLP) algorithm that would potentially increase the yield extracted from clinical ED data.

METHODS

This study involved adult patients who visited our emergency department and at least one blood culture was taken to rule out bacteremia. Using both tabular and free text data, we built an ensemble model that leverages XGBoost for structured data, and logistic regression (LR) on a word-analysis technique called bag-of-words (BOW) Term Frequency-Inverse Document Frequency (TF-IDF), for textual data. All algorithms were designed in order to predict the risk for bacteremia with ED patients whose blood cultures were sent to the laboratory.

RESULTS

The study cohort comprised 94,482 individuals, of whom 52% were males. The prevalence of bacteremia in the entire cohort was 9.7%. The model trained on the tabular data yielded an area under the curve (AUC) of 73.7% for XGBoost, while the LR that was trained on the free text achieved an AUC of 71.3%. After checking a range of weights, the best combination was for 55% weight on the XGBoost prediction and 45% weight on the LR prediction. The final model prediction yielded an AUC of 75.6%.

CONCLUSION

Harnessing artificial intelligence to the task of bacteremia surveillance in the ED settings by a combination of both free text and tabular data analysis improved predictive performance compared to using tabular data alone. We recommend that future AI applications based on our findings should be assimilated into the clinical routines of ED physicians.

摘要

背景

在急诊科（ED）快速诊断菌血症至关重要。然而，首次临床实验室检查结果的平均时间为1至3天。除了众多用于预测隐匿性菌血症的评分系统外，在这一领域应用人工智能（AI）的努力仍处于初步阶段。在本研究中，我们将一种AI算法与一种自然语言处理（NLP）算法相结合，这可能会提高从急诊临床数据中提取的信息。

方法

本研究纳入了就诊于我们急诊科且至少进行了一次血培养以排除菌血症的成年患者。利用表格数据和自由文本数据，我们构建了一个集成模型，该模型利用XGBoost处理结构化数据，并在一种名为词袋（BOW）词频 - 逆文档频率（TF - IDF）的词分析技术上使用逻辑回归（LR）处理文本数据。所有算法的设计目的都是预测血培养已送检实验室的急诊患者发生菌血症的风险。

结果

研究队列包括94482人，其中52%为男性。整个队列中菌血症的患病率为9.7%。在表格数据上训练的模型中，XGBoost的曲线下面积（AUC）为73.7%，而在自由文本上训练的LR的AUC为71.3%。在检查了一系列权重后，最佳组合是XGBoost预测权重为55%，LR预测权重为45%。最终模型预测的AUC为75.6%。

结论

与仅使用表格数据相比，通过结合自由文本和表格数据分析，在急诊环境中利用人工智能进行菌血症监测可提高预测性能。我们建议，基于我们的研究结果，未来的人工智能应用应融入急诊医生的临床常规工作中。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6b9d/11406632/ea0be1c67a0e/10.1177_20552076241277673-fig1.jpg

相似文献

Combination of machine learning algorithms with natural language processing may increase the probability of bacteremia detection in the emergency department: A retrospective, big-data analysis of 94,482 patients.

Digit Health. 2024 Sep 12;10:20552076241277673. doi: 10.1177/20552076241277673. eCollection 2024 Jan-Dec.

Real-time artificial intelligence system for bacteremia prediction in adult febrile emergency department patients.

Int J Med Inform. 2023 Oct;178:105176. doi: 10.1016/j.ijmedinf.2023.105176. Epub 2023 Aug 6.

Natural language processing of head CT reports to identify intracranial mass effect: CTIME algorithm.

Am J Emerg Med. 2022 Jan;51:388-392. doi: 10.1016/j.ajem.2021.11.001. Epub 2021 Nov 9.

Predicting adult neuroscience intensive care unit admission from emergency department triage using a retrospective, tabular-free text machine learning approach.

Sci Rep. 2021 Jan 14;11(1):1381. doi: 10.1038/s41598-021-80985-3.

Development of an artificial intelligence bacteremia prediction model and evaluation of its impact on physician predictions focusing on uncertainty.

Sci Rep. 2023 Aug 19;13(1):13518. doi: 10.1038/s41598-023-40708-2.

Prediction of bacteremia at the emergency department during triage and disposition stages using machine learning models.

Am J Emerg Med. 2022 Mar;53:86-93. doi: 10.1016/j.ajem.2021.12.065. Epub 2022 Jan 1.

Automated Classification of Free-Text Radiology Reports: Using Different Feature Extraction Methods to Identify Fractures of the Distal Fibula.

Rofo. 2023 Aug;195(8):713-719. doi: 10.1055/a-2061-6562. Epub 2023 May 9.

An Artificial Intelligence Model for Predicting Trauma Mortality Among Emergency Department Patients in South Korea: Retrospective Cohort Study.

J Med Internet Res. 2023 Aug 29;25:e49283. doi: 10.2196/49283.

Predicting Bacteremia among Septic Patients Based on ED Information by Machine Learning Methods: A Comparative Study.

Diagnostics (Basel). 2022 Oct 15;12(10):2498. doi: 10.3390/diagnostics12102498.

Early short-term prediction of emergency department length of stay using natural language processing for low-acuity outpatients.

Am J Emerg Med. 2020 Nov;38(11):2368-2373. doi: 10.1016/j.ajem.2020.03.019. Epub 2020 Mar 10.

引用本文的文献

Mapping artificial intelligence models in emergency medicine: A scoping review on artificial intelligence performance in emergency care and education.

Turk J Emerg Med. 2025 Apr 1;25(2):67-91. doi: 10.4103/tjem.tjem_45_25. eCollection 2025 Apr-Jun.

本文引用的文献

How artificial intelligence could transform emergency care.

Am J Emerg Med. 2024 Jul;81:40-46. doi: 10.1016/j.ajem.2024.04.024. Epub 2024 Apr 16.

Investigating machine learning and natural language processing techniques applied for detecting eating disorders: a systematic literature review.

Front Psychiatry. 2024 Mar 26;15:1319522. doi: 10.3389/fpsyt.2024.1319522. eCollection 2024.

De-identification of clinical free text using natural language processing: A systematic review of current approaches.

Artif Intell Med. 2024 May;151:102845. doi: 10.1016/j.artmed.2024.102845. Epub 2024 Mar 20.

[Usefulness of the MPB-INFURG-SEMES model to predict bacteremia in the patient with solid tumor in the Emergency Department].

Rev Esp Quimioter. 2024 Jun;37(3):257-265. doi: 10.37201/req/004.2024. Epub 2024 Mar 23.

Assessing the research landscape and clinical utility of large language models: a scoping review.

BMC Med Inform Decis Mak. 2024 Mar 12;24(1):72. doi: 10.1186/s12911-024-02459-6.

Transportability of bacterial infection prediction models for critically ill patients.

J Am Med Inform Assoc. 2023 Dec 22;31(1):98-108. doi: 10.1093/jamia/ocad174.

Development of an artificial intelligence bacteremia prediction model and evaluation of its impact on physician predictions focusing on uncertainty.

Sci Rep. 2023 Aug 19;13(1):13518. doi: 10.1038/s41598-023-40708-2.

Real-time artificial intelligence system for bacteremia prediction in adult febrile emergency department patients.

Int J Med Inform. 2023 Oct;178:105176. doi: 10.1016/j.ijmedinf.2023.105176. Epub 2023 Aug 6.

Taxonomy of hybrid architectures involving rule-based reasoning and machine learning in clinical decision systems: A scoping review.

J Biomed Inform. 2023 Aug;144:104428. doi: 10.1016/j.jbi.2023.104428. Epub 2023 Jun 22.

Health system-scale language models are all-purpose prediction engines.

Nature. 2023 Jul;619(7969):357-362. doi: 10.1038/s41586-023-06160-y. Epub 2023 Jun 7.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

机器学习算法与自然语言处理相结合可能会提高急诊科检测菌血症的概率：对94482名患者的回顾性大数据分析

Combination of machine learning algorithms with natural language processing may increase the probability of bacteremia detection in the emergency department: A retrospective, big-data analysis of 94,482 patients.

作者信息

机构信息

出版信息

BACKGROUND

METHODS

RESULTS

CONCLUSION

背景

方法

结果

结论

相似文献

引用本文的文献

本文引用的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

相似文献

引用本文的文献

本文引用的文献