利用大语言模型检测胃肠道出血以助力质量改进和合理报销。

Detection of Gastrointestinal Bleeding With Large Language Models to Aid Quality Improvement and Appropriate Reimbursement.

作者信息

Zheng Neil S, Keloth Vipina K, You Kisung, Kats Daniel, Li Darrick K, Deshpande Ohm, Sachar Hamita, Xu Hua, Laine Loren, Shung Dennis L

机构信息

Section of Digestive Diseases, Department of Medicine, Yale School of Medicine, New Haven, Connecticut; Department of Medicine, Brigham and Women's Hospital, Boston, Massachusetts.

Department of Biomedical Informatics and Data Science, Yale School of Medicine, New Haven, Connecticut; Department of Medicine, Yale School of Medicine, New Haven, Connecticut.

出版信息

Gastroenterology. 2025 Jan;168(1):111-120.e4. doi: 10.1053/j.gastro.2024.09.014. Epub 2024 Sep 18.

DOI:10.1053/j.gastro.2024.09.014

PMID:39304088

Abstract

BACKGROUND & AIMS: Early identification and accurate characterization of overt gastrointestinal bleeding (GIB) enables opportunities to optimize patient management and ensures appropriately risk-adjusted coding for claims-based quality measures and reimbursement. Recent advancements in generative artificial intelligence, particularly large language models (LLMs), create opportunities to support accurate identification of clinical conditions. In this study, we present the first LLM-based pipeline for identification of overt GIB in the electronic health record (EHR). We demonstrate 2 clinically relevant applications: the automated detection of recurrent bleeding and appropriate reimbursement coding for patients with GIB.

METHODS

Development of the LLM-based pipeline was performed on 17,712 nursing notes from 1108 patients who were hospitalized with acute GIB and underwent endoscopy in the hospital from 2014 to 2023. The pipeline was used to train an EHR-based machine learning model for detection of recurrent bleeding on 546 patients presenting to 2 hospitals and externally validated on 562 patients presenting to 4 different hospitals. The pipeline was used to develop an algorithm for appropriate reimbursement coding on 7956 patients who underwent endoscopy in the hospital from 2019 to 2023.

RESULTS

The LLM-based pipeline accurately detected melena (positive predictive value, 0.972; sensitivity, 0.900), hematochezia (positive predictive value, 0.900; sensitivity, 0.908), and hematemesis (positive predictive value, 0.859; sensitivity, 0.932). The EHR-based machine learning model identified recurrent bleeding with area under the curve of 0.986, sensitivity of 98.4%, and specificity of 97.5%. The reimbursement coding algorithm resulted in an average per-patient reimbursement increase of $1299 to $3247 with a total difference of $697,460 to $1,743,649.

CONCLUSIONS

An LLM-based pipeline can robustly detect overt GIB in the EHR with clinically relevant applications in detection of recurrent bleeding and appropriate reimbursement coding.

摘要

背景与目的

早期识别和准确表征显性胃肠道出血（GIB）可为优化患者管理提供机会，并确保基于索赔的质量指标和报销的风险调整编码适当。生成式人工智能的最新进展，特别是大语言模型（LLMs），为支持临床状况的准确识别创造了机会。在本研究中，我们展示了首个基于大语言模型的流程，用于在电子健康记录（EHR）中识别显性GIB。我们展示了两个临床相关应用：复发性出血的自动检测以及GIB患者的适当报销编码。

方法

基于大语言模型的流程是在2014年至2023年期间因急性GIB住院并在医院接受内镜检查的1108例患者的17712份护理记录上开发的。该流程用于训练基于EHR的机器学习模型，以检测到两家医院就诊的546例患者的复发性出血，并在到四家不同医院就诊的562例患者上进行外部验证。该流程用于为2019年至2023年在医院接受内镜检查的7956例患者开发适当报销编码算法。

结果

基于大语言模型的流程准确检测出黑便（阳性预测值，0.972；敏感性，0.900）、便血（阳性预测值，0.900；敏感性，0.908）和呕血（阳性预测值，0.859；敏感性，0.932）。基于EHR的机器学习模型识别复发性出血的曲线下面积为0.986，敏感性为98.4%，特异性为97.5%。报销编码算法使每位患者的平均报销增加了1299美元至3247美元，总差异为697460美元至1743649美元。

结论

基于大语言模型的流程能够在EHR中可靠地检测显性GIB，并在复发性出血检测和适当报销编码方面具有临床相关应用。

相似文献

Detection of Gastrointestinal Bleeding With Large Language Models to Aid Quality Improvement and Appropriate Reimbursement.

Gastroenterology. 2025 Jan;168(1):111-120.e4. doi: 10.1053/j.gastro.2024.09.014. Epub 2024 Sep 18.

Validation of an Electronic Health Record-Based Machine Learning Model Compared With Clinical Risk Scores for Gastrointestinal Bleeding.

Gastroenterology. 2024 Nov;167(6):1198-1212. doi: 10.1053/j.gastro.2024.06.030. Epub 2024 Jul 5.

Early identification of patients with acute gastrointestinal bleeding using natural language processing and decision rules.

J Gastroenterol Hepatol. 2021 Jun;36(6):1590-1597. doi: 10.1111/jgh.15313. Epub 2021 Jan 25.

Integrating large language models with human expertise for disease detection in electronic health records.

Comput Biol Med. 2025 Jun;191:110161. doi: 10.1016/j.compbiomed.2025.110161. Epub 2025 Apr 7.

A decision support system to facilitate management of patients with acute gastrointestinal bleeding.

Artif Intell Med. 2008 Mar;42(3):247-59. doi: 10.1016/j.artmed.2007.10.003. Epub 2007 Dec 11.

Engineering of Generative Artificial Intelligence and Natural Language Processing Models to Accurately Identify Arrhythmia Recurrence.

Circ Arrhythm Electrophysiol. 2025 Jan;18(1):e013023. doi: 10.1161/CIRCEP.124.013023. Epub 2024 Dec 16.

Comparison of 2 Natural Language Processing Methods for Identification of Bleeding Among Critically Ill Patients.

JAMA Netw Open. 2018 Oct 5;1(6):e183451. doi: 10.1001/jamanetworkopen.2018.3451.

Advancing care for acute gastrointestinal bleeding using artificial intelligence.

J Gastroenterol Hepatol. 2021 Feb;36(2):273-278. doi: 10.1111/jgh.15372.

Classifying Unstructured Text in Electronic Health Records for Mental Health Prediction Models: Large Language Model Evaluation Study.

JMIR Med Inform. 2025 Jan 21;13:e65454. doi: 10.2196/65454.

Limited usefulness of endoscopic evaluation in patients with continuous-flow left ventricular assist devices and gastrointestinal bleeding.

J Heart Lung Transplant. 2018 Jun;37(6):723-732. doi: 10.1016/j.healun.2017.12.017. Epub 2017 Dec 20.

引用本文的文献

Performance and improvement strategies for adapting generative large language models for electronic health record applications: A systematic review.

Int J Med Inform. 2025 Aug 28;205:106091. doi: 10.1016/j.ijmedinf.2025.106091.

Large language models for clinical decision support in gastroenterology and hepatology.

Nat Rev Gastroenterol Hepatol. 2025 Aug 22. doi: 10.1038/s41575-025-01108-1.

The Potential Clinical Utility of the Customized Large Language Model in Gastroenterology: A Pilot Study.

Bioengineering (Basel). 2024 Dec 24;12(1):1. doi: 10.3390/bioengineering12010001.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

利用大语言模型检测胃肠道出血以助力质量改进和合理报销。

Detection of Gastrointestinal Bleeding With Large Language Models to Aid Quality Improvement and Appropriate Reimbursement.

作者信息

机构信息

出版信息

METHODS

RESULTS

CONCLUSIONS

背景与目的

方法

结果

结论

相似文献

引用本文的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

相似文献

引用本文的文献