• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

一种使用带有检索增强文本生成功能的大语言模型来自动化急诊医学文档记录的流程。

A Pipeline for Automating Emergency Medicine Documentation Using LLMs with Retrieval-Augmented Text Generation.

作者信息

Moser Denis, Bender Matthias, Sariyar Murat

机构信息

Department Medical Informatics, Bern University of Applied Sciences, Biel/Bienne, Switzerland.

出版信息

Appl Artif Intell. 2025 Jun 18;39(1):2519169. doi: 10.1080/08839514.2025.2519169. eCollection 2025.

DOI:10.1080/08839514.2025.2519169
PMID:40755813
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC12315831/
Abstract

Accurate and efficient documentation of patient information is vital in emergency healthcare settings. Traditional manual documentation methods are often time-consuming and prone to errors, potentially affecting patient outcomes. Large Language Models (LLMs) offer a promising solution to enhance medical communication systems; however, their clinical deployment, particularly in non-English languages such as German, presents challenges related to content accuracy, clinical relevance, and data privacy. This study addresses these challenges by developing and evaluating an automated pipeline for emergency medical documentation in German. The research objectives include (1) generating synthetic dialogues with known ground truth data to create controlled datasets for evaluating NLP performance and (2) designing an innovative pipeline to retrieve essential clinical information from these dialogues. A subset of 100 anonymized patient records from the MIMIC-IV-ED dataset was selected, ensuring diversity in demographics, chief complaints, and conditions. A Retrieval-Augmented Generation (RAG) system extracted key nominal and numerical features using chunking, embedding, and dynamic prompts. Evaluation metrics included precision, recall, F1-score, and sentiment analysis. Initial results demonstrated high extraction accuracy, particularly in medication data (F1-scores: 86.21%-100%), though performance declined in nuanced clinical language, requiring further refinement for real-world emergency settings.

摘要

在紧急医疗环境中,准确高效地记录患者信息至关重要。传统的手动记录方法往往耗时且容易出错,可能会影响患者的治疗结果。大语言模型(LLMs)为增强医疗通信系统提供了一个有前景的解决方案;然而,它们在临床中的应用,尤其是在德语等非英语语言环境中,在内容准确性、临床相关性和数据隐私方面存在挑战。本研究通过开发和评估一个用于德语紧急医疗记录的自动化流程来应对这些挑战。研究目标包括:(1)使用已知的真实数据生成合成对话,以创建用于评估自然语言处理(NLP)性能的受控数据集;(2)设计一个创新的流程,从这些对话中检索基本的临床信息。从MIMIC-IV-ED数据集中选取了100份匿名患者记录的子集,确保在人口统计学、主要症状和病情方面具有多样性。一个检索增强生成(RAG)系统使用分块、嵌入和动态提示来提取关键的名词和数字特征。评估指标包括精确率、召回率、F1分数和情感分析。初步结果显示出较高的提取准确性,尤其是在用药数据方面(F1分数:86.21%-100%),不过在细微的临床语言方面性能有所下降,需要进一步优化以适用于现实世界的紧急情况。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/688b/12315831/170f5b95e6bb/UAAI_A_2519169_F0008_B.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/688b/12315831/982266436558/UAAI_A_2519169_F0001_OC.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/688b/12315831/a878e50bd4e3/UAAI_A_2519169_F0002_B.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/688b/12315831/1cd837a364b1/UAAI_A_2519169_F0003_B.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/688b/12315831/53e1e83a5f0a/UAAI_A_2519169_F0004_OC.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/688b/12315831/7330ca11d2f5/UAAI_A_2519169_F0005_B.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/688b/12315831/855339bc9d39/UAAI_A_2519169_F0006_OC.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/688b/12315831/7fb336c53923/UAAI_A_2519169_F0007_OC.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/688b/12315831/170f5b95e6bb/UAAI_A_2519169_F0008_B.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/688b/12315831/982266436558/UAAI_A_2519169_F0001_OC.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/688b/12315831/a878e50bd4e3/UAAI_A_2519169_F0002_B.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/688b/12315831/1cd837a364b1/UAAI_A_2519169_F0003_B.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/688b/12315831/53e1e83a5f0a/UAAI_A_2519169_F0004_OC.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/688b/12315831/7330ca11d2f5/UAAI_A_2519169_F0005_B.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/688b/12315831/855339bc9d39/UAAI_A_2519169_F0006_OC.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/688b/12315831/7fb336c53923/UAAI_A_2519169_F0007_OC.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/688b/12315831/170f5b95e6bb/UAAI_A_2519169_F0008_B.jpg

相似文献

1
A Pipeline for Automating Emergency Medicine Documentation Using LLMs with Retrieval-Augmented Text Generation.一种使用带有检索增强文本生成功能的大语言模型来自动化急诊医学文档记录的流程。
Appl Artif Intell. 2025 Jun 18;39(1):2519169. doi: 10.1080/08839514.2025.2519169. eCollection 2025.
2
Assessing Retrieval-Augmented Large Language Model Performance in Emergency Department ICD-10-CM Coding Compared to Human Coders.与人工编码员相比,评估检索增强型大语言模型在急诊科ICD-10-CM编码中的性能。
medRxiv. 2024 Oct 17:2024.10.15.24315526. doi: 10.1101/2024.10.15.24315526.
3
Can open source large language models be used for tumor documentation in Germany?-An evaluation on urological doctors' notes.在德国,开源大语言模型可用于肿瘤记录吗?——对泌尿科医生笔记的评估
BioData Min. 2025 Jul 24;18(1):48. doi: 10.1186/s13040-025-00463-8.
4
From text to data: Open-source large language models in extracting cancer related medical attributes from German pathology reports.从文本到数据:用于从德语病理报告中提取癌症相关医学属性的开源大语言模型
Int J Med Inform. 2025 Nov;203:106022. doi: 10.1016/j.ijmedinf.2025.106022. Epub 2025 Jul 2.
5
Enhancing Pulmonary Disease Prediction Using Large Language Models With Feature Summarization and Hybrid Retrieval-Augmented Generation: Multicenter Methodological Study Based on Radiology Report.使用具有特征总结和混合检索增强生成功能的大语言模型增强肺部疾病预测:基于放射学报告的多中心方法学研究
J Med Internet Res. 2025 Jun 11;27:e72638. doi: 10.2196/72638.
6
Large Language Model Symptom Identification From Clinical Text: Multicenter Study.基于临床文本的大语言模型症状识别:多中心研究。
J Med Internet Res. 2025 Jul 31;27:e72984. doi: 10.2196/72984.
7
Predicting 30-Day Postoperative Mortality and American Society of Anesthesiologists Physical Status Using Retrieval-Augmented Large Language Models: Development and Validation Study.使用检索增强大语言模型预测术后30天死亡率和美国麻醉医师协会身体状况:开发与验证研究
J Med Internet Res. 2025 Jun 3;27:e75052. doi: 10.2196/75052.
8
Using Generative Artificial Intelligence in Health Economics and Outcomes Research: A Primer on Techniques and Breakthroughs.在卫生经济学与结果研究中使用生成式人工智能:技术与突破入门
Pharmacoecon Open. 2025 Apr 29. doi: 10.1007/s41669-025-00580-4.
9
A dataset and benchmark for hospital course summarization with adapted large language models.一个用于医院病程总结的数据集和基准测试,采用了适配的大语言模型。
J Am Med Inform Assoc. 2025 Mar 1;32(3):470-479. doi: 10.1093/jamia/ocae312.
10
A large language model based pipeline for extracting information from patient complaint and anamnesis in clinical notes for severity assessment.一种基于大语言模型的管道,用于从临床记录中的患者主诉和病史中提取信息以进行严重程度评估。
Sci Rep. 2025 Jul 14;15(1):25345. doi: 10.1038/s41598-025-07649-4.

本文引用的文献

1
Retrieval augmented generation for 10 large language models and its generalizability in assessing medical fitness.10种大语言模型的检索增强生成及其在评估医学适用性方面的通用性。
NPJ Digit Med. 2025 Apr 5;8(1):187. doi: 10.1038/s41746-025-01519-z.
2
Viability of Open Large Language Models for Clinical Documentation in German Health Care: Real-World Model Evaluation Study.德国医疗保健领域中开源大型语言模型用于临床文档记录的可行性:真实世界模型评估研究
JMIR Med Inform. 2024 Aug 28;12:e59617. doi: 10.2196/59617.
3
Optimizing Data Extraction: Harnessing RAG and LLMs for German Medical Documents.
优化数据提取:利用 RAG 和大型语言模型处理德语文献
Stud Health Technol Inform. 2024 Aug 22;316:949-950. doi: 10.3233/SHTI240567.
4
Applying generative AI with retrieval augmented generation to summarize and extract key clinical information from electronic health records.运用生成式人工智能与检索增强生成相结合,从电子健康记录中总结和提取关键临床信息。
J Biomed Inform. 2024 Aug;156:104662. doi: 10.1016/j.jbi.2024.104662. Epub 2024 Jun 14.
5
ChatDoctor: A Medical Chat Model Fine-Tuned on a Large Language Model Meta-AI (LLaMA) Using Medical Domain Knowledge.ChatDoctor:一种基于医学领域知识对大型语言模型Meta-AI(LLaMA)进行微调的医学聊天模型。
Cureus. 2023 Jun 24;15(6):e40895. doi: 10.7759/cureus.40895. eCollection 2023 Jun.
6
Automatic documentation of professional health interactions: A systematic review.职业健康互动的自动记录:一项系统综述。
Artif Intell Med. 2023 Mar;137:102487. doi: 10.1016/j.artmed.2023.102487. Epub 2023 Jan 19.
7
Complete and Resilient Documentation for Operational Medical Environments Leveraging Mobile Hands-free Technology in a Systems Approach: Experimental Study.在系统方法中利用移动免提技术实现操作性医疗环境的完整和有弹性的文档记录:实验研究。
JMIR Mhealth Uhealth. 2021 Oct 12;9(10):e32301. doi: 10.2196/32301.
8
A patient-centered digital scribe for automatic medical documentation.一种以患者为中心的用于自动医疗记录的数字抄写员。
JAMIA Open. 2021 Feb 17;4(1):ooab003. doi: 10.1093/jamiaopen/ooab003. eCollection 2021 Jan.
9
Med-BERT: pretrained contextualized embeddings on large-scale structured electronic health records for disease prediction.医学BERT:基于大规模结构化电子健康记录进行疾病预测的预训练上下文嵌入模型
NPJ Digit Med. 2021 May 20;4(1):86. doi: 10.1038/s41746-021-00455-y.
10
Feasibility Assessment of a Pre-Hospital Automated Sensing Clinical Documentation System.院前自动传感临床文档系统的可行性评估
AMIA Annu Symp Proc. 2020 Mar 4;2019:248-257. eCollection 2019.