• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

运用生成式人工智能与检索增强生成相结合,从电子健康记录中总结和提取关键临床信息。

Applying generative AI with retrieval augmented generation to summarize and extract key clinical information from electronic health records.

机构信息

School of Computing and Information Technology, University of Wollongong, Wollongong, NSW 2522, Australia; School of Computer Science, Qassim University, Qassim 51452, Saudi Arabia.

School of Computing and Information Technology, University of Wollongong, Wollongong, NSW 2522, Australia.

出版信息

J Biomed Inform. 2024 Aug;156:104662. doi: 10.1016/j.jbi.2024.104662. Epub 2024 Jun 14.

DOI:10.1016/j.jbi.2024.104662
PMID:38880236
Abstract

BACKGROUND

Malnutrition is a prevalent issue in aged care facilities (RACFs), leading to adverse health outcomes. The ability to efficiently extract key clinical information from a large volume of data in electronic health records (EHR) can improve understanding about the extent of the problem and developing effective interventions. This research aimed to test the efficacy of zero-shot prompt engineering applied to generative artificial intelligence (AI) models on their own and in combination with retrieval augmented generation (RAG), for the automating tasks of summarizing both structured and unstructured data in EHR and extracting important malnutrition information.

METHODOLOGY

We utilized Llama 2 13B model with zero-shot prompting. The dataset comprises unstructured and structured EHRs related to malnutrition management in 40 Australian RACFs. We employed zero-shot learning to the model alone first, then combined it with RAG to accomplish two tasks: generate structured summaries about the nutritional status of a client and extract key information about malnutrition risk factors. We utilized 25 notes in the first task and 1,399 in the second task. We evaluated the model's output of each task manually against a gold standard dataset.

RESULT

The evaluation outcomes indicated that zero-shot learning applied to generative AI model is highly effective in summarizing and extracting information about nutritional status of RACFs' clients. The generated summaries provided concise and accurate representation of the original data with an overall accuracy of 93.25%. The addition of RAG improved the summarization process, leading to a 6% increase and achieving an accuracy of 99.25%. The model also proved its capability in extracting risk factors with an accuracy of 90%. However, adding RAG did not further improve accuracy in this task. Overall, the model has shown a robust performance when information was explicitly stated in the notes; however, it could encounter hallucination limitations, particularly when details were not explicitly provided.

CONCLUSION

This study demonstrates the high performance and limitations of applying zero-shot learning to generative AI models to automatic generation of structured summarization of EHRs data and extracting key clinical information. The inclusion of the RAG approach improved the model performance and mitigated the hallucination problem.

摘要

背景

营养不良是养老院(RACF)中普遍存在的问题,导致不良健康后果。能够从电子健康记录(EHR)中的大量数据中高效提取关键临床信息,可以提高对问题严重程度的认识,并开发有效的干预措施。本研究旨在测试零样本提示工程在生成人工智能(AI)模型中的功效,这些模型单独使用以及与检索增强生成(RAG)相结合,可用于自动化 EHR 中结构化和非结构化数据的摘要以及提取重要营养信息。

方法

我们使用 Llama 2 13B 模型进行零样本提示。该数据集包含与澳大利亚 40 家 RACF 中营养不良管理相关的非结构化和结构化 EHR。我们首先对模型进行零样本学习,然后将其与 RAG 相结合,完成两项任务:生成关于客户营养状况的结构化摘要和提取营养风险因素的关键信息。我们在第一项任务中使用了 25 条笔记,在第二项任务中使用了 1399 条笔记。我们根据黄金标准数据集手动评估模型在每个任务中的输出。

结果

评估结果表明,零样本学习应用于生成式 AI 模型,在总结和提取 RACF 客户营养状况信息方面非常有效。生成的摘要简洁准确地表示了原始数据,整体准确率为 93.25%。添加 RAG 改善了摘要过程,准确率提高了 6%,达到 99.25%。该模型在提取风险因素方面也表现出很高的准确率,达到 90%。然而,在这项任务中添加 RAG 并没有进一步提高准确率。总体而言,该模型在笔记中明确说明信息时表现出强大的性能;然而,它可能会遇到幻觉限制,尤其是在没有明确提供细节的情况下。

结论

本研究证明了将零样本学习应用于生成式 AI 模型,以自动生成 EHR 数据的结构化摘要和提取关键临床信息的高效性和局限性。包括 RAG 方法提高了模型性能并减轻了幻觉问题。

相似文献

1
Applying generative AI with retrieval augmented generation to summarize and extract key clinical information from electronic health records.运用生成式人工智能与检索增强生成相结合,从电子健康记录中总结和提取关键临床信息。
J Biomed Inform. 2024 Aug;156:104662. doi: 10.1016/j.jbi.2024.104662. Epub 2024 Jun 14.
2
A large language model-based generative natural language processing framework fine-tuned on clinical notes accurately extracts headache frequency from electronic health records.基于大型语言模型的生成式自然语言处理框架,在临床笔记上进行了微调,能够从电子健康记录中准确提取头痛频率。
Headache. 2024 Apr;64(4):400-409. doi: 10.1111/head.14702. Epub 2024 Mar 25.
3
Extraction of Substance Use Information From Clinical Notes: Generative Pretrained Transformer-Based Investigation.从临床记录中提取物质使用信息:基于生成式预训练变换器的研究
JMIR Med Inform. 2024 Aug 19;12:e56243. doi: 10.2196/56243.
4
Developing Artificial Intelligence Models for Extracting Oncologic Outcomes from Japanese Electronic Health Records.开发人工智能模型,从日本电子健康记录中提取肿瘤学结局。
Adv Ther. 2023 Mar;40(3):934-950. doi: 10.1007/s12325-022-02397-7. Epub 2022 Dec 22.
5
Retrieval-Augmented Generation for Extracting CHADS-VASc Risk Factors from Unstructured Clinical Notes in Patients with Atrial Fibrillation.用于从心房颤动患者的非结构化临床记录中提取CHADS-VASc风险因素的检索增强生成技术
medRxiv. 2024 Sep 22:2024.09.19.24313992. doi: 10.1101/2024.09.19.24313992.
6
Empowering personalized pharmacogenomics with generative AI solutions.利用生成式人工智能解决方案增强个性化药物基因组学。
J Am Med Inform Assoc. 2024 May 20;31(6):1356-1366. doi: 10.1093/jamia/ocae039.
7
Malnutrition and its contributing factors for older people living in residential aged care facilities: Insights from natural language processing of aged care records.老年人营养不良及其在养老院居住的相关因素:基于养老院记录自然语言处理的分析。
Technol Health Care. 2023;31(6):2267-2278. doi: 10.3233/THC-230229.
8
Evaluating the accuracy of a state-of-the-art large language model for prediction of admissions from the emergency room.评估最先进的大型语言模型在预测急诊入院方面的准确性。
J Am Med Inform Assoc. 2024 Sep 1;31(9):1921-1928. doi: 10.1093/jamia/ocae103.
9
Using ChatGPT-4 to Create Structured Medical Notes From Audio Recordings of Physician-Patient Encounters: Comparative Study.利用 ChatGPT-4 从医患对话的音频记录中创建结构化的医疗记录:比较研究。
J Med Internet Res. 2024 Apr 22;26:e54419. doi: 10.2196/54419.
10
Aligning Large Language Models for Enhancing Psychiatric Interviews Through Symptom Delineation and Summarization: Pilot Study.通过症状描述和总结调整大型语言模型以增强精神病学访谈:初步研究。
JMIR Form Res. 2024 Oct 24;8:e58418. doi: 10.2196/58418.

引用本文的文献

1
Advancing Question-Answering in Ophthalmology With Retrieval-Augmented Generation: Benchmarking Open-Source and Proprietary Large Language Models.通过检索增强生成推进眼科问答:对开源和专有大语言模型进行基准测试
Transl Vis Sci Technol. 2025 Sep 2;14(9):18. doi: 10.1167/tvst.14.9.18.
2
Social Listening as a Tool to Understand Nutrition-Related Information Needs: A Case Study in Inflammatory Bowel Disease.社交倾听作为了解营养相关信息需求的工具:炎症性肠病的案例研究
J Hum Nutr Diet. 2025 Oct;38(5):e70116. doi: 10.1111/jhn.70116.
3
Large language models in clinical nutrition: an overview of its applications, capabilities, limitations, and potential future prospects.
临床营养中的大语言模型:其应用、能力、局限性及潜在未来前景概述
Front Nutr. 2025 Aug 7;12:1635682. doi: 10.3389/fnut.2025.1635682. eCollection 2025.
4
The role of generative AI tools in case-based learning and teaching evaluation of medical biochemistry.生成式人工智能工具在医学生物化学基于案例的学习与教学评估中的作用
BMC Med Educ. 2025 Aug 22;25(1):1185. doi: 10.1186/s12909-025-07567-z.
5
A Pipeline for Automating Emergency Medicine Documentation Using LLMs with Retrieval-Augmented Text Generation.一种使用带有检索增强文本生成功能的大语言模型来自动化急诊医学文档记录的流程。
Appl Artif Intell. 2025 Jun 18;39(1):2519169. doi: 10.1080/08839514.2025.2519169. eCollection 2025.
6
Comparing artificial intelligence- vs clinician-authored summaries of simulated primary care electronic health records.比较人工智能撰写的与临床医生撰写的模拟初级保健电子健康记录摘要。
JAMIA Open. 2025 Jul 30;8(4):ooaf082. doi: 10.1093/jamiaopen/ooaf082. eCollection 2025 Aug.
7
A scoping review of natural language processing in addressing medically inaccurate information: Errors, misinformation, and hallucination.关于自然语言处理在处理医学错误信息方面的范围综述:错误、错误信息和幻觉。
J Biomed Inform. 2025 Jul 22:104866. doi: 10.1016/j.jbi.2025.104866.
8
Language Models for Multilabel Document Classification of Surgical Concepts in Exploratory Laparotomy Operative Notes: Algorithm Development Study.用于探索性剖腹手术记录中手术概念多标签文档分类的语言模型:算法开发研究
JMIR Med Inform. 2025 Jul 9;13:e71176. doi: 10.2196/71176.
9
The PERFORM Study: Artificial Intelligence Versus Human Residents in Cross-Sectional Obstetrics-Gynecology Scenarios Across Languages and Time Constraints.PERFORM研究:跨语言和时间限制的妇产科横断面场景中人工智能与住院医师的比较
Mayo Clin Proc Digit Health. 2025 Mar 8;3(2):100206. doi: 10.1016/j.mcpdig.2025.100206. eCollection 2025 Jun.
10
Semantic Search of FDA Guidance Documents Using Generative AI.使用生成式人工智能对美国食品药品监督管理局指南文件进行语义搜索。
Ther Innov Regul Sci. 2025 Jun 14. doi: 10.1007/s43441-025-00798-8.