使用大语言模型的临床记录摘要系统的开发与评估

Development and evaluation of a clinical note summarization system using large language models.

作者信息

Oliveira Juliana Damasio, Santos Henrique D P, Ulbrich Ana Helena D P S, Couto Julia Colleoni, Arocha Marcelo, Santos Joaquim, Costa Manuela Martins, Faccio Daniela, Tabalipa Fabio O, Nogueira Rodrigo F

机构信息

Institute of A.I. in Healthcare, Porto Alegre, RS, Brazil.

Memed, Florianópolis, SC, Brazil.

出版信息

Commun Med (Lond). 2025 Aug 28;5(1):376. doi: 10.1038/s43856-025-01091-3.

DOI:10.1038/s43856-025-01091-3

PMID:40877595

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC12394402/

Abstract

BACKGROUND

Clinical notes are a vital and detailed source of information about patient hospitalizations. However, the sheer volume and complexity of these notes make evaluation and summarization challenging. Nonetheless, summarizing clinical notes is essential for accurate and efficient clinical decision-making in patient care. Generative language models, particularly large language models such as GPT-4, offer a promising solution by creating coherent, contextually relevant text based on patterns learned from large datasets.

METHODS

This study describes the development of a discharge summary system using large language models. By conducting an online survey and interviews, we gather feedback from end users, including physicians and patients, to ensure the system meets their practical needs and fits their experiences. Additionally, we develop a rating system to evaluate prompt effectiveness by comparing model-generated outputs with human assessments, which serve as benchmarks to evaluate the performance of the automated model.

RESULTS

Here we show that the model's ability to interpret diagnoses borders on humanlevel accuracy, demonstrating its potential to assist healthcare professionals in routine tasks such as generating discharge summaries.

CONCLUSIONS

This advancement underscores the potential of large language models in clinical settings and opens up possibilities for broader applications in healthcare documentation and decision-making support.

摘要

背景

临床记录是患者住院信息的重要且详细的来源。然而，这些记录的数量庞大且复杂，使得评估和总结具有挑战性。尽管如此，总结临床记录对于患者护理中准确高效的临床决策至关重要。生成式语言模型，特别是像GPT-4这样的大型语言模型，通过基于从大型数据集中学习到的模式创建连贯、上下文相关的文本，提供了一个有前景的解决方案。

方法

本研究描述了使用大型语言模型开发出院小结系统的过程。通过开展在线调查和访谈，我们收集了包括医生和患者在内的终端用户的反馈，以确保该系统满足他们的实际需求并符合他们的体验。此外，我们开发了一个评分系统，通过将模型生成的输出与人工评估进行比较来评估提示有效性，人工评估作为评估自动化模型性能的基准。

结果

我们在此表明，该模型解释诊断的能力接近人类水平的准确性，证明了其在诸如生成出院小结等日常任务中协助医疗保健专业人员的潜力。

结论

这一进展凸显了大型语言模型在临床环境中的潜力，并为医疗文档和决策支持中的更广泛应用开辟了可能性。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/57f5/12394402/932f97130d45/43856_2025_1091_Fig1_HTML.jpg

相似文献

Development and evaluation of a clinical note summarization system using large language models.使用大语言模型的临床记录摘要系统的开发与评估

Commun Med (Lond). 2025 Aug 28;5(1):376. doi: 10.1038/s43856-025-01091-3.

Prescription of Controlled Substances: Benefits and Risks管制药品的处方：益处与风险

Improving Large Language Models' Summarization Accuracy by Adding Highlights to Discharge Notes: Comparative Evaluation.通过在出院小结中添加重点内容提高大语言模型的总结准确性：比较评估

JMIR Med Inform. 2025 Jul 24;13:e66476. doi: 10.2196/66476.

Healthcare workers' informal uses of mobile phones and other mobile devices to support their work: a qualitative evidence synthesis.医护人员非正规使用手机和其他移动设备来支持工作：定性证据综合评价。

Cochrane Database Syst Rev. 2024 Aug 27;8(8):CD015705. doi: 10.1002/14651858.CD015705.pub2.

Sexual Harassment and Prevention Training性骚扰与预防培训

Interventions to improve safe and effective medicines use by consumers: an overview of systematic reviews.改善消费者安全有效用药的干预措施：系统评价概述

Cochrane Database Syst Rev. 2014 Apr 29;2014(4):CD007768. doi: 10.1002/14651858.CD007768.pub3.

Patient buy-in to social prescribing through link workers as part of person-centred care: a realist evaluation.患者通过联络人员接受社会处方作为以患者为中心的护理的一部分：一项现实主义评价。

Health Soc Care Deliv Res. 2024 Sep 25:1-17. doi: 10.3310/ETND8254.

Survivor, family and professional experiences of psychosocial interventions for sexual abuse and violence: a qualitative evidence synthesis.性虐待和暴力的心理社会干预的幸存者、家庭和专业人员的经验：定性证据综合。

Cochrane Database Syst Rev. 2022 Oct 4;10(10):CD013648. doi: 10.1002/14651858.CD013648.pub2.

Adapting Safety Plans for Autistic Adults with Involvement from the Autism Community.在自闭症群体的参与下为成年自闭症患者调整安全计划。

Autism Adulthood. 2025 May 28;7(3):293-302. doi: 10.1089/aut.2023.0124. eCollection 2025 Jun.

Signs and symptoms to determine if a patient presenting in primary care or hospital outpatient settings has COVID-19.在基层医疗机构或医院门诊环境中，如果患者出现以下症状和体征，可判断其是否患有 COVID-19。

Cochrane Database Syst Rev. 2022 May 20;5(5):CD013665. doi: 10.1002/14651858.CD013665.pub3.

本文引用的文献

Evaluating LLMs for Diagnosis Summarization.评估用于诊断总结的语言模型。

Annu Int Conf IEEE Eng Med Biol Soc. 2024 Jul;2024:1-7. doi: 10.1109/EMBC53108.2024.10782231.

Generative artificial intelligence in primary care: an online survey of UK general practitioners.初级保健中的生成式人工智能：英国全科医生的在线调查。

BMJ Health Care Inform. 2024 Sep 17;31(1):e101102. doi: 10.1136/bmjhci-2024-101102.

Generative Artificial Intelligence to Transform Inpatient Discharge Summaries to Patient-Friendly Language and Format.生成式人工智能将住院病历摘要转换为患者友好型语言和格式。

JAMA Netw Open. 2024 Mar 4;7(3):e240357. doi: 10.1001/jamanetworkopen.2024.0357.

Primary Care Physicians' Perspectives on High-Quality Discharge Summaries.基层医疗医生对高质量出院小结的看法。

J Gen Intern Med. 2024 Jun;39(8):1438-1443. doi: 10.1007/s11606-023-08541-5. Epub 2023 Nov 27.

Writing a high-quality discharge summary through structured training and assessment.

Med Educ. 2023 Aug;57(8):773-774. doi: 10.1111/medu.15102. Epub 2023 Apr 22.

Benefits, Limits, and Risks of GPT-4 as an AI Chatbot for Medicine.GPT-4作为医学人工智能聊天机器人的益处、局限性和风险

N Engl J Med. 2023 Mar 30;388(13):1233-1239. doi: 10.1056/NEJMsr2214184.

ChatGPT: the future of discharge summaries?ChatGPT：出院小结的未来？

Lancet Digit Health. 2023 Mar;5(3):e107-e108. doi: 10.1016/S2589-7500(23)00021-3. Epub 2023 Feb 6.

A Coding Framework for Usability Evaluation of Digital Health Technologies.数字健康技术可用性评估的编码框架

Hum Comput Interact Theor Approaches Des Method (2022). 2022 Jun-Jul;13302:185-196. doi: 10.1007/978-3-031-05311-5_12. Epub 2022 Jun 16.

[Quality of discharge summary for patients with limited life expectancy].[预期寿命有限患者的出院小结质量]

Ned Tijdschr Geneeskd. 2022 Jul 14;166:D6575.

Discharging patients from acute care hospitals.急性护理医院的患者出院。

Nurs Stand. 2016 Feb 10;30(24):49-57; quiz 60. doi: 10.7748/ns.30.24.49.s47.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验

使用大语言模型的临床记录摘要系统的开发与评估

Development and evaluation of a clinical note summarization system using large language models.

作者信息

机构信息

出版信息

BACKGROUND

METHODS

RESULTS

CONCLUSIONS

背景

方法

结果

结论

相似文献

本文引用的文献

文献检索

文件翻译

深度研究

Suppr 超能文献

相似文献

本文引用的文献