Evaluation of a large language model to simplify discharge summaries and provide cardiological lifestyle recommendations.

Authors

Rust Paul, Frings Julian, Meister Sven, Fehring Leonard

Affiliations

Faculty of Health, School of Medicine, Witten/Herdecke University, Alfred-Herrhausen-Strasse 50, 58455, Witten, Germany.

Health Care Informatics, Faculty of Health, School of Medicine, Witten/Herdecke University, Pferdebachstrasse 11, 58455, Witten, Germany.

Publication information

Commun Med (Lond). 2025 May 29;5(1):208. doi: 10.1038/s43856-025-00927-2.

DOI:10.1038/s43856-025-00927-2
PMID:40442348
Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC12122782/

Abstract

BACKGROUND

Hospital discharge summaries are essential for the continuity of care. However, medical jargon, abbreviations, and technical language often make them too complex for patients to understand, and they frequently omit lifestyle recommendations important for self-management. This study explored using a large language model (LLM) to enhance discharge summary readability and augment it with lifestyle recommendations.

METHODS

We collected 20 anonymized cardiology discharge summaries. GPT-4o was prompted using full-text and segment-wise approaches to simplify each summary and generate lifestyle recommendations. Readability was measured via three standardized metrics (modified Flesch-Reading-Ease, Vienna Non-fiction Text Formula, Lesbarkeitsindex), and multiple quality dimensions were evaluated by 12 medical experts.
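
One of the three metrics named above, the Lesbarkeitsindex (LIX), has a simple closed form: average sentence length plus the percentage of words longer than six letters. The abstract does not say how the authors tokenized text or which tool they used, so the following is only a minimal sketch of the standard LIX definition. The function name `lix`, the regex-based sentence and word splitting, and the sample sentences are assumptions made for illustration.

```python
import re


def lix(text: str) -> float:
    """Return the LIX readability score of `text` (higher = harder to read).

    LIX = (words / sentences) + 100 * (long words / words),
    where a long word has more than six letters.
    Assumption: sentences end at '.', '!' or '?'; words are runs of letters,
    so hyphenated compounds are counted as separate words.
    """
    sentences = [s for s in re.split(r"[.!?]+", text) if s.strip()]
    words = re.findall(r"[^\W\d_]+", text)  # Unicode letters only
    if not sentences or not words:
        raise ValueError("text must contain at least one sentence and one word")
    long_words = [w for w in words if len(w) > 6]
    return len(words) / len(sentences) + 100 * len(long_words) / len(words)


if __name__ == "__main__":
    # Hypothetical German sample, not taken from the study data.
    sample = "Der Patient hatte einen Herzinfarkt. Er soll sich schonen."
    print(f"LIX = {lix(sample):.1f}")
```

Comparing an original summary with its simplified version would then amount to computing `lix` for both texts and checking that the simplified score is lower.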

RESULTS

LLM-generated summaries from both prompting approaches are significantly more readable compared to the original summaries across all metrics (p < 0.0001). Based on 60 expert ratings for the full-text approach and 60 for the segment-wise approach, experts '(strongly) agree' that LLM-summaries are correct (full-text: 85%; segment-wise: 80%), complete (78%; 92%), harmless (83%; 88%), and comprehensible for patients (88%; 97%). Experts '(strongly) agree' that LLM-generated recommendations are relevant in 92%, evidence-based in 88%, personalized in 70%, complete in 88%, consistent in 93%, and harmless in 88% of 60 ratings.
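
The abstract reports agreement as percentages of 60 ratings and gives no raw counts. As a rough consistency check, the sketch below converts each reported percentage to the nearest whole number of ratings and confirms that the count rounds back to the same percentage. The counts are implied by arithmetic, not reported by the authors, and the helper names `implied_count` and `rounds_back` are hypothetical.

```python
N_RATINGS = 60  # ratings per approach, as stated in the abstract


def implied_count(percent: int, n: int = N_RATINGS) -> int:
    """Nearest whole number of ratings matching a reported percentage."""
    return (percent * n + 50) // 100  # integer round-half-up


def rounds_back(percent: int, n: int = N_RATINGS) -> bool:
    """True if the implied count reproduces the reported percentage."""
    return round(100 * implied_count(percent, n) / n) == percent


if __name__ == "__main__":
    # Percentages quoted in the abstract (full-text, segment-wise).
    summary_ratings = {
        "correct": (85, 80),
        "complete": (78, 92),
        "harmless": (83, 88),
        "comprehensible": (88, 97),
    }
    for dimension, (full_text, segment_wise) in summary_ratings.items():
        print(
            f"{dimension}: full-text ~{implied_count(full_text)}/60, "
            f"segment-wise ~{implied_count(segment_wise)}/60"
        )
```

Every percentage in the abstract passes this check, which is consistent with each figure being a whole-number share of 60 ratings.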

CONCLUSIONS

LLM-generated summaries achieve a 10th-grade readability level and high-quality ratings. While LLM-generated lifestyle recommendations are generally of high quality, personalization is limited. These findings suggest that LLMs could help create more patient-centric discharge summaries. Further research is needed to confirm clinical utility and address quality assurance, regulatory compliance, and clinical integration challenges.
