
Patient-Friendly Discharge Summaries in Korea Based on ChatGPT: Software Development and Validation.

Affiliations

College of Nursing, Yonsei University, Seoul, Korea.

Department of Biomedical Systems Informatics, Yonsei University College of Medicine, Seoul, Korea.

Publication Information

J Korean Med Sci. 2024 Apr 29;39(16):e148. doi: 10.3346/jkms.2024.39.e148.

Abstract

BACKGROUND

Although discharge summaries in patient-friendly language can enhance patient comprehension and satisfaction, they can also increase medical staff workload. Using a large language model, we developed and validated software that generates a patient-friendly discharge summary.

METHODS

We developed and tested the software using 100 discharge summary documents, 50 for patients with myocardial infarction and 50 for patients treated in the Department of General Surgery. For each document, three new summaries were generated using three different prompting methods (Zero-shot, One-shot, and Few-shot) and graded using a 5-point Likert Scale regarding factuality, comprehensiveness, usability, ease, and fluency. We compared the effects of different prompting methods and assessed the relationship between input length and output quality.
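
The abstract does not reproduce the authors' prompts or model configuration, but a minimal sketch of how the three prompting conditions could be assembled against a chat-style LLM API is shown below. The model name ("gpt-4"), instruction wording, and placeholder examples are illustrative assumptions, not the study's actual materials.

```python
# Illustrative sketch only: the exact prompts, worked examples, and model used
# in the study are not reproduced here; "gpt-4" and the wording are assumptions.
from openai import OpenAI

client = OpenAI()  # assumes OPENAI_API_KEY is set in the environment

INSTRUCTION = (
    "Rewrite the following hospital discharge summary in plain, "
    "patient-friendly Korean, avoiding medical jargon."
)

# Hypothetical worked examples: the Few-shot condition would include several,
# the One-shot condition exactly one, and the Zero-shot condition none.
EXAMPLES = [
    {"source": "<original discharge summary 1>", "rewrite": "<patient-friendly version 1>"},
    {"source": "<original discharge summary 2>", "rewrite": "<patient-friendly version 2>"},
]

def build_messages(document: str, n_examples: int) -> list[dict]:
    """Assemble a chat prompt with 0 (Zero-shot), 1 (One-shot), or more (Few-shot) examples."""
    messages = [{"role": "system", "content": INSTRUCTION}]
    for ex in EXAMPLES[:n_examples]:
        messages.append({"role": "user", "content": ex["source"]})
        messages.append({"role": "assistant", "content": ex["rewrite"]})
    messages.append({"role": "user", "content": document})
    return messages

def summarize(document: str, n_examples: int = 2) -> str:
    """Generate a patient-friendly summary under the chosen prompting condition."""
    response = client.chat.completions.create(
        model="gpt-4",  # assumed model; the study's exact model and version may differ
        messages=build_messages(document, n_examples),
    )
    return response.choices[0].message.content
```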

RESULTS

The mean overall scores differed across prompting methods (4.19 ± 0.36 with Few-shot, 4.11 ± 0.36 with One-shot, and 3.73 ± 0.44 with Zero-shot prompts; P < 0.001). Post-hoc analysis indicated that scores were higher with Few-shot and One-shot prompts than with Zero-shot prompts, whereas there was no significant difference between Few-shot and One-shot prompts. The overall proportion of outputs that scored ≥ 4 was 77.0% (95% confidence interval [CI], 68.8-85.3%), 70.0% (95% CI, 61.0-79.0%), and 32.0% (95% CI, 22.9-41.1%) with Few-shot, One-shot, and Zero-shot prompts, respectively. The mean factuality score was 4.19 ± 0.60 with Few-shot, 4.20 ± 0.55 with One-shot, and 3.82 ± 0.57 with Zero-shot prompts. Input length and the overall score showed negative correlations in the Zero-shot (r = -0.437, P < 0.001) and One-shot (r = -0.327, P < 0.001) tests but not in the Few-shot (r = -0.050, P = 0.625) tests.
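
The reported confidence intervals for the ≥ 4 proportions are consistent with a normal-approximation (Wald) interval over the 100 documents per condition; the interval type is an assumption here, as the abstract does not state it. A quick check:

```python
# Quick check of the reported 95% CIs, assuming a normal-approximation (Wald)
# interval p ± 1.96 * sqrt(p * (1 - p) / n) with n = 100 documents per condition.
from math import sqrt

def wald_ci(p: float, n: int = 100, z: float = 1.96) -> tuple[float, float]:
    half_width = z * sqrt(p * (1 - p) / n)
    return p - half_width, p + half_width

for label, p in [("Few-shot", 0.77), ("One-shot", 0.70), ("Zero-shot", 0.32)]:
    lo, hi = wald_ci(p)
    print(f"{label}: {p:.0%} (95% CI {lo:.1%}-{hi:.1%})")
# Reproduces the reported intervals (68.8-85.3%, 61.0-79.0%, 22.9-41.1%)
# to within 0.1 percentage points of rounding.
```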

CONCLUSION

Large language models using Few-shot prompts generally produce acceptable discharge summaries without significant misinformation. Our research highlights the potential of such models in creating patient-friendly discharge summaries for Korean patients to support patient-centered care.


Figure: https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9a29/11058343/ace47cb2129f/jkms-39-e148-g001.jpg
