Suppr超能文献

通过提示调整使用大语言模型对医患会诊对话进行自动摘要

Automatic Summarization of Doctor-Patient Encounter Dialogues Using Large Language Model through Prompt Tuning.

作者信息

Lyu Mengxian, Peng Cheng, Li Xiaohan, Balian Patrick, Bian Jiang, Wu Yonghui

机构信息

Department of Health Outcomes and Biomedical Informatics, College of Medicine, University of Florida, Gainesville, FL, USA.

Cancer Informatics Shared Resource, University of Florida Health Cancer Center.

出版信息

AMIA Jt Summits Transl Sci Proc. 2025 Jun 10;2025:342-349. eCollection 2025.

Abstract

Automatic text summarization (ATS) is an emerging technology to assist clinicians in providing continuous and coordinated care. This study presents an approach to summarize doctor-patient dialogues using generative large language models (LLMs). We developed prompt-tuning algorithms to instruct generative LLMs to summarize clinical text. We examined the prompt-tuning strategies, the size of soft prompts, and the few-short learning ability of GatorTronGPT, a generative clinical LLM developed using 277 billion clinical and general English words with up to 20 billion parameters. We compared GatorTronGPT with a previous solution based on fine-tuning of a widely used T5 model, using a clinical benchmark dataset MTS-DIALOG. The experimental results show that the GatorTronGPT-20B model achieved the best performance on all evaluation metrics. The proposed solution has a low computing cost as the LLM parameters are not updated during prompt-tuning. This study demonstrates the efficiency of generative clinical LLMs for clinical ATS through prompt tuning.

摘要

自动文本摘要(ATS)是一项新兴技术,旨在协助临床医生提供持续且协调的护理。本研究提出了一种使用生成式大语言模型(LLMs)来总结医患对话的方法。我们开发了提示调整算法,以指导生成式大语言模型总结临床文本。我们研究了提示调整策略、软提示的大小以及GatorTronGPT的少样本学习能力,GatorTronGPT是一个使用2770亿个临床和通用英语单词开发的生成式临床大语言模型,参数多达200亿个。我们使用临床基准数据集MTS-DIALOG,将GatorTronGPT与基于广泛使用的T5模型微调的先前解决方案进行了比较。实验结果表明,GatorTronGPT-20B模型在所有评估指标上均取得了最佳性能。由于在提示调整过程中不更新大语言模型参数,因此所提出的解决方案计算成本较低。本研究通过提示调整证明了生成式临床大语言模型在临床自动文本摘要方面的效率。

相似文献

本文引用的文献

5
A large language model for electronic health records.用于电子健康记录的大型语言模型。
NPJ Digit Med. 2022 Dec 26;5(1):194. doi: 10.1038/s41746-022-00742-2.
8
Summarization of clinical information: a conceptual model.临床信息总结:概念模型。
J Biomed Inform. 2011 Aug;44(4):688-99. doi: 10.1016/j.jbi.2011.03.008. Epub 2011 Mar 31.
9
Use of electronic clinical documentation: time spent and team interactions.电子临床文档的使用:时间消耗和团队交互。
J Am Med Inform Assoc. 2011 Mar-Apr;18(2):112-7. doi: 10.1136/jamia.2010.008441. Epub 2011 Feb 2.

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验