De Rouck Ruben, Wille Evy, Gilbert Allison, Vermeersch Nick
AZ Sint Maria Halle, Ziekenhuislaan 100, Halle, 1500, Belgium.
Research Group on Emergency and Disaster Medicine, Vrije Universiteit Brussel, Laarbeeklaan 103, Brussels, 1090, Belgium.
Int J Emerg Med. 2025 Apr 25;18(1):85. doi: 10.1186/s12245-025-00885-5.
Effective patient discharge information (PDI) in emergency departments (EDs) is vital and often more crucial than the diagnosis itself. Patients who are well informed at discharge tend to be more satisfied and experience better health outcomes. The combination of written and verbal instructions tends to improve patient recall. However, creating written discharge materials is both time-consuming and costly. With the emergence of generative artificial intelligence (AI) and large language models (LMMs), there is potential for the efficient production of patient discharge documents. This study aimed to investigate several predefined key performance indicators (KPIs) of AI-generated patient discharge information.
This study focused on three significant patients' complaints in the ED: nonspecific abdominal pain, nonspecific low back pain, and fever in children. To generate the brochures, we used an English query for ChatGPT using the GPT-4 LLM and DeepL software to translate the brochures to Dutch. Five KPIs were defined to assess these PDI brochures: quality, accessibility, clarity, correctness and usability. The brochures were evaluated for each KPI by 8 experienced emergency physicians using a rating scale from 1 (very poor) to 10 (excellent). To quantify the readability of the brochures, frequently used indices were employed: the Flesch Reading Ease, Flesch-Kincaid Grade Level, Simple Measure of Gobbledygook, and Coleman-Liau Index on the translated text.
The brochures generated by ChatGPT/GPT-4 were well received, scoring an average of 7 to 8 out of 10 across all evaluated aspects. However, the results also indicated a need for some revisions to perfect these documents. Readability analysis indicated that brochures require high school- to college-level comprehension, but this is likely an overestimation due to context-specific reasons as well as features inherent to the Dutch language.
Our findings indicate that AI tools such as LLM could represent a new opportunity to quickly produce patient discharge information brochures. However, human review and editing are essential to ensure accurate and reliable information. A follow-up study with more topics and validation in the intended population is necessary to assess their performance.
急诊科有效的患者出院信息(PDI)至关重要,往往比诊断本身更为关键。出院时得到充分信息的患者往往更满意,健康结局也更好。书面和口头指导相结合有助于提高患者的记忆效果。然而,编写出院材料既耗时又昂贵。随着生成式人工智能(AI)和大语言模型(LMMs)的出现,高效生成患者出院文件成为可能。本研究旨在调查人工智能生成的患者出院信息的几个预定义关键绩效指标(KPI)。
本研究聚焦于急诊科患者的三大主要诉求:非特异性腹痛、非特异性腰痛和儿童发热。为生成宣传册,我们使用英语查询ChatGPT,借助GPT-4大语言模型,并使用DeepL软件将宣传册翻译成荷兰语。定义了五个关键绩效指标来评估这些患者出院信息宣传册:质量、可及性、清晰度、正确性和可用性。8名经验丰富的急诊科医生使用从1(非常差)到10(优秀)的评分量表对每个关键绩效指标的宣传册进行评估。为量化宣传册的可读性,采用了常用指标:弗莱什易读性、弗莱什-金凯德年级水平、简单费解度测量和翻译文本的科尔曼-廖指数。
ChatGPT/GPT-4生成的宣传册受到好评,在所有评估方面的平均得分在7到8分之间。然而,结果也表明需要进行一些修订以完善这些文件。可读性分析表明,宣传册需要高中到大学水平的理解能力,但由于特定背景原因以及荷兰语本身的特点,这可能存在高估。
我们的研究结果表明,大语言模型等人工智能工具可能为快速生成患者出院信息宣传册带来新机遇。然而,人工审核和编辑对于确保信息准确可靠至关重要。有必要开展一项涉及更多主题并在目标人群中进行验证的后续研究,以评估它们的性能。