• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

研究先进的大语言模型在生成患者指导和患者教育材料方面的能力。

Investigating the capabilities of advanced large language models in generating patient instructions and patient educational material.

作者信息

Sridharan Kannan, Sivaramakrishnan Gowri

机构信息

Pharmacology & Therapeutics, College of Medicine & Health Sciences, Arabian Gulf University, Manama, Manama, Bahrain

Bahrain Defence Force Royal Medical Services, Riffa, Bahrain.

出版信息

Eur J Hosp Pharm. 2024 Dec 30. doi: 10.1136/ejhpharm-2024-004245.

DOI:10.1136/ejhpharm-2024-004245
PMID:39393839
Abstract

OBJECTIVES

Large language models (LLMs) with advanced language generation capabilities have the potential to enhance patient interactions. This study evaluates the effectiveness of ChatGPT 4.0 and Gemini 1.0 Pro in providing patient instructions and creating patient educational material (PEM).

METHODS

A cross-sectional study employed ChatGPT 4.0 and Gemini 1.0 Pro across six medical scenarios using simple and detailed prompts. The Patient Education Materials Assessment Tool for Print materials (PEMAT-P) evaluated the understandability, actionability, and readability of the outputs.

RESULTS

LLMs provided consistent responses, especially regarding drug information, therapeutic goals, administration, common side effects, and interactions. However, they lacked guidance on expiration dates and proper medication disposal. Detailed prompts yielded comprehensible outputs for the average adult. ChatGPT 4.0 had mean understandability and actionability scores of 80% and 60%, respectively, compared with 67% and 60% for Gemini 1.0 Pro. ChatGPT 4.0 produced longer outputs, achieving 85% readability with detailed prompts, while Gemini 1.0 Pro maintained consistent readability. Simple prompts resulted in ChatGPT 4.0 outputs at a 10th-grade reading level, while Gemini 1.0 Pro outputs were at a 7th-grade level. Both LLMs produced outputs at a 6th-grade level with detailed prompts.

CONCLUSION

LLMs show promise in generating patient instructions and PEM. However, healthcare professional oversight and patient education on LLM use are essential for effective implementation.

摘要

目的

具有先进语言生成能力的大语言模型(LLMs)有潜力改善医患互动。本研究评估了ChatGPT 4.0和Gemini 1.0 Pro在提供患者指导和创建患者教育材料(PEM)方面的有效性。

方法

一项横断面研究在六种医疗场景中使用简单和详细的提示词对ChatGPT 4.0和Gemini 1.0 Pro进行了测试。用于印刷材料的患者教育材料评估工具(PEMAT-P)评估了输出内容的可理解性、可操作性和可读性。

结果

大语言模型给出了一致的回答,特别是在药物信息、治疗目标、给药方式、常见副作用和相互作用方面。然而,它们在有效期和正确的药物处理方面缺乏指导。详细的提示词为普通成年人产生了可理解的输出。ChatGPT 4.0的平均可理解性和可操作性得分分别为80%和60%,而Gemini 1.0 Pro的这两项得分分别为67%和60%。ChatGPT 4.0生成的输出更长,在使用详细提示词时可读性达到85%,而Gemini 1.0 Pro的可读性保持一致。简单提示词使ChatGPT 4.0的输出达到十年级阅读水平,而Gemini 1.0 Pro的输出为七年级水平。在使用详细提示词时,两个大语言模型的输出均为六年级水平。

结论

大语言模型在生成患者指导和患者教育材料方面显示出前景。然而,医疗保健专业人员的监督以及对大语言模型使用的患者教育对于有效实施至关重要。

相似文献

1
Investigating the capabilities of advanced large language models in generating patient instructions and patient educational material.研究先进的大语言模型在生成患者指导和患者教育材料方面的能力。
Eur J Hosp Pharm. 2024 Dec 30. doi: 10.1136/ejhpharm-2024-004245.
2
Assessing the quality and readability of patient education materials on chemotherapy cardiotoxicity from artificial intelligence chatbots: An observational cross-sectional study.评估人工智能聊天机器人提供的关于化疗心脏毒性的患者教育材料的质量和可读性:一项观察性横断面研究。
Medicine (Baltimore). 2025 Apr 11;104(15):e42135. doi: 10.1097/MD.0000000000042135.
3
Leveraging large language models to improve patient education on dry eye disease.利用大语言模型改善干眼症患者教育。
Eye (Lond). 2025 Apr;39(6):1115-1122. doi: 10.1038/s41433-024-03476-5. Epub 2024 Dec 16.
4
Large Language Models: Pioneering New Educational Frontiers in Childhood Myopia.大语言模型:开创儿童近视教育新前沿
Ophthalmol Ther. 2025 Jun;14(6):1281-1295. doi: 10.1007/s40123-025-01142-x. Epub 2025 Apr 21.
5
Leveraging artificial intelligence chatbots for anemia prevention: A comparative study of ChatGPT-3.5, copilot, and Gemini outputs against Google Search results.利用人工智能聊天机器人预防贫血:ChatGPT-3.5、Copilot和Gemini输出与谷歌搜索结果的对比研究。
PEC Innov. 2025 Apr 1;6:100390. doi: 10.1016/j.pecinn.2025.100390. eCollection 2025 Jun.
6
Assessing the Application of Large Language Models in Generating Dermatologic Patient Education Materials According to Reading Level: Qualitative Study.评估大语言模型在根据阅读水平生成皮肤科患者教育材料方面的应用:定性研究。
JMIR Dermatol. 2024 May 16;7:e55898. doi: 10.2196/55898.
7
Using Large Language Models to Generate Educational Materials on Childhood Glaucoma.利用大语言模型生成儿童青光眼教育材料。
Am J Ophthalmol. 2024 Sep;265:28-38. doi: 10.1016/j.ajo.2024.04.004. Epub 2024 Apr 16.
8
Enhancing readability of USFDA patient communications through large language models: a proof-of-concept study.通过大型语言模型提高美国 FDA 患者通讯的可读性:概念验证研究。
Expert Rev Clin Pharmacol. 2024 Aug;17(8):731-741. doi: 10.1080/17512433.2024.2363840. Epub 2024 Jun 4.
9
Large language models: a new frontier in paediatric cataract patient education.大语言模型:小儿白内障患者教育的新前沿。
Br J Ophthalmol. 2024 Sep 20;108(10):1470-1476. doi: 10.1136/bjo-2024-325252.
10
Assessing the Responses of Large Language Models (ChatGPT-4, Gemini, and Microsoft Copilot) to Frequently Asked Questions in Breast Imaging: A Study on Readability and Accuracy.评估大语言模型(ChatGPT-4、Gemini和Microsoft Copilot)对乳腺成像常见问题的回答:可读性和准确性研究
Cureus. 2024 May 9;16(5):e59960. doi: 10.7759/cureus.59960. eCollection 2024 May.