• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

生成式人工智能撰写开放式病历:对GPT 3.5和GPT 4.0功能的混合方法评估

Generative artificial intelligence writing open notes: A mixed methods assessment of the functionality of GPT 3.5 and GPT 4.0.

作者信息

Kharko Anna, McMillan Brian, Hagström Josefin, Muli Irene, Davidge Gail, Hägglund Maria, Blease Charlotte

机构信息

Participatory eHealth and Health Data Research Group, Department of Women's and Children's Health, Uppsala University, Uppsala, Sweden.

Medtech Science & Innovation Centre, Uppsala University Hospital, Uppsala, Sweden.

出版信息

Digit Health. 2024 Oct 29;10:20552076241291384. doi: 10.1177/20552076241291384. eCollection 2024 Jan-Dec.

DOI:10.1177/20552076241291384
PMID:39493632
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11528788/
Abstract

BACKGROUND

Worldwide, patients are increasingly being offered access to their full online clinical records including the narrative reports written by clinicians (so-called "open notes"). Against these developments, there is growing interest in the use of generative artificial intelligence (AI) such as OpenAI's ChatGPT to co-assist clinicians with patient-facing documentation.

OBJECTIVE

This study aimed to explore the effectiveness of OpenAI's ChatGPT 3.5 and GPT 4.0 in generating three patient-facing clinical notes from fictional general practice narrative reports.

METHODS

On 1 October 2023 and 1 November 2023, we used ChatGPT 3.5 and 4.0 to generate notes for three validated fictional general practice notes, using a prompt in the style of a British primary care note for three commonly presented conditions: (1) type 2 diabetes, (2) major depressive disorder, and (3) a differential diagnosis for suspected bowel cancer. Outputs were analyzed for reading ease, sentiment analysis, empathy, and medical fidelity.

RESULTS

ChatGPT 3.5 and 4.0 wrote longer notes than the original, and embedded more second person pronouns, with ChatGPT 3.5 scoring higher on both. ChatGPT expanded abbreviations, but readability metrics showed that the notes required a higher reading proficiency, with ChatGPT 3.5 demanding the most advanced level. Across all notes, ChatGPT offered higher signatures of empathy across cognitive, compassion/sympathy, and prosocial cues. Medical fidelity ratings varied across all three cases with ChatGPT 4.0 rated superior.

CONCLUSIONS

While ChatGPT improved sentiment and empathy metrics in the transformed notes, compared to the original they also required higher reading proficiency and omitted details impacting medical fidelity.

摘要

背景

在全球范围内,患者越来越能够获取自己完整的在线临床记录,包括临床医生撰写的叙述性报告(即所谓的“开放病历”)。针对这些发展趋势,人们越来越关注使用生成式人工智能(AI),如OpenAI的ChatGPT,来协助临床医生进行面向患者的文档记录。

目的

本研究旨在探讨OpenAI的ChatGPT 3.5和GPT 4.0从虚构的全科医疗叙述性报告中生成三份面向患者的临床记录的有效性。

方法

在2023年10月1日和2023年11月1日,我们使用ChatGPT 3.5和4.0为三份经过验证的虚构全科医疗记录生成记录,使用英国初级保健记录风格的提示语,针对三种常见病症:(1)2型糖尿病,(2)重度抑郁症,以及(3)疑似肠癌的鉴别诊断。对输出内容进行易读性分析、情感分析、同理心分析和医学准确性分析。

结果

ChatGPT 3.5和4.0生成的记录比原文更长,且嵌入了更多第二人称代词,ChatGPT 3.5在这两方面得分更高。ChatGPT扩展了缩写,但可读性指标显示这些记录需要更高的阅读水平,ChatGPT 3.5要求的阅读水平最高。在所有记录中,ChatGPT在认知、同情/怜悯和亲社会线索方面表现出更高的同理心特征。在所有三个病例中,医学准确性评分各不相同,ChatGPT 4.0的评分更高。

结论

虽然ChatGPT在转换后的记录中改善了情感和同理心指标,但与原文相比,它们也需要更高的阅读水平,并且遗漏了影响医学准确性的细节。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5863/11528788/a52f174f49a3/10.1177_20552076241291384-fig1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5863/11528788/a52f174f49a3/10.1177_20552076241291384-fig1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5863/11528788/a52f174f49a3/10.1177_20552076241291384-fig1.jpg

相似文献

1
Generative artificial intelligence writing open notes: A mixed methods assessment of the functionality of GPT 3.5 and GPT 4.0.生成式人工智能撰写开放式病历:对GPT 3.5和GPT 4.0功能的混合方法评估
Digit Health. 2024 Oct 29;10:20552076241291384. doi: 10.1177/20552076241291384. eCollection 2024 Jan-Dec.
2
Generative Language Models and Open Notes: Exploring the Promise and Limitations.生成式语言模型与开放病历:探索其前景与局限。
JMIR Med Educ. 2024 Jan 4;10:e51183. doi: 10.2196/51183.
3
Using ChatGPT-4 to Create Structured Medical Notes From Audio Recordings of Physician-Patient Encounters: Comparative Study.利用 ChatGPT-4 从医患对话的音频记录中创建结构化的医疗记录:比较研究。
J Med Internet Res. 2024 Apr 22;26:e54419. doi: 10.2196/54419.
4
Optimizing ChatGPT's Interpretation and Reporting of Delirium Assessment Outcomes: Exploratory Study.优化 ChatGPT 对谵妄评估结果的解释和报告:探索性研究。
JMIR Form Res. 2024 Oct 1;8:e51383. doi: 10.2196/51383.
5
Harnessing the Power of Generative AI for Clinical Summaries: Perspectives From Emergency Physicians.利用生成式人工智能为临床总结提供助力:来自急诊医师的观点。
Ann Emerg Med. 2024 Aug;84(2):128-138. doi: 10.1016/j.annemergmed.2024.01.039. Epub 2024 Mar 12.
6
Assessing the Ability of a Large Language Model to Score Free-Text Medical Student Clinical Notes: Quantitative Study.评估大型语言模型对自由文本医学生临床笔记评分的能力:定量研究。
JMIR Med Educ. 2024 Jul 25;10:e56342. doi: 10.2196/56342.
7
Application of artificial intelligence chatbots, including ChatGPT, in education, scholarly work, programming, and content generation and its prospects: a narrative review.人工智能聊天机器人(包括 ChatGPT)在教育、学术工作、编程、内容生成等领域的应用及其前景:叙述性综述。
J Educ Eval Health Prof. 2023;20:38. doi: 10.3352/jeehp.2023.20.38. Epub 2023 Dec 27.
8
Assessing the Application of Large Language Models in Generating Dermatologic Patient Education Materials According to Reading Level: Qualitative Study.评估大语言模型在根据阅读水平生成皮肤科患者教育材料方面的应用:定性研究。
JMIR Dermatol. 2024 May 16;7:e55898. doi: 10.2196/55898.
9
Generative artificial intelligence chatbots may provide appropriate informational responses to common vascular surgery questions by patients.生成式人工智能聊天机器人可能会为患者关于常见血管外科问题提供恰当的信息性回复。
Vascular. 2025 Feb;33(1):229-237. doi: 10.1177/17085381241240550. Epub 2024 Mar 18.
10
Comparison of the Quality of Discharge Letters Written by Large Language Models and Junior Clinicians: Single-Blinded Study.大语言模型与初级临床医生撰写的出院小结质量比较:单盲研究
J Med Internet Res. 2024 Jul 24;26:e57721. doi: 10.2196/57721.

引用本文的文献

1
Placebo, Nocebo, and Machine Learning: How Generative AI Could Shape Patient Perception in Mental Health Care.安慰剂、反安慰剂与机器学习:生成式人工智能如何塑造精神卫生保健中的患者认知。
JMIR Ment Health. 2025 Aug 15;12:e78663. doi: 10.2196/78663.
2
Health Care Professionals' Experiences and Opinions About Generative AI and Ambient Scribes in Clinical Documentation: Protocol for a Scoping Review.医疗保健专业人员对生成式人工智能和临床文档中的环境抄写员的经验与看法:一项范围综述的方案
JMIR Res Protoc. 2025 Aug 8;14:e73602. doi: 10.2196/73602.
3
Generative Artificial Intelligence in Primary Care: Qualitative Study of UK General Practitioners' Views.

本文引用的文献

1
Generative artificial intelligence in primary care: an online survey of UK general practitioners.初级保健中的生成式人工智能:英国全科医生的在线调查。
BMJ Health Care Inform. 2024 Sep 17;31(1):e101102. doi: 10.1136/bmjhci-2024-101102.
2
ChatGPT-4 generates orthopedic discharge documents faster than humans maintaining comparable quality: a pilot study of 6 cases.ChatGPT-4 生成骨科出院文件的速度快于人类,且保持相同质量:6 例初步研究。
Acta Orthop. 2024 Mar 21;95:152-156. doi: 10.2340/17453674.2024.40182.
3
Why we should not mistake accuracy of medical AI for efficiency.
基层医疗中的生成式人工智能:对英国全科医生观点的定性研究
J Med Internet Res. 2025 Aug 6;27:e74428. doi: 10.2196/74428.
4
General practitioners' opinions of generative artificial intelligence in the UK: An online survey.英国全科医生对生成式人工智能的看法:一项在线调查。
Digit Health. 2025 Jul 17;11:20552076251360863. doi: 10.1177/20552076251360863. eCollection 2025 Jan-Dec.
5
Big Data-Driven Health Portraits for Personalized Management in Noncommunicable Diseases: Scoping Review.用于非传染性疾病个性化管理的大数据驱动健康画像:范围综述
J Med Internet Res. 2025 Jun 5;27:e72636. doi: 10.2196/72636.
6
A proof-of-concept study for patient use of open notes with large language models.一项关于患者使用带有大语言模型的开放病历的概念验证研究。
JAMIA Open. 2025 Apr 9;8(2):ooaf021. doi: 10.1093/jamiaopen/ooaf021. eCollection 2025 Apr.
为什么我们不应将医学人工智能的准确性误认为是效率。
NPJ Digit Med. 2024 Mar 4;7(1):57. doi: 10.1038/s41746-024-01047-2.
4
Artificial intelligence compared with human-derived patient educational materials on cirrhosis.人工智能与人类生成的肝硬化患者教育材料的比较。
Hepatol Commun. 2024 Feb 14;8(3). doi: 10.1097/HC9.0000000000000367. eCollection 2024 Mar 1.
5
AI-Generated Clinical Summaries Require More Than Accuracy.人工智能生成的临床总结需要的不仅仅是准确性。
JAMA. 2024 Feb 27;331(8):637-638. doi: 10.1001/jama.2024.0555.
6
Mapping Patients' Online Record Access Worldwide: Preliminary Results from an International Survey of Healthcare Experts.全球范围内的患者在线记录访问情况调查:一项针对医疗保健专家的国际调查的初步结果。
Stud Health Technol Inform. 2024 Jan 25;310:114-118. doi: 10.3233/SHTI230938.
7
Psychiatrists' experiences and opinions of generative artificial intelligence in mental healthcare: An online mixed methods survey.精神科医生在精神卫生保健中对生成式人工智能的体验与看法:一项在线混合方法调查。
Psychiatry Res. 2024 Mar;333:115724. doi: 10.1016/j.psychres.2024.115724. Epub 2024 Jan 11.
8
Generative artificial intelligence in mental health care: potential benefits and current challenges.心理健康护理中的生成式人工智能:潜在益处与当前挑战。
World Psychiatry. 2024 Feb;23(1):1-2. doi: 10.1002/wps.21148.
9
Generative Language Models and Open Notes: Exploring the Promise and Limitations.生成式语言模型与开放病历:探索其前景与局限。
JMIR Med Educ. 2024 Jan 4;10:e51183. doi: 10.2196/51183.
10
Open AI meets open notes: surveillance capitalism, patient privacy and online record access.Open AI 遭遇开放笔记:监控资本主义、患者隐私与在线记录访问。
J Med Ethics. 2024 Jan 23;50(2):84-89. doi: 10.1136/jme-2023-109574.