• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

GPT-3和GPT-4在介入放射学程序前进行深入患者教育的可行性:一项比较分析。

Feasibility of GPT-3 and GPT-4 for in-Depth Patient Education Prior to Interventional Radiological Procedures: A Comparative Analysis.

作者信息

Scheschenja Michael, Viniol Simon, Bastian Moritz B, Wessendorf Joel, König Alexander M, Mahnken Andreas H

机构信息

Department of Diagnostic and Interventional Radiology, University Hospital Marburg, Philipps-University of Marburg, Baldingerstrasse 1, 35043, Marburg, DE, Germany.

出版信息

Cardiovasc Intervent Radiol. 2024 Feb;47(2):245-250. doi: 10.1007/s00270-023-03563-2. Epub 2023 Oct 23.

DOI:10.1007/s00270-023-03563-2
PMID:37872295
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC10844465/
Abstract

PURPOSE

This study explores the utility of the large language models, GPT-3 and GPT-4, for in-depth patient education prior to interventional radiology procedures. Further, differences in answer accuracy between the models were assessed.

MATERIALS AND METHODS

A total of 133 questions related to three specific interventional radiology procedures (Port implantation, PTA and TACE) covering general information as well as preparation details, risks and complications and post procedural aftercare were compiled. Responses of GPT-3 and GPT-4 were assessed for their accuracy by two board-certified radiologists using a 5-point Likert scale. The performance difference between GPT-3 and GPT-4 was analyzed.

RESULTS

Both GPT-3 and GPT-4 responded with (5) "completely correct" (4) "very good" answers for the majority of questions ((5) 30.8% + (4) 48.1% for GPT-3 and (5) 35.3% + (4) 47.4% for GPT-4). GPT-3 and GPT-4 provided (3) "acceptable" responses 15.8% and 15.0% of the time, respectively. GPT-3 provided (2) "mostly incorrect" responses in 5.3% of instances, while GPT-4 had a lower rate of such occurrences, at just 2.3%. No response was identified as potentially harmful. GPT-4 was found to give significantly more accurate responses than GPT-3 (p = 0.043).

CONCLUSION

GPT-3 and GPT-4 emerge as relatively safe and accurate tools for patient education in interventional radiology. GPT-4 showed a slightly better performance. The feasibility and accuracy of these models suggest their promising role in revolutionizing patient care. Still, users need to be aware of possible limitations.

摘要

目的

本研究探讨大语言模型GPT-3和GPT-4在介入放射学手术前进行深入患者教育的效用。此外,还评估了模型之间答案准确性的差异。

材料与方法

共收集了133个与三种特定介入放射学手术(端口植入、经皮腔内血管成形术和经动脉化疗栓塞术)相关的问题,涵盖一般信息以及准备细节、风险和并发症以及术后护理。两名获得董事会认证的放射科医生使用5点李克特量表评估GPT-3和GPT-4的回答准确性。分析了GPT-3和GPT-4之间的性能差异。

结果

对于大多数问题,GPT-3和GPT-4的回答都是(5)“完全正确”(4)“非常好”(GPT-3为(5)30.8% + (4)48.1%,GPT-4为(5)35.3% + (4)47.4%)。GPT-3和GPT-4分别有15.8%和15.0%的时间提供(3)“可接受”的回答。GPT-3在5.3%的情况下提供(2)“大多不正确”的回答,而GPT-4的此类发生率较低,仅为2.3%。未发现有回答具有潜在危害。发现GPT-4的回答比GPT-3明显更准确(p = 0.043)。

结论

GPT-3和GPT-4成为介入放射学患者教育中相对安全且准确的工具。GPT-4表现稍好。这些模型的可行性和准确性表明它们在彻底改变患者护理方面具有广阔前景。不过,用户需要意识到可能存在的局限性。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/da48/10844465/11b7f7f6b72b/270_2023_3563_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/da48/10844465/11b7f7f6b72b/270_2023_3563_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/da48/10844465/11b7f7f6b72b/270_2023_3563_Fig1_HTML.jpg

相似文献

1
Feasibility of GPT-3 and GPT-4 for in-Depth Patient Education Prior to Interventional Radiological Procedures: A Comparative Analysis.GPT-3和GPT-4在介入放射学程序前进行深入患者教育的可行性:一项比较分析。
Cardiovasc Intervent Radiol. 2024 Feb;47(2):245-250. doi: 10.1007/s00270-023-03563-2. Epub 2023 Oct 23.
2
Diagnostic accuracy of vision-language models on Japanese diagnostic radiology, nuclear medicine, and interventional radiology specialty board examinations.视觉语言模型在日本放射诊断学、核医学和介入放射学专业委员会考试中的诊断准确性。
Jpn J Radiol. 2024 Dec;42(12):1392-1398. doi: 10.1007/s11604-024-01633-0. Epub 2024 Jul 20.
3
Large Language Models for Simplified Interventional Radiology Reports: A Comparative Analysis.用于简化介入放射学报告的大语言模型:一项比较分析
Acad Radiol. 2025 Feb;32(2):888-898. doi: 10.1016/j.acra.2024.09.041. Epub 2024 Sep 30.
4
Performance of GPT-4V in Answering the Japanese Otolaryngology Board Certification Examination Questions: Evaluation Study.GPT-4V 在回答日本耳鼻喉科学委员会认证考试问题方面的表现:评估研究。
JMIR Med Educ. 2024 Mar 28;10:e57054. doi: 10.2196/57054.
5
GPT-4 Turbo with Vision fails to outperform text-only GPT-4 Turbo in the Japan Diagnostic Radiology Board Examination.GPT-4 Turbo with Vision 在日本诊断放射学委员会考试中未能优于仅文本的 GPT-4 Turbo。
Jpn J Radiol. 2024 Aug;42(8):918-926. doi: 10.1007/s11604-024-01561-z. Epub 2024 May 11.
6
Response accuracy of GPT-4 across languages: insights from an expert-level diagnostic radiology examination in Japan.GPT-4在多种语言中的回答准确性:来自日本专家级诊断放射学考试的见解。
Jpn J Radiol. 2025 Feb;43(2):319-329. doi: 10.1007/s11604-024-01673-6. Epub 2024 Oct 28.
7
Appropriateness of ChatGPT in Answering Heart Failure Related Questions.ChatGPT 在回答心力衰竭相关问题方面的适宜性。
Heart Lung Circ. 2024 Sep;33(9):1314-1318. doi: 10.1016/j.hlc.2024.03.005. Epub 2024 May 31.
8
Enhanced PROcedural Information READability for Patient-Centered Care in Interventional Radiology With Large Language Models (PRO-READ IR).利用大语言模型提高介入放射学中以患者为中心的护理的程序信息可读性(PRO-READ IR)。
J Am Coll Radiol. 2025 Jan;22(1):84-97. doi: 10.1016/j.jacr.2024.08.010. Epub 2024 Aug 30.
9
GPT-4o’s competency in answering the simulated written European Board of Interventional Radiology exam compared to a medical student and experts in Germany and its ability to generate exam items on interventional radiology: a descriptive study.GPT-4o 在回答模拟的欧洲介入放射学委员会考试方面的能力与德国医学生和专家相比,以及其在介入放射学方面生成考试项目的能力:一项描述性研究。
J Educ Eval Health Prof. 2024;21:21. doi: 10.3352/jeehp.2024.21.21. Epub 2024 Aug 20.
10
Large Language Models as Tools to Generate Radiology Board-Style Multiple-Choice Questions.大语言模型在生成放射科 Board 式多项选择题中的应用。
Acad Radiol. 2024 Sep;31(9):3872-3878. doi: 10.1016/j.acra.2024.06.046. Epub 2024 Jul 15.

引用本文的文献

1
Assessing the ability of large language models to simplify lumbar spine imaging reports into patient-facing text: a pilot study of GPT-4.评估大语言模型将腰椎影像报告简化为面向患者文本的能力:一项关于GPT-4的初步研究
Skeletal Radiol. 2025 Sep 9. doi: 10.1007/s00256-025-05027-9.
2
ChatGPT-4 Responses on Ankle Cartilage Surgery Often Diverge from Expert Consensus: A Comparative Analysis.ChatGPT-4对踝关节软骨手术的回答往往与专家共识存在分歧:一项比较分析。
Foot Ankle Orthop. 2025 Aug 13;10(3):24730114251352494. doi: 10.1177/24730114251352494. eCollection 2025 Jul.
3
Comparison of ChatGPT and Internet Research for Clinical Research and Decision-Making in Occupational Medicine: Randomized Controlled Trial.

本文引用的文献

1
Evaluation of an Artificial Intelligence Chatbot for Delivery of IR Patient Education Material: A Comparison with Societal Website Content.评估人工智能聊天机器人在传递 IR 患者教育材料方面的效果:与社会网站内容的比较。
J Vasc Interv Radiol. 2023 Oct;34(10):1760-1768.e32. doi: 10.1016/j.jvir.2023.05.037. Epub 2023 Jun 16.
2
GPT-4 for Automated Determination of Radiological Study and Protocol based on Radiology Request Forms: A Feasibility Study.基于放射学申请单的放射学研究和方案的 GPT-4 自动确定:一项可行性研究。
Radiology. 2023 Jun;307(5):e230877. doi: 10.1148/radiol.230877.
3
Large language models for structured reporting in radiology: performance of GPT-4, ChatGPT-3.5, Perplexity and Bing.
ChatGPT与互联网搜索用于职业医学临床研究和决策的比较:随机对照试验
JMIR Form Res. 2025 May 20;9:e63857. doi: 10.2196/63857.
4
Bridging Gaps in Cancer Care: Utilizing Large Language Models for Accessible Dietary Recommendations.弥合癌症护理差距:利用大语言模型提供可获取的饮食建议。
Nutrients. 2025 Mar 28;17(7):1176. doi: 10.3390/nu17071176.
5
IR-GPT: AI Foundation Models to Optimize Interventional Radiology.IR-GPT:用于优化介入放射学的人工智能基础模型。
Cardiovasc Intervent Radiol. 2025 May;48(5):585-592. doi: 10.1007/s00270-024-03945-0. Epub 2025 Mar 26.
6
Current applications and challenges in large language models for patient care: a systematic review.用于患者护理的大语言模型的当前应用与挑战:一项系统综述
Commun Med (Lond). 2025 Jan 21;5(1):26. doi: 10.1038/s43856-024-00717-2.
7
ChatGPT4's diagnostic accuracy in inpatient neurology: A retrospective cohort study.ChatGPT4在住院神经内科的诊断准确性:一项回顾性队列研究。
Heliyon. 2024 Dec 9;10(24):e40964. doi: 10.1016/j.heliyon.2024.e40964. eCollection 2024 Dec 30.
8
Large language model doctor: assessing the ability of ChatGPT-4 to deliver interventional radiology procedural information to patients during the consent process.大语言模型医生:评估ChatGPT-4在知情同意过程中向患者提供介入放射学程序信息的能力。
CVIR Endovasc. 2024 Nov 29;7(1):83. doi: 10.1186/s42155-024-00477-z.
9
Large language models in patient education: a scoping review of applications in medicine.用于患者教育的大语言模型:医学应用的范围综述
Front Med (Lausanne). 2024 Oct 29;11:1477898. doi: 10.3389/fmed.2024.1477898. eCollection 2024.
10
Assessing the Role of the Generative Pretrained Transformer (GPT) in Alzheimer's Disease Management: Comparative Study of Neurologist- and Artificial Intelligence-Generated Responses.评估生成式预训练转换器(GPT)在阿尔茨海默病管理中的作用:神经科医生和人工智能生成的回复的对比研究。
J Med Internet Res. 2024 Oct 31;26:e51095. doi: 10.2196/51095.
用于放射科结构化报告的大型语言模型:GPT-4、ChatGPT-3.5、Perplexity 和 Bing 的性能。
Radiol Med. 2023 Jul;128(7):808-812. doi: 10.1007/s11547-023-01651-4. Epub 2023 May 29.
4
Translating radiology reports into plain language using ChatGPT and GPT-4 with prompt learning: results, limitations, and potential.使用ChatGPT和GPT-4通过提示学习将放射学报告翻译成通俗易懂的语言:结果、局限性和潜力。
Vis Comput Ind Biomed Art. 2023 May 18;6(1):9. doi: 10.1186/s42492-023-00136-5.
5
Exploring the Boundaries of Reality: Investigating the Phenomenon of Artificial Intelligence Hallucination in Scientific Writing Through ChatGPT References.探索现实的边界:通过ChatGPT参考文献研究科学写作中的人工智能幻觉现象。
Cureus. 2023 Apr 11;15(4):e37432. doi: 10.7759/cureus.37432. eCollection 2023 Apr.
6
Accuracy of Information and References Using ChatGPT-3 for Retrieval of Clinical Radiological Information.使用 ChatGPT-3 检索临床放射学信息的信息和参考文献的准确性。
Can Assoc Radiol J. 2024 Feb;75(1):69-73. doi: 10.1177/08465371231171125. Epub 2023 Apr 20.
7
ChatGPT Utility in Healthcare Education, Research, and Practice: Systematic Review on the Promising Perspectives and Valid Concerns.ChatGPT在医学教育、研究与实践中的应用:对其前景与合理担忧的系统评价
Healthcare (Basel). 2023 Mar 19;11(6):887. doi: 10.3390/healthcare11060887.
8
Revolutionizing radiology with GPT-based models: Current applications, future possibilities and limitations of ChatGPT.基于 GPT 的模型推动放射学革命:ChatGPT 的当前应用、未来可能性和局限性。
Diagn Interv Imaging. 2023 Jun;104(6):269-274. doi: 10.1016/j.diii.2023.02.003. Epub 2023 Feb 28.
9
Public Awareness of Interventional Radiology: Population-Based Analysis of the Current State of and Pathways for Improvement.公众对介入放射学的认知:基于人群的当前状况分析及改善途径。
J Vasc Interv Radiol. 2023 Jun;34(6):960-967.e6. doi: 10.1016/j.jvir.2023.01.033. Epub 2023 Feb 9.
10
Open artificial intelligence platforms in nursing education: Tools for academic progress or abuse?护理教育中的开放人工智能平台:学术进步的工具还是滥用的手段?
Nurse Educ Pract. 2023 Jan;66:103537. doi: 10.1016/j.nepr.2022.103537. Epub 2022 Dec 16.