• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于临床常规放射学申请表,使用GPT-4自动进行MRI协议的序列级确定。

GPT-4 for automated sequence-level determination of MRI protocols based on radiology request forms from clinical routine.

作者信息

Terzis Robert, Kaya Kenan, Schömig Thomas, Janssen Jan Paul, Iuga Andra-Iza, Kottlors Jonathan, Lennartz Simon, Gietzen Carsten, Gözdas Cansin, Müller Lukas, Hahnfeldt Robert, Maintz David, Dratsch Thomas, Pennig Lenhard

机构信息

Institute for Diagnostic and Interventional Radiology, Faculty of Medicine and University Hospital Cologne, University of Cologne, Cologne, Germany.

Department of Diagnostic and Interventional Radiology, University Medical Center of the Johannes Gutenberg-University, Mainz, Germany.

出版信息

Eur Radiol. 2025 Aug 8. doi: 10.1007/s00330-025-11888-4.

DOI:10.1007/s00330-025-11888-4
PMID:40779162
Abstract

OBJECTIVES

This study evaluated GPT-4's accuracy in MRI sequence selection based on radiology request forms (RRFs), comparing its performance to radiology residents.

MATERIALS AND METHODS

This retrospective study included 100 RRFs across four subspecialties (cardiac imaging, neuroradiology, musculoskeletal, and oncology). GPT-4 and two radiology residents (R1: 2 years, R2: 5 years MRI experience) selected sequences based on each patient's medical history and clinical questions. Considering imaging society guidelines, five board-certified specialized radiologists assessed protocols based on completeness, quality, and utility in consensus, using 5-point Likert scales. Clinical applicability was rated binarily by the institution's lead radiographer.

RESULTS

GPT-4 achieved median scores of 3 (1-5) for completeness, 4 (1-5) for quality, and 4 (1-5) for utility, comparable to R1 (3 (1-5), 4 (1-5), 4 (1-5); each p > 0.05) but inferior to R2 (4 (1-5), 5 (1-5); p < 0.01, respectively, and 5 (1-5); p < 0.001). Subspecialty protocol quality varied: GPT-4 matched R1 (4 (2-4) vs. 4 (2-5), p = 0.20) and R2 (4 (2-5); p = 0.47) in cardiac imaging; showed no differences in neuroradiology (all 5 (1-5), p > 0.05); scored lower than R1 and R2 in musculoskeletal imaging (3 (2-5) vs. 4 (3-5); p < 0.01, and 5 (3-5); p < 0.001); and matched R1 (4 (1-5) vs. 2 (1-4), p = 0.12) as well as R2 (5 (2-5); p = 0.20) in oncology. GPT-4-based protocols were clinically applicable in 95% of cases, comparable to R1 (95%) and R2 (96%).

CONCLUSION

GPT-4 generated MRI protocols with notable completeness, quality, utility, and clinical applicability, excelling in standardized subspecialties like cardiac and neuroradiology imaging while yielding lower accuracy in musculoskeletal examinations.

KEY POINTS

Question Long MRI acquisition times limit patient access, making accurate protocol selection crucial for efficient diagnostics, though it's time-consuming and error-prone, especially for inexperienced residents. Findings GPT-4 generated MRI protocols of remarkable yet inconsistent quality, performing on par with an experienced resident in standardized fields, but moderately in musculoskeletal examinations. Clinical relevance The large language model can assist less experienced radiologists in determining detailed MRI protocols and counteract increasing workloads. The model could function as a semi-automatic tool, generating MRI protocols for radiologists' confirmation, optimizing resource allocation, and improving diagnostics and cost-effectiveness.

摘要

目的

本研究评估了GPT-4基于放射学申请表(RRFs)选择MRI序列的准确性,并将其表现与放射科住院医师进行比较。

材料与方法

这项回顾性研究纳入了来自四个亚专业(心脏成像、神经放射学、肌肉骨骼和肿瘤学)的100份RRFs。GPT-4和两名放射科住院医师(R1:2年经验,R2:5年MRI经验)根据每位患者的病史和临床问题选择序列。根据影像学会指南,五位获得委员会认证的专业放射科医生使用5分制李克特量表,基于完整性、质量和实用性对方案进行共识评估。临床适用性由该机构的首席放射技师进行二元评级。

结果

GPT-4在完整性方面的中位数评分为3(1 - 5),质量方面为4(1 - 5),实用性方面为4(1 - 5),与R1(3(1 - 5),4(1 - 5),4(1 - 5);各p>0.05)相当,但不如R2(分别为4(1 - 5),5(1 - 5);p<0.01,以及5(1 - 5);p<0.001)。亚专业方案质量各不相同:GPT-4在心脏成像方面与R1(4(2 - 4)对4(2 - 5),p = 0.20)和R2(4(2 - 5);p = 0.47)相当;在神经放射学方面无差异(均为5(1 - 5),p>0.05);在肌肉骨骼成像方面得分低于R1和R2(3(2 - 5)对4(3 - 5);p<0.01,以及5(3 - 5);p<0.001);在肿瘤学方面与R1(4(1 - 5)对2(1 - 4),p = 0.12)以及R2(5(2 - 5);p = 0.20)相当。基于GPT-4的方案在95%的病例中具有临床适用性,与R1(95%)和R2(96%)相当。

结论

GPT-4生成的MRI方案具有显著的完整性、质量、实用性和临床适用性,在心脏和神经放射学成像等标准化亚专业中表现出色,而在肌肉骨骼检查中的准确性较低。

关键点

问题MRI采集时间长限制了患者的检查机会,准确选择方案对高效诊断至关重要,尽管这既耗时又容易出错,尤其是对于经验不足的住院医师。发现GPT-4生成的MRI方案质量显著但不一致,在标准化领域与经验丰富的住院医师表现相当,但在肌肉骨骼检查中表现中等。临床意义大语言模型可以帮助经验不足的放射科医生确定详细的MRI方案,并应对不断增加的工作量。该模型可作为半自动工具,生成MRI方案供放射科医生确认,优化资源分配,提高诊断效率和成本效益。

相似文献

1
GPT-4 for automated sequence-level determination of MRI protocols based on radiology request forms from clinical routine.基于临床常规放射学申请表,使用GPT-4自动进行MRI协议的序列级确定。
Eur Radiol. 2025 Aug 8. doi: 10.1007/s00330-025-11888-4.
2
Using Natural Language Processing (GPT-4) for Computed Tomography Image Analysis of Cerebral Hemorrhages in Radiology: Retrospective Analysis.使用自然语言处理(GPT-4)进行放射学中脑溢血的计算机断层扫描图像分析:回顾性分析。
J Med Internet Res. 2024 Sep 26;26:e58741. doi: 10.2196/58741.
3
Use of ChatGPT Large Language Models to Extract Details of Recommendations for Additional Imaging From Free-Text Impressions of Radiology Reports.使用ChatGPT大型语言模型从放射学报告的自由文本印象中提取额外影像学检查建议的详细信息。
AJR Am J Roentgenol. 2025 Apr;224(4):e2432341. doi: 10.2214/AJR.24.32341. Epub 2025 Jan 29.
4
An Institutional Large Language Model for Musculoskeletal MRI Improves Protocol Adherence and Accuracy.用于肌肉骨骼磁共振成像的机构大语言模型可提高方案依从性和准确性。
J Bone Joint Surg Am. 2025 Jul 8. doi: 10.2106/JBJS.24.01429.
5
Improving radiology reporting accuracy: use of GPT-4 to reduce errors in reports.提高放射学报告准确性:使用GPT-4减少报告中的错误。
Abdom Radiol (NY). 2025 Jun 27. doi: 10.1007/s00261-025-05079-4.
6
Magnetic resonance perfusion for differentiating low-grade from high-grade gliomas at first presentation.首次就诊时磁共振灌注成像用于鉴别低级别与高级别胶质瘤
Cochrane Database Syst Rev. 2018 Jan 22;1(1):CD011551. doi: 10.1002/14651858.CD011551.pub2.
7
Performance of ChatGPT Across Different Versions in Medical Licensing Examinations Worldwide: Systematic Review and Meta-Analysis.ChatGPT 在全球医学执照考试不同版本中的表现:系统评价和荟萃分析。
J Med Internet Res. 2024 Jul 25;26:e60807. doi: 10.2196/60807.
8
Thyroid Eye Disease and Artificial Intelligence: A Comparative Study of ChatGPT-3.5, ChatGPT-4o, and Gemini in Patient Information Delivery.甲状腺眼病与人工智能:ChatGPT-3.5、ChatGPT-4o和Gemini在患者信息传递方面的比较研究
Ophthalmic Plast Reconstr Surg. 2024 Dec 24. doi: 10.1097/IOP.0000000000002882.
9
Cost-effectiveness of using prognostic information to select women with breast cancer for adjuvant systemic therapy.利用预后信息为乳腺癌患者选择辅助性全身治疗的成本效益
Health Technol Assess. 2006 Sep;10(34):iii-iv, ix-xi, 1-204. doi: 10.3310/hta10340.
10
Enhancing Magnetic Resonance Imaging (MRI) Report Comprehension in Spinal Trauma: Readability Analysis of AI-Generated Explanations for Thoracolumbar Fractures.提高脊柱创伤磁共振成像(MRI)报告的理解:胸腰椎骨折人工智能生成解释的可读性分析
JMIR AI. 2025 Jul 1;4:e69654. doi: 10.2196/69654.

本文引用的文献

1
Monitoring Patients with Glioblastoma by Using a Large Language Model: Accurate Summarization of Radiology Reports with GPT-4.使用大语言模型监测胶质母细胞瘤患者:利用GPT-4对放射学报告进行准确总结
Radiology. 2024 Jul;312(1):e232640. doi: 10.1148/radiol.232640.
2
ESR essentials: MRI of the knee-practice recommendations by ESSR.ESR 精华:膝关节 MRI——ESR 实践推荐。
Eur Radiol. 2024 Oct;34(10):6590-6599. doi: 10.1007/s00330-024-10706-7. Epub 2024 Mar 27.
3
Five-minute knee MRI: An AI-based super resolution reconstruction approach for compressed sensing. A validation study on healthy volunteers.
五分钟膝关节 MRI:基于 AI 的压缩感知超分辨率重建方法。一项针对健康志愿者的验证研究。
Eur J Radiol. 2024 Jun;175:111418. doi: 10.1016/j.ejrad.2024.111418. Epub 2024 Mar 9.
4
ChatGPT's Accuracy on Magnetic Resonance Imaging Basics: Characteristics and Limitations Depending on the Question Type.ChatGPT在磁共振成像基础知识方面的准确性:取决于问题类型的特征与局限性
Diagnostics (Basel). 2024 Jan 12;14(2):171. doi: 10.3390/diagnostics14020171.
5
Diagnostic Approach for Suspected Acute Myocarditis: Considerations for Standardization and Broadening Clinical Spectrum.疑似急性心肌炎的诊断方法:考虑标准化和拓宽临床谱。
J Am Heart Assoc. 2023 Sep 5;12(17):e031454. doi: 10.1161/JAHA.123.031454. Epub 2023 Aug 17.
6
GPT-4 for Automated Determination of Radiological Study and Protocol based on Radiology Request Forms: A Feasibility Study.基于放射学申请单的放射学研究和方案的 GPT-4 自动确定:一项可行性研究。
Radiology. 2023 Jun;307(5):e230877. doi: 10.1148/radiol.230877.
7
Benefits, Limits, and Risks of GPT-4 as an AI Chatbot for Medicine.GPT-4作为医学人工智能聊天机器人的益处、局限性和风险
N Engl J Med. 2023 Mar 30;388(13):1233-1239. doi: 10.1056/NEJMsr2214184.
8
How Does ChatGPT Perform on the United States Medical Licensing Examination (USMLE)? The Implications of Large Language Models for Medical Education and Knowledge Assessment.ChatGPT在美国医师执照考试(USMLE)中的表现如何?大语言模型对医学教育和知识评估的影响。
JMIR Med Educ. 2023 Feb 8;9:e45312. doi: 10.2196/45312.
9
Automated Protocoling for MRI Exams-Challenges and Solutions.MRI 检查的自动化协议制定:挑战与解决方案。
J Digit Imaging. 2022 Oct;35(5):1293-1302. doi: 10.1007/s10278-022-00610-1. Epub 2022 Aug 30.
10
Radiological Structured Report Integrated with Quantitative Imaging Biomarkers and Qualitative Scoring Systems.放射学结构化报告与定量成像生物标志物和定性评分系统的整合。
J Digit Imaging. 2022 Jun;35(3):396-407. doi: 10.1007/s10278-022-00589-9. Epub 2022 Feb 1.