• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

大语言模型时代神经放射学中的自动MRI协议制定

Automated MRI protocoling in neuroradiology in the era of large language models.

作者信息

Reiner Lara Noelle, Chelbi Moudather, Fetscher Leonard, Stöckel Juliane C, Csapó-Schmidt Christoph, Guseynova Shakhnaz, Al Mohamad Fares, Bressem Keno Kyrill, Nawabi Jawed, Siebert Eberhard, Wattjes Mike P, Scheel Michael, Meddeb Aymen

机构信息

Department of Neuroradiology, Charité-Universitätsmedizin Berlin, Augustenburger Platz 1, 13353, Berlin, Germany.

Department of Radiology, Technical University Munich, Klinikum Rechts Der Isar, Ismaninger Str. 22, 81675, Munich, Germany.

出版信息

Radiol Med. 2025 Jul 11. doi: 10.1007/s11547-025-02040-9.

DOI:10.1007/s11547-025-02040-9
PMID:40643871
Abstract

PURPOSE

This study investigates the automation of MRI protocoling, a routine task in radiology, using large language models (LLMs), comparing an open-source (LLama 3.1 405B) and a proprietary model (GPT-4o) with and without retrieval-augmented generation (RAG), a method for incorporating domain-specific knowledge.

MATERIAL AND METHODS

This retrospective study included MRI studies conducted between January and December 2023, along with institution-specific protocol assignment guidelines. Clinical questions were extracted, and a neuroradiologist established the gold standard protocol. LLMs were tasked with assigning MRI protocols and contrast medium administration with and without RAG. The results were compared to protocols selected by four radiologists. Token-based symmetric accuracy, the Wilcoxon signed-rank test, and the McNemar test were used for evaluation.

RESULTS

Data from 100 neuroradiology reports (mean age = 54.2 years ± 18.41, women 50%) were included. RAG integration significantly improved accuracy in sequence and contrast media prediction for LLama 3.1 (Sequences: 38% vs. 70%, P < .001, Contrast Media: 77% vs. 94%, P < .001), and GPT-4o (Sequences: 43% vs. 81%, P < .001, Contrast Media: 79% vs. 92%, P = .006). GPT-4o outperformed LLama 3.1 in MRI sequence prediction (81% vs. 70%, P < .001), with comparable accuracies to the radiologists (81% ± 0.21, P = .43). Both models equaled radiologists in predicting contrast media administration (LLama 3.1 RAG: 94% vs. 91% ± 0.2, P = .37, GPT-4o RAG: 92% vs. 91% ± 0.24, P = .48).

CONCLUSION

Large language models show great potential as decision-support tools for MRI protocoling, with performance similar to radiologists. RAG enhances the ability of LLMs to provide accurate, institution-specific protocol recommendations.

摘要

目的

本研究使用大语言模型(LLMs)探究放射学中的常规任务——磁共振成像(MRI)检查方案制定的自动化,比较一个开源模型(Llama 3.1 405B)和一个专有模型(GPT-4o)在有无检索增强生成(RAG,一种纳入特定领域知识的方法)情况下的表现。

材料与方法

这项回顾性研究纳入了2023年1月至12月期间进行的MRI检查,以及机构特定的检查方案分配指南。提取临床问题,由一名神经放射科医生确定金标准检查方案。要求大语言模型在有无RAG的情况下分配MRI检查方案和造影剂使用方案。将结果与四位放射科医生选择的检查方案进行比较。使用基于令牌的对称准确率、Wilcoxon符号秩检验和McNemar检验进行评估。

结果

纳入了100份神经放射学报告的数据(平均年龄 = 54.2岁 ± 18.41,女性占50%)。对于Llama 3.1,RAG整合显著提高了序列和造影剂预测的准确率(序列:38% 对 70%,P <.001,造影剂:77% 对 94%,P <.001),对于GPT-4o也是如此(序列:43% 对 81%,P <.001,造影剂:79% 对 92%,P =.006)。在MRI序列预测方面,GPT-4o优于Llama 3.1(81% 对 70%,P <.001),与放射科医生的准确率相当(81% ± 0.21,P =.43)。在预测造影剂使用方面,两个模型与放射科医生相当(Llama 3.1 RAG:94% 对 91% ± 0.2,P =.37,GPT-4o RAG:92% 对 91% ± 0.24,P =.48)。

结论

大语言模型作为MRI检查方案制定的决策支持工具显示出巨大潜力,其表现与放射科医生相似。RAG增强了大语言模型提供准确的、机构特定检查方案建议的能力。

相似文献

1
Automated MRI protocoling in neuroradiology in the era of large language models.大语言模型时代神经放射学中的自动MRI协议制定
Radiol Med. 2025 Jul 11. doi: 10.1007/s11547-025-02040-9.
2
Data extraction from free-text stroke CT reports using GPT-4o and Llama-3.3-70B: the impact of annotation guidelines.使用GPT-4o和Llama-3.3-70B从自由文本中风CT报告中提取数据:注释指南的影响
Eur Radiol Exp. 2025 Jun 19;9(1):61. doi: 10.1186/s41747-025-00600-2.
3
Performance of ChatGPT-4o and Four Open-Source Large Language Models in Generating Diagnoses Based on China's Rare Disease Catalog: Comparative Study.ChatGPT-4o与四个开源大语言模型基于中国罕见病目录生成诊断的性能:比较研究
J Med Internet Res. 2025 Jun 18;27:e69929. doi: 10.2196/69929.
4
Enhancing Pulmonary Disease Prediction Using Large Language Models With Feature Summarization and Hybrid Retrieval-Augmented Generation: Multicenter Methodological Study Based on Radiology Report.使用具有特征总结和混合检索增强生成功能的大语言模型增强肺部疾病预测:基于放射学报告的多中心方法学研究
J Med Internet Res. 2025 Jun 11;27:e72638. doi: 10.2196/72638.
5
An Institutional Large Language Model for Musculoskeletal MRI Improves Protocol Adherence and Accuracy.用于肌肉骨骼磁共振成像的机构大语言模型可提高方案依从性和准确性。
J Bone Joint Surg Am. 2025 Jul 8. doi: 10.2106/JBJS.24.01429.
6
Predicting 30-Day Postoperative Mortality and American Society of Anesthesiologists Physical Status Using Retrieval-Augmented Large Language Models: Development and Validation Study.使用检索增强大语言模型预测术后30天死亡率和美国麻醉医师协会身体状况:开发与验证研究
J Med Internet Res. 2025 Jun 3;27:e75052. doi: 10.2196/75052.
7
Large language models for error detection in radiology reports: a comparative analysis between closed-source and privacy-compliant open-source models.用于放射学报告错误检测的大语言模型:闭源模型与符合隐私规定的开源模型的对比分析
Eur Radiol. 2025 Feb 20. doi: 10.1007/s00330-025-11438-y.
8
Enhancing Magnetic Resonance Imaging (MRI) Report Comprehension in Spinal Trauma: Readability Analysis of AI-Generated Explanations for Thoracolumbar Fractures.提高脊柱创伤磁共振成像(MRI)报告的理解:胸腰椎骨折人工智能生成解释的可读性分析
JMIR AI. 2025 Jul 1;4:e69654. doi: 10.2196/69654.
9
Scalable evaluation framework for retrieval augmented generation in tobacco research using large Language models.用于烟草研究中使用大语言模型的检索增强生成的可扩展评估框架。
Sci Rep. 2025 Jul 2;15(1):22760. doi: 10.1038/s41598-025-05726-2.
10
A comparative study of recent large language models on generating hospital discharge summaries for lung cancer patients.近期大型语言模型在生成肺癌患者出院小结方面的比较研究。
J Biomed Inform. 2025 Aug;168:104867. doi: 10.1016/j.jbi.2025.104867. Epub 2025 Jun 20.

本文引用的文献

1
The immune landscape and viral shedding of Omicron SARS-CoV-2 variants implicate immune escape.奥密克戎 SARS-CoV-2 变体的免疫格局和病毒脱落表明存在免疫逃逸。
Front Med (Lausanne). 2025 Jan 22;11:1478466. doi: 10.3389/fmed.2024.1478466. eCollection 2024.
2
Large Language Model Ability to Translate CT and MRI Free-Text Radiology Reports Into Multiple Languages.大型语言模型将CT和MRI自由文本放射学报告翻译成多种语言的能力。
Radiology. 2024 Dec;313(3):e241736. doi: 10.1148/radiol.241736.
3
The application of large language models in medicine: A scoping review.
大语言模型在医学中的应用:一项范围综述。
iScience. 2024 Apr 23;27(5):109713. doi: 10.1016/j.isci.2024.109713. eCollection 2024 May 17.
4
Integrating Retrieval-Augmented Generation with Large Language Models in Nephrology: Advancing Practical Applications.将检索增强生成与大型语言模型在肾脏病学中的整合:推进实际应用。
Medicina (Kaunas). 2024 Mar 8;60(3):445. doi: 10.3390/medicina60030445.
5
Feasibility of Using the Privacy-preserving Large Language Model Vicuna for Labeling Radiology Reports.使用隐私保护的大型语言模型 Vicuna 对放射科报告进行标注的可行性研究。
Radiology. 2023 Oct;309(1):e231147. doi: 10.1148/radiol.231147.
6
Potential Use Cases for ChatGPT in Radiology Reporting.ChatGPT 在放射科报告中的潜在应用案例。
AJR Am J Roentgenol. 2023 Sep;221(3):373-376. doi: 10.2214/AJR.23.29198. Epub 2023 Apr 19.
7
Automated Protocoling for MRI Exams-Challenges and Solutions.MRI 检查的自动化协议制定:挑战与解决方案。
J Digit Imaging. 2022 Oct;35(5):1293-1302. doi: 10.1007/s10278-022-00610-1. Epub 2022 Aug 30.
8
Automatic medical protocol classification using machine learning approaches.使用机器学习方法进行自动医疗协议分类。
Comput Methods Programs Biomed. 2021 Mar;200:105939. doi: 10.1016/j.cmpb.2021.105939. Epub 2021 Jan 16.
9
Machine Learning for Automation of Radiology Protocols for Quality and Efficiency Improvement.机器学习在放射学协议自动化中的应用,以提高质量和效率。
J Am Coll Radiol. 2020 Sep;17(9):1149-1158. doi: 10.1016/j.jacr.2020.03.012. Epub 2020 Apr 9.
10
Efficiency Improvement in a Busy Radiology Practice: Determination of Musculoskeletal Magnetic Resonance Imaging Protocol Using Deep-Learning Convolutional Neural Networks.繁忙放射科实践中的效率提升:使用深度学习卷积神经网络确定肌肉骨骼磁共振成像方案。
J Digit Imaging. 2018 Oct;31(5):604-610. doi: 10.1007/s10278-018-0066-y.