放射学中的人工智能（AI）：深入探讨ChatGPT 4.0与《美国神经放射学杂志》（AJNR）“月度病例”的准确性。

Artificial Intelligence (AI) in Radiology: A Deep Dive Into ChatGPT 4.0's Accuracy with the American Journal of Neuroradiology's (AJNR) "Case of the Month".

作者信息

Suthar Pokhraj P, Kounsal Avin, Chhetri Lavanya, Saini Divya, Dua Sumeet G

机构信息

Department of Diagnostic Radiology and Nuclear Medicine, Rush University Medical Center, Chicago, USA.

Department of Clinical Nutrition, Rush University Medical Center, Chicago, USA.

出版信息

Cureus. 2023 Aug 23;15(8):e43958. doi: 10.7759/cureus.43958. eCollection 2023 Aug.

DOI:10.7759/cureus.43958

PMID:37746411

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC10516448/

Abstract

The advent of artificial intelligence (AI), particularly large language models (LLMs) such as ChatGPT 4.0, holds significant potential in healthcare, specifically in radiology. This study examined the accuracy of ChatGPT 4.0 (July 20, 2023, version) in solving diagnostic quizzes from the American Journal of Neuroradiology's (AJNR) "Case of the Month." We evaluated the diagnostic accuracy of ChatGPT 4.0 when provided with a patient's history and imaging findings weekly over four weeks, using 140 cases from the AJNR "Case of the Month" portal (from November 2011 to July 2023). The overall diagnostic accuracy was found to be 57.86% (81 out of 140 cases). The diagnostic performance varied across brain, head and neck, and spine subgroups, with accuracy rates of 54.65%, 67.65%, and 55.0%, respectively. These findings suggest that AI models such as ChatGPT 4.0 could serve as useful adjuncts in radiological diagnostics, thus potentially enhancing patient care and revolutionizing medical education.

摘要

人工智能（AI）的出现，尤其是像ChatGPT 4.0这样的大型语言模型（LLM），在医疗保健领域，特别是放射学领域具有巨大潜力。本研究考察了ChatGPT 4.0（2023年7月20日版本）解答美国神经放射学杂志（AJNR）“月度病例”诊断测验的准确性。我们使用来自AJNR“月度病例”门户（2011年11月至2023年7月）的140个病例，在四周内每周向ChatGPT 4.0提供患者病史和影像检查结果，评估其诊断准确性。总体诊断准确率为57.86%（140个病例中的81个）。诊断表现因脑、头颈部和脊柱亚组而异，准确率分别为54.65%、67.65%和55.0%。这些发现表明，像ChatGPT 4.0这样的人工智能模型可以作为放射诊断中的有用辅助工具，从而有可能改善患者护理并彻底改变医学教育。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/da7a/10516448/e3e22b14e765/cureus-0015-00000043958-i01.jpg

相似文献

Artificial Intelligence (AI) in Radiology: A Deep Dive Into ChatGPT 4.0's Accuracy with the American Journal of Neuroradiology's (AJNR) "Case of the Month".放射学中的人工智能（AI）：深入探讨ChatGPT 4.0与《美国神经放射学杂志》（AJNR）“月度病例”的准确性。

Cureus. 2023 Aug 23;15(8):e43958. doi: 10.7759/cureus.43958. eCollection 2023 Aug.

Comparative Evaluation of AI Models Such as ChatGPT 3.5, ChatGPT 4.0, and Google Gemini in Neuroradiology Diagnostics.ChatGPT 3.5、ChatGPT 4.0和谷歌Gemini等人工智能模型在神经放射学诊断中的比较评估

Cureus. 2024 Aug 25;16(8):e67766. doi: 10.7759/cureus.67766. eCollection 2024 Aug.

Accuracy of ChatGPT generated diagnosis from patient's medical history and imaging findings in neuroradiology cases.ChatGPT根据患者病史和影像学检查结果对神经放射学病例进行诊断的准确性。

Neuroradiology. 2024 Jan;66(1):73-79. doi: 10.1007/s00234-023-03252-4. Epub 2023 Nov 23.

Evaluating ChatGPT-4's Diagnostic Accuracy: Impact of Visual Data Integration.评估ChatGPT-4的诊断准确性：视觉数据整合的影响。

JMIR Med Inform. 2024 Apr 9;12:e55627. doi: 10.2196/55627.

Comparing the Diagnostic Performance of GPT-4-based ChatGPT, GPT-4V-based ChatGPT, and Radiologists in Challenging Neuroradiology Cases.比较基于 GPT-4 的 ChatGPT、基于 GPT-4V 的 ChatGPT 和放射科医生在神经放射学挑战性病例中的诊断性能。

Clin Neuroradiol. 2024 Dec;34(4):779-787. doi: 10.1007/s00062-024-01426-y. Epub 2024 May 28.

Evaluating the Artificial Intelligence Performance Growth in Ophthalmic Knowledge.评估眼科知识领域中人工智能性能的增长情况。

Cureus. 2023 Sep 21;15(9):e45700. doi: 10.7759/cureus.45700. eCollection 2023 Sep.

Revolutionizing radiology with GPT-based models: Current applications, future possibilities and limitations of ChatGPT.基于 GPT 的模型推动放射学革命：ChatGPT 的当前应用、未来可能性和局限性。

Diagn Interv Imaging. 2023 Jun;104(6):269-274. doi: 10.1016/j.diii.2023.02.003. Epub 2023 Feb 28.

Assessing ChatGPT 4.0's test performance and clinical diagnostic accuracy on USMLE STEP 2 CK and clinical case reports.评估ChatGPT 4.0在美国医师执照考试第二步临床知识考试（USMLE STEP 2 CK）及临床病例报告中的测试表现和临床诊断准确性。

Sci Rep. 2024 Apr 23;14(1):9330. doi: 10.1038/s41598-024-58760-x.

Large language models for structured reporting in radiology: performance of GPT-4, ChatGPT-3.5, Perplexity and Bing.用于放射科结构化报告的大型语言模型：GPT-4、ChatGPT-3.5、Perplexity 和 Bing 的性能。

Radiol Med. 2023 Jul;128(7):808-812. doi: 10.1007/s11547-023-01651-4. Epub 2023 May 29.

Benchmarking large language models' performances for myopia care: a comparative analysis of ChatGPT-3.5, ChatGPT-4.0, and Google Bard.比较分析 ChatGPT-3.5、ChatGPT-4.0 和谷歌巴德在近视防控方面的表现：大型语言模型的基准测试。

EBioMedicine. 2023 Sep;95:104770. doi: 10.1016/j.ebiom.2023.104770. Epub 2023 Aug 23.

引用本文的文献

Current trends and future prospects of language models and processing systems in spine surgery - a scoping review.脊柱手术中语言模型和处理系统的当前趋势与未来前景——一项范围综述

Neurosurg Rev. 2025 Sep 5;48(1):633. doi: 10.1007/s10143-025-03785-7.

Layer by Layer: Assessing AI Diagnostic Accuracy With Incremental Case Information in Neuroradiology.逐层分析：利用神经放射学中的增量病例信息评估人工智能诊断准确性

Cureus. 2025 Jun 12;17(6):e85874. doi: 10.7759/cureus.85874. eCollection 2025 Jun.

Diagnostic Accuracy of Microsoft's Copilot Artificial Intelligence in Chronic Wound Assessment: A Comparative Study.微软Copilot人工智能在慢性伤口评估中的诊断准确性：一项比较研究。

Plast Reconstr Surg Glob Open. 2025 Jun 12;13(6):e6871. doi: 10.1097/GOX.0000000000006871. eCollection 2025 Jun.

Comparative Evaluation of Large Language and Multimodal Models in Detecting Spinal Stabilization Systems on X-Ray Images.大语言模型和多模态模型在X射线图像中检测脊柱稳定系统的比较评估

J Clin Med. 2025 May 8;14(10):3282. doi: 10.3390/jcm14103282.

The Accuracy of ChatGPT-4o in Interpreting Chest and Abdominal X-Ray Images.ChatGPT-4o 在解读胸部和腹部 X 光图像方面的准确性。

J Pers Med. 2025 May 10;15(5):194. doi: 10.3390/jpm15050194.

[Potential applications of large language models in trauma surgery : Opportunities, risks and perspectives].[大语言模型在创伤外科中的潜在应用：机遇、风险与展望]

Unfallchirurgie (Heidelb). 2025 May 12. doi: 10.1007/s00113-025-01581-y.

A systematic review and meta-analysis of diagnostic performance comparison between generative AI and physicians.生成式人工智能与医生诊断性能比较的系统评价与荟萃分析

NPJ Digit Med. 2025 Mar 22;8(1):175. doi: 10.1038/s41746-025-01543-z.

Letter to the Editor: "Comparative analysis of GPT-4-based ChatGPT's diagnostic performance with radiologists using real-world radiology reports of brain tumors".致编辑的信：“基于GPT-4的ChatGPT与放射科医生在脑肿瘤真实世界放射学报告中的诊断性能比较分析”

Eur Radiol. 2025 Mar;35(3):1107-1108. doi: 10.1007/s00330-024-11280-8. Epub 2025 Jan 2.

ChatGPT-4 Turbo and Meta's LLaMA 3.1: A Relative Analysis of Answering Radiology Text-Based Questions.ChatGPT-4 Turbo与Meta的LLaMA 3.1：基于放射学文本问题回答的相关性分析

Cureus. 2024 Nov 24;16(11):e74359. doi: 10.7759/cureus.74359. eCollection 2024 Nov.

Analyzing evaluation methods for large language models in the medical field: a scoping review.分析医学领域大语言模型的评价方法：范围综述。

BMC Med Inform Decis Mak. 2024 Nov 29;24(1):366. doi: 10.1186/s12911-024-02709-7.

本文引用的文献

ChatGPT's Diagnostic Performance from Patient History and Imaging Findings on the Diagnosis Please Quizzes.ChatGPT在诊断问答中基于患者病史和影像检查结果的诊断性能。

Radiology. 2023 Jul;308(1):e231040. doi: 10.1148/radiol.231040.

Chatbot vs Medical Student Performance on Free-Response Clinical Reasoning Examinations.聊天机器人与医学生在自由应答临床推理考试中的表现对比

JAMA Intern Med. 2023 Sep 1;183(9):1028-1030. doi: 10.1001/jamainternmed.2023.2909.

Evaluating GPT4 on Impressions Generation in Radiology Reports.评估GPT4在生成放射学报告印象方面的表现。

Radiology. 2023 Jun;307(5):e231259. doi: 10.1148/radiol.231259.

ChatGPT's quiz skills in different otolaryngology subspecialties: an analysis of 2576 single-choice and multiple-choice board certification preparation questions.ChatGPT 在不同耳鼻喉科亚专业中的测验技能：对 2576 道选择题和多选题进行 board certification 准备的分析。

Eur Arch Otorhinolaryngol. 2023 Sep;280(9):4271-4278. doi: 10.1007/s00405-023-08051-4. Epub 2023 Jun 7.

GPT-4 in Radiology: Improvements in Advanced Reasoning.GPT-4 在放射学中的应用：高级推理能力的提升。

Radiology. 2023 Jun;307(5):e230987. doi: 10.1148/radiol.230987. Epub 2023 May 16.

Potential Use Cases for ChatGPT in Radiology Reporting.ChatGPT 在放射科报告中的潜在应用案例。

AJR Am J Roentgenol. 2023 Sep;221(3):373-376. doi: 10.2214/AJR.23.29198. Epub 2023 Apr 19.

Integrating Al Algorithms into the Clinical Workflow.将人工智能算法整合到临床工作流程中。

Radiol Artif Intell. 2021 Aug 4;3(6):e210013. doi: 10.1148/ryai.2021210013. eCollection 2021 Nov.

Artificial Intelligence in Low- and Middle-Income Countries: Innovating Global Health Radiology.人工智能在中低收入国家：创新全球放射健康。

Radiology. 2020 Dec;297(3):513-520. doi: 10.1148/radiol.2020201434. Epub 2020 Oct 6.

Advances in natural language processing.自然语言处理的进展。

Science. 2015 Jul 17;349(6245):261-6. doi: 10.1126/science.aaa8685.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验

放射学中的人工智能（AI）：深入探讨ChatGPT 4.0与《美国神经放射学杂志》（AJNR）“月度病例”的准确性。

Artificial Intelligence (AI) in Radiology: A Deep Dive Into ChatGPT 4.0's Accuracy with the American Journal of Neuroradiology's (AJNR) "Case of the Month".

作者信息

机构信息

出版信息

相似文献

引用本文的文献

本文引用的文献

文献检索

文件翻译

深度研究

Suppr 超能文献

相似文献

引用本文的文献

本文引用的文献