• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

逐层分析:利用神经放射学中的增量病例信息评估人工智能诊断准确性

Layer by Layer: Assessing AI Diagnostic Accuracy With Incremental Case Information in Neuroradiology.

作者信息

Lotfian Golnaz, Jhaveri Miral, Dua Sumeet G, Suthar Pokhraj P

机构信息

Department of Diagnostic Radiology and Nuclear Medicine, Rush University Medical Center, Chicago, USA.

出版信息

Cureus. 2025 Jun 12;17(6):e85874. doi: 10.7759/cureus.85874. eCollection 2025 Jun.

DOI:10.7759/cureus.85874
PMID:40656311
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC12255534/
Abstract

Aim Artificial intelligence (AI) has proven tremendous potential in improving diagnostic accuracy and efficiency in radiology. This study assesses the diagnostic performance of Google Gemini (version 1.5 Flash; Google DeepMind, Mountain View, California, USA), a proprietary large language model, in interpreting challenging diagnostic cases from the "Case of the Month" series. Materials and methods We analyzed 143 neuroradiology cases spanning brain, head and neck, and spine areas. Each case evolved over four weeks, starting with clinical history and followed by incremental imaging findings. Google Gemini was often prompted with the question, "What is the diagnosis?" Its accuracy was assessed at each level and across specialty categories. The data used were publicly available, and no ethical approval was necessary. Results Gemini's diagnosis accuracy improved with new case data, from 3.5% with history alone to 45.7% after complete imaging was supplied. Accuracy by category was highest in spine cases (51.9%), followed by head and neck (45.5%) and brain (44.0%). A chi-square test for trend verified that the performance increase over time was statistically significant (p < 0.0000000001). Conclusion Google Gemini displays moderate diagnosis accuracy that improves with accumulated information. While encouraging, its shortcomings underline the necessity for continual validation and transparency. This study shows the expanding relevance of AI in neuroradiology and the necessity of comprehensive evaluation before clinical integration.

摘要

目的 人工智能(AI)已在提高放射学诊断准确性和效率方面展现出巨大潜力。本研究评估了专有大语言模型谷歌Gemini(1.5 Flash版本;谷歌DeepMind,美国加利福尼亚州山景城)在解读“月度病例”系列中具有挑战性的诊断病例时的诊断性能。材料与方法 我们分析了143例涵盖脑、头颈部和脊柱区域的神经放射学病例。每个病例历时四周,从临床病史开始,随后是逐步增加的影像学检查结果。谷歌Gemini经常被问到“诊断是什么?”这个问题。在每个阶段以及跨专业类别评估其准确性。所使用的数据是公开可用的,无需伦理批准。结果 随着新病例数据的增加,Gemini的诊断准确性有所提高,仅根据病史时为3.5%,在提供完整影像学检查结果后提高到45.7%。按类别划分,脊柱病例的准确性最高(51.9%),其次是头颈部(45.5%)和脑(44.0%)。趋势的卡方检验证实,随着时间推移性能的提高具有统计学意义(p < 0.0000000001)。结论 谷歌Gemini显示出适度的诊断准确性,且随着信息积累而提高。尽管令人鼓舞,但其缺点凸显了持续验证和透明度的必要性。本研究表明AI在神经放射学中的相关性不断扩大,以及在临床整合之前进行全面评估的必要性。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f844/12255534/a26ee3c463c2/cureus-0017-00000085874-i05.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f844/12255534/8b1aba9e4dc3/cureus-0017-00000085874-i01.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f844/12255534/9f5353fa8452/cureus-0017-00000085874-i02.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f844/12255534/22e3aa94a9a9/cureus-0017-00000085874-i03.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f844/12255534/e4249e15772e/cureus-0017-00000085874-i04.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f844/12255534/a26ee3c463c2/cureus-0017-00000085874-i05.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f844/12255534/8b1aba9e4dc3/cureus-0017-00000085874-i01.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f844/12255534/9f5353fa8452/cureus-0017-00000085874-i02.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f844/12255534/22e3aa94a9a9/cureus-0017-00000085874-i03.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f844/12255534/e4249e15772e/cureus-0017-00000085874-i04.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f844/12255534/a26ee3c463c2/cureus-0017-00000085874-i05.jpg

相似文献

1
Layer by Layer: Assessing AI Diagnostic Accuracy With Incremental Case Information in Neuroradiology.逐层分析:利用神经放射学中的增量病例信息评估人工智能诊断准确性
Cureus. 2025 Jun 12;17(6):e85874. doi: 10.7759/cureus.85874. eCollection 2025 Jun.
2
Artificial Intelligence in Peripheral Artery Disease Education: A Battle Between ChatGPT and Google Gemini.外周动脉疾病教育中的人工智能:ChatGPT与谷歌Gemini的较量
Cureus. 2025 Jun 1;17(6):e85174. doi: 10.7759/cureus.85174. eCollection 2025 Jun.
3
Signs and symptoms to determine if a patient presenting in primary care or hospital outpatient settings has COVID-19.在基层医疗机构或医院门诊环境中,如果患者出现以下症状和体征,可判断其是否患有 COVID-19。
Cochrane Database Syst Rev. 2022 May 20;5(5):CD013665. doi: 10.1002/14651858.CD013665.pub3.
4
123I-MIBG scintigraphy and 18F-FDG-PET imaging for diagnosing neuroblastoma.用于诊断神经母细胞瘤的123I-间碘苄胍闪烁扫描术和18F-氟代脱氧葡萄糖正电子发射断层显像
Cochrane Database Syst Rev. 2015 Sep 29;2015(9):CD009263. doi: 10.1002/14651858.CD009263.pub2.
5
Artificial intelligence for diagnosing exudative age-related macular degeneration.人工智能在渗出性年龄相关性黄斑变性诊断中的应用。
Cochrane Database Syst Rev. 2024 Oct 17;10(10):CD015522. doi: 10.1002/14651858.CD015522.pub2.
6
Magnetic resonance perfusion for differentiating low-grade from high-grade gliomas at first presentation.首次就诊时磁共振灌注成像用于鉴别低级别与高级别胶质瘤
Cochrane Database Syst Rev. 2018 Jan 22;1(1):CD011551. doi: 10.1002/14651858.CD011551.pub2.
7
Clinical symptoms, signs and tests for identification of impending and current water-loss dehydration in older people.老年人即将发生和当前失水脱水的识别的临床症状、体征及检查
Cochrane Database Syst Rev. 2015 Apr 30;2015(4):CD009647. doi: 10.1002/14651858.CD009647.pub2.
8
Intravenous magnesium sulphate and sotalol for prevention of atrial fibrillation after coronary artery bypass surgery: a systematic review and economic evaluation.静脉注射硫酸镁和索他洛尔预防冠状动脉搭桥术后房颤:系统评价与经济学评估
Health Technol Assess. 2008 Jun;12(28):iii-iv, ix-95. doi: 10.3310/hta12280.
9
Artificial Intelligence Shows Limited Success in Improving Readability Levels of Spanish-language Orthopaedic Patient Education Materials.人工智能在提高西班牙语骨科患者教育材料的可读性方面成效有限。
Clin Orthop Relat Res. 2025 Feb 11. doi: 10.1097/CORR.0000000000003413.
10
Artificial intelligence for detecting keratoconus.人工智能在圆锥角膜检测中的应用。
Cochrane Database Syst Rev. 2023 Nov 15;11(11):CD014911. doi: 10.1002/14651858.CD014911.pub2.

本文引用的文献

1
Evaluating Brain Tumor Detection with Deep Learning Convolutional Neural Networks Across Multiple MRI Modalities.使用深度学习卷积神经网络跨多种磁共振成像模态评估脑肿瘤检测
J Imaging. 2024 Nov 21;10(12):296. doi: 10.3390/jimaging10120296.
2
Evaluation of ChatGPT 4.0 in Thoracic Imaging and Diagnostics.ChatGPT 4.0在胸部影像学与诊断中的评估
Cureus. 2024 Nov 15;16(11):e73741. doi: 10.7759/cureus.73741. eCollection 2024 Nov.
3
Comparative Accuracy of ChatGPT 4.0 and Google Gemini in Answering Pediatric Radiology Text-Based Questions.
ChatGPT 4.0与谷歌Gemini在回答基于文本的儿科放射学问题时的比较准确性
Cureus. 2024 Oct 5;16(10):e70897. doi: 10.7759/cureus.70897. eCollection 2024 Oct.
4
Clinical Impact of an AI Decision Support System for Detection of Intracranial Hemorrhage in CT Scans.人工智能决策支持系统对CT扫描中颅内出血检测的临床影响
Neurotrauma Rep. 2024 Oct 14;5(1):1009-1015. doi: 10.1089/neur.2024.0017. eCollection 2024.
5
Generalization-a key challenge for responsible AI in patient-facing clinical applications.泛化——面向患者的临床应用中负责任人工智能的关键挑战。
NPJ Digit Med. 2024 May 21;7(1):126. doi: 10.1038/s41746-024-01127-3.
6
Generative AI in healthcare: an implementation science informed translational path on application, integration and governance.生成式人工智能在医疗保健领域的应用、整合和治理:基于实施科学的转化途径。
Implement Sci. 2024 Mar 15;19(1):27. doi: 10.1186/s13012-024-01357-9.
7
PulmoNet: a novel deep learning based pulmonary diseases detection model.PulmoNet:一种新型基于深度学习的肺部疾病检测模型。
BMC Med Imaging. 2024 Feb 28;24(1):51. doi: 10.1186/s12880-024-01227-2.
8
Ethical implications of AI and robotics in healthcare: A review.人工智能和机器人技术在医疗保健中的伦理问题:综述。
Medicine (Baltimore). 2023 Dec 15;102(50):e36671. doi: 10.1097/MD.0000000000036671.
9
Artificial Intelligence (AI) in Radiology: A Deep Dive Into ChatGPT 4.0's Accuracy with the American Journal of Neuroradiology's (AJNR) "Case of the Month".放射学中的人工智能(AI):深入探讨ChatGPT 4.0与《美国神经放射学杂志》(AJNR)“月度病例”的准确性。
Cureus. 2023 Aug 23;15(8):e43958. doi: 10.7759/cureus.43958. eCollection 2023 Aug.
10
Considerations for addressing bias in artificial intelligence for health equity.解决人工智能中影响健康公平性的偏差的考量因素。
NPJ Digit Med. 2023 Sep 12;6(1):170. doi: 10.1038/s41746-023-00913-9.