• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

GPT-4 是否患有神经恐惧症?人工智能聊天机器人在临床病例中的定位和诊断准确性。

Does GPT-4 have neurophobia? Localization and diagnostic accuracy of an artificial intelligence-powered chatbot in clinical vignettes.

机构信息

Department of Neurology, Stanford University School of Medicine, Palo Alto, CA, United States of America.

Department of Neurology, University of Texas at Austin, Austin, TX, United States of America.

出版信息

J Neurol Sci. 2023 Oct 15;453:120804. doi: 10.1016/j.jns.2023.120804. Epub 2023 Sep 15.

DOI:10.1016/j.jns.2023.120804
PMID:37741773
Abstract

BACKGROUND AND OBJECTIVES

This is an observational study of the performance of an artificial intelligence-powered chatbot tasked with solving unknown neurologic case vignettes. The primary objective of the study is to assess the current capabilities of widely-accessible artificial intelligence within the field of clinical neurology in order to determine how this technology can be deployed in clinical practice, and what insights can be learned from its performance and translated to clinical education.

METHODS

This observational study tested the accuracy of GPT-4, an artificial intelligence-powered chatbot, at appropriately localizing and generating a differential diagnosis for a series of 29 clinical case vignettes. The cases were from previously published educational material prepared for learners. No cases required more than text input, a current limitation of GPT-4. The primary outcome measures were ranked accuracy of localization and differential diagnosis based on clinical history and exam alone and after ancillary clinical data was provided. Secondary outcome measures included a comparison of accuracy by case difficulty.

RESULTS

GPT-4 identified the correct localization less than 50% of the time and performed worse when provided ancillary testing. GPT-4 was more accurate with localization and diagnosis of easier versus harder cases. Diagnostic accuracy was independent of its ability to localize the lesion.

DISCUSSION

GPT-4 did not perform as well on neurology clinical vignettes as compared to reported accuracy when provided other medical clinical vignettes. Incorporation of an AI chatbot into the practice of clinical neurology will require neurology-focused teaching.

摘要

背景和目的

这是一项观察性研究,旨在评估人工智能驱动的聊天机器人在解决未知神经科病例小费时的表现。该研究的主要目的是评估广泛可及的人工智能在临床神经学领域的当前能力,以确定如何在临床实践中部署这项技术,以及可以从其性能中获得哪些见解并转化为临床教育。

方法

这项观察性研究测试了人工智能聊天机器人 GPT-4 对一系列 29 个临床病例小费时进行适当定位和生成鉴别诊断的准确性。这些病例来自为学习者准备的先前发表的教育材料。GPT-4 目前仅能接受文本输入,因此没有案例需要更多输入。主要结局指标是根据临床病史和检查进行定位和鉴别诊断的准确性排名,以及在提供辅助临床数据后的准确性排名。次要结局指标包括按病例难度比较准确性。

结果

GPT-4 确定正确定位的准确率不到 50%,在提供辅助测试时表现更差。GPT-4 对较简单病例的定位和诊断更准确。诊断准确性与其定位病变的能力无关。

讨论

与提供其他医学临床病例时的报告准确性相比,GPT-4 在神经科病例小费时的表现并不理想。将人工智能聊天机器人纳入临床神经科实践将需要针对神经科的教学。

相似文献

1
Does GPT-4 have neurophobia? Localization and diagnostic accuracy of an artificial intelligence-powered chatbot in clinical vignettes.GPT-4 是否患有神经恐惧症?人工智能聊天机器人在临床病例中的定位和诊断准确性。
J Neurol Sci. 2023 Oct 15;453:120804. doi: 10.1016/j.jns.2023.120804. Epub 2023 Sep 15.
2
Embracing the future-is artificial intelligence already better? A comparative study of artificial intelligence performance in diagnostic accuracy and decision-making.拥抱未来——人工智能已经更胜一筹了吗?人工智能在诊断准确性和决策方面的性能比较研究。
Eur J Neurol. 2024 Apr;31(4):e16195. doi: 10.1111/ene.16195. Epub 2024 Jan 18.
3
A retrieval-augmented chatbot based on GPT-4 provides appropriate differential diagnosis in gastrointestinal radiology: a proof of concept study.基于 GPT-4 的检索增强型聊天机器人可在胃肠放射学中提供适当的鉴别诊断:概念验证研究。
Eur Radiol Exp. 2024 May 17;8(1):60. doi: 10.1186/s41747-024-00457-x.
4
Diagnostic Accuracy of Differential-Diagnosis Lists Generated by Generative Pretrained Transformer 3 Chatbot for Clinical Vignettes with Common Chief Complaints: A Pilot Study.基于生成式预训练 Transformer 3 聊天机器人为常见主诉临床病例生成鉴别诊断列表的诊断准确性:一项初步研究。
Int J Environ Res Public Health. 2023 Feb 15;20(4):3378. doi: 10.3390/ijerph20043378.
5
The diagnostic and triage accuracy of the GPT-3 artificial intelligence model: an observational study.GPT-3 人工智能模型的诊断和分诊准确性:一项观察性研究。
Lancet Digit Health. 2024 Aug;6(8):e555-e561. doi: 10.1016/S2589-7500(24)00097-9.
6
The Accuracy and Potential Racial and Ethnic Biases of GPT-4 in the Diagnosis and Triage of Health Conditions: Evaluation Study.GPT-4在健康状况诊断和分诊中的准确性及潜在的种族和民族偏见:评估研究
JMIR Med Educ. 2023 Nov 2;9:e47532. doi: 10.2196/47532.
7
Diagnostic accuracy of large language models in psychiatry.精神科大语言模型的诊断准确性。
Asian J Psychiatr. 2024 Oct;100:104168. doi: 10.1016/j.ajp.2024.104168. Epub 2024 Jul 25.
8
A content-aware chatbot based on GPT 4 provides trustworthy recommendations for Cone-Beam CT guidelines in dental imaging.基于GPT 4的内容感知聊天机器人为牙科成像中的锥形束CT指南提供可靠建议。
Dentomaxillofac Radiol. 2024 Feb 8;53(2):109-114. doi: 10.1093/dmfr/twad015.
9
ChatGPT-Generated Differential Diagnosis Lists for Complex Case-Derived Clinical Vignettes: Diagnostic Accuracy Evaluation.基于复杂病例临床案例生成的ChatGPT鉴别诊断列表:诊断准确性评估。
JMIR Med Inform. 2023 Oct 9;11:e48808. doi: 10.2196/48808.
10
The Diagnostic and Triage Accuracy of the GPT-3 Artificial Intelligence Model.GPT-3人工智能模型的诊断与分诊准确性
medRxiv. 2023 Feb 1:2023.01.30.23285067. doi: 10.1101/2023.01.30.23285067.

引用本文的文献

1
A systematic review and meta-analysis of diagnostic performance comparison between generative AI and physicians.生成式人工智能与医生诊断性能比较的系统评价与荟萃分析
NPJ Digit Med. 2025 Mar 22;8(1):175. doi: 10.1038/s41746-025-01543-z.
2
Accuracy of ChatGPT in Neurolocalization.ChatGPT在神经定位方面的准确性。
Cureus. 2024 Apr 27;16(4):e59143. doi: 10.7759/cureus.59143. eCollection 2024 Apr.
3
GPT-4 Performance for Neurologic Localization.GPT-4在神经定位方面的表现。
Neurol Clin Pract. 2024 Jun;14(3):e200293. doi: 10.1212/CPJ.0000000000200293. Epub 2024 Mar 27.