Evaluating the performance of large language models: ChatGPT and Google Bard in generating differential diagnoses in clinicopathological conferences of neurodegenerative disorders.

Affiliation

Department of Neuroscience, Mayo Clinic, Jacksonville, Florida, USA.

Publication information

Brain Pathol. 2024 May;34(3):e13207. doi: 10.1111/bpa.13207. Epub 2023 Aug 8.

Abstract

This study explores the utility of large language models (LLMs), specifically ChatGPT and Google Bard, in predicting neuropathologic diagnoses from clinical summaries. A total of 25 cases of neurodegenerative disorders presented at Mayo Clinic brain bank Clinico-Pathological Conferences were analyzed. The LLMs provided multiple pathologic diagnoses with rationales, which were compared with the final clinical diagnoses made by physicians. ChatGPT-3.5, ChatGPT-4, and Google Bard made the correct primary diagnosis in 32%, 52%, and 40% of cases, respectively, and included the correct diagnosis among their listed diagnoses in 76%, 84%, and 76% of cases, respectively. These findings highlight the potential of artificial intelligence tools such as ChatGPT in neuropathology and suggest that they may facilitate more comprehensive discussions in clinicopathological conferences.
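The abstract reports only percentages, so it can help to see what they imply as whole-case counts out of the 25 cases analyzed. The sketch below is a minimal illustration, not part of the study: the names `N_CASES`, `REPORTED`, and `percent_to_cases` are hypothetical, and it assumes every percentage was computed over all 25 cases.

```python
# Number of cases analyzed, as stated in the abstract.
N_CASES = 25

# Percentages reported in the abstract.
# "primary": the correct diagnosis was the model's primary diagnosis.
# "included": the correct diagnosis appeared among the model's diagnoses.
REPORTED = {
    "ChatGPT-3.5": {"primary": 32, "included": 76},
    "ChatGPT-4": {"primary": 52, "included": 84},
    "Google Bard": {"primary": 40, "included": 76},
}


def percent_to_cases(percent, n_cases=N_CASES):
    """Convert a reported percentage to the nearest whole number of cases.

    Assumption (not stated in the abstract): the denominator is n_cases.
    """
    return round(percent * n_cases / 100)


# Implied case counts per model, derived from the percentages above.
implied_counts = {
    model: {metric: percent_to_cases(value) for metric, value in rates.items()}
    for model, rates in REPORTED.items()
}

for model, counts in implied_counts.items():
    print(
        f"{model}: primary {counts['primary']}/{N_CASES}, "
        f"included {counts['included']}/{N_CASES}"
    )
```

Under that assumption, the reported rates correspond to 8, 13, and 10 of 25 cases for the primary diagnosis, and 19, 21, and 19 of 25 cases for inclusion among the listed diagnoses. These counts are derived here, not quoted from the paper.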
