Challenging cases of hyponatremia incorrectly interpreted by ChatGPT.

Authors

Berend Kenrick, Duits Ashley, Gans Reinold O B

Affiliations

Department of Medicine, Curaçao Medical Center, Willemstad, Curaçao.

Institute for Medical Education, University of Groningen, University Medical Center Groningen, Groningen, The Netherlands.

Publication

BMC Med Educ. 2025 May 22;25(1):751. doi: 10.1186/s12909-025-07235-2.

DOI: 10.1186/s12909-025-07235-2
PMID: 40405178
Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC12100905/

Abstract

BACKGROUND

In clinical medicine, hyponatremia must frequently be assessed, yet it is a known source of major diagnostic errors, substantial mismanagement, and iatrogenic morbidity. Because artificial intelligence techniques are efficient at analyzing complex problems, their use may overcome current assessment limitations. There is no literature on the use of Chat Generative Pre-trained Transformer (ChatGPT-3.5) for evaluating difficult hyponatremia cases. Because of their interesting pathophysiology, hyponatremia cases are often used in medical education for students to evaluate, and students increasingly use artificial intelligence as a diagnostic tool. To evaluate this possibility, four previously published challenging hyponatremia cases were presented to the free ChatGPT-3.5 for diagnosis and treatment suggestions.

METHODS

We used four challenging hyponatremia cases that had been evaluated by 46 physicians in Canada, the Netherlands, South Africa, Taiwan, and the USA and published previously. These four cases were presented twice to the free ChatGPT version 3.5, in December 2023 and again in September 2024, with the request to recommend a diagnosis and therapy. Responses by ChatGPT were compared with those of the clinicians.

RESULTS

Cases 1 and 3 each have a single cause of hyponatremia. Cases 2 and 4 each have two contributing hyponatremia features. Neither ChatGPT in 2023 nor the 46 clinicians, whose assessment was described in the original publication, recognized the most crucial cause of hyponatremia, which has major therapeutic consequences, in all four cases. In 2024, ChatGPT properly diagnosed and suggested adequate management in one case. Concurrent Addison's disease was correctly recognized in case 1 by ChatGPT in both 2023 and 2024, whereas 81% of the clinicians missed this diagnosis. ChatGPT gave no proper therapeutic recommendations in 2023 in any of the four cases, but in 2024 it gave adequate advice in one case. For cases 1 to 4, the 46 clinicians recommended inadequate therapy in 65%, 57%, 2%, and 76%, respectively.

CONCLUSION

Our study currently does not support the use of the free version of ChatGPT 3.5 in difficult hyponatremia cases, although a small improvement was observed after ten months with the same ChatGPT 3.5 version. Patients, health professionals, medical educators, and students should be aware of the shortcomings of the diagnosis and therapy suggestions given by ChatGPT.
