Suppr超能文献

GPT-3.5和GPT-4在日本医师执照考试中的表现:比较研究。

Performance of GPT-3.5 and GPT-4 on the Japanese Medical Licensing Examination: Comparison Study.

作者信息

Takagi Soshi, Watari Takashi, Erabi Ayano, Sakaguchi Kota

机构信息

Faculty of Medicine, Shimane University, Izumo, Japan.

General Medicine Center, Shimane University Hospital, Izumo, Japan.

出版信息

JMIR Med Educ. 2023 Jun 29;9:e48002. doi: 10.2196/48002.

Abstract

BACKGROUND

The competence of ChatGPT (Chat Generative Pre-Trained Transformer) in non-English languages is not well studied.

OBJECTIVE

This study compared the performances of GPT-3.5 (Generative Pre-trained Transformer) and GPT-4 on the Japanese Medical Licensing Examination (JMLE) to evaluate the reliability of these models for clinical reasoning and medical knowledge in non-English languages.

METHODS

This study used the default mode of ChatGPT, which is based on GPT-3.5; the GPT-4 model of ChatGPT Plus; and the 117th JMLE in 2023. A total of 254 questions were included in the final analysis, which were categorized into 3 types, namely general, clinical, and clinical sentence questions.

RESULTS

The results indicated that GPT-4 outperformed GPT-3.5 in terms of accuracy, particularly for general, clinical, and clinical sentence questions. GPT-4 also performed better on difficult questions and specific disease questions. Furthermore, GPT-4 achieved the passing criteria for the JMLE, indicating its reliability for clinical reasoning and medical knowledge in non-English languages.

CONCLUSIONS

GPT-4 could become a valuable tool for medical education and clinical support in non-English-speaking regions, such as Japan.

摘要

背景

ChatGPT(聊天生成预训练变换器)在非英语语言方面的能力尚未得到充分研究。

目的

本研究比较了GPT-3.5(生成式预训练变换器)和GPT-4在日本医师执照考试(JMLE)中的表现,以评估这些模型在非英语语言临床推理和医学知识方面的可靠性。

方法

本研究使用了基于GPT-3.5的ChatGPT默认模式、ChatGPT Plus的GPT-4模型以及2023年第117次JMLE。最终分析共纳入254道题,分为一般、临床和临床句子题3种类型。

结果

结果表明,GPT-4在准确性方面优于GPT-3.5,尤其是在一般、临床和临床句子题上。GPT-4在难题和特定疾病问题上也表现更好。此外,GPT-4达到了JMLE的及格标准,表明其在非英语语言临床推理和医学知识方面的可靠性。

结论

GPT-4可能成为日本等非英语地区医学教育和临床支持的宝贵工具。

相似文献

引用本文的文献

本文引用的文献

1
ChatGPT in healthcare: A taxonomy and systematic review.ChatGPT 在医疗保健中的应用:分类法与系统综述。
Comput Methods Programs Biomed. 2024 Mar;245:108013. doi: 10.1016/j.cmpb.2024.108013. Epub 2024 Jan 15.
6
Role of Chat GPT in Public Health.Chat GPT 在公共卫生中的作用。
Ann Biomed Eng. 2023 May;51(5):868-869. doi: 10.1007/s10439-023-03172-7. Epub 2023 Mar 15.
7
ChatGPT: not all languages are equal.ChatGPT:并非所有语言都是平等的。
Nature. 2023 Mar;615(7951):216. doi: 10.1038/d41586-023-00680-3.

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验