Performance of GPT-3.5 and GPT-4 on the Japanese Medical Licensing Examination: Comparison Study.

Authors

Takagi Soshi, Watari Takashi, Erabi Ayano, Sakaguchi Kota

Affiliations

Faculty of Medicine, Shimane University, Izumo, Japan.

General Medicine Center, Shimane University Hospital, Izumo, Japan.

Publication information

JMIR Med Educ. 2023 Jun 29;9:e48002. doi: 10.2196/48002.

Abstract

BACKGROUND

The competence of ChatGPT (Chat Generative Pre-Trained Transformer) in non-English languages is not well studied.

OBJECTIVE

This study compared the performance of GPT-3.5 (Generative Pre-trained Transformer) and GPT-4 on the Japanese Medical Licensing Examination (JMLE) to evaluate the reliability of these models for clinical reasoning and medical knowledge in non-English languages.

METHODS

This study used the default mode of ChatGPT, which is based on GPT-3.5; the GPT-4 model of ChatGPT Plus; and the 117th JMLE in 2023. The final analysis included 254 questions, categorized into 3 types: general, clinical, and clinical sentence questions.

RESULTS

GPT-4 outperformed GPT-3.5 in accuracy on general, clinical, and clinical sentence questions. GPT-4 also performed better on difficult questions and on questions about specific diseases. Furthermore, GPT-4 met the passing criteria for the JMLE, indicating its reliability for clinical reasoning and medical knowledge in non-English languages.

CONCLUSIONS

GPT-4 could become a valuable tool for medical education and clinical support in non-English-speaking regions, such as Japan.
