Takagi Soshi, Watari Takashi, Erabi Ayano, Sakaguchi Kota
Faculty of Medicine, Shimane University, Izumo, Japan.
General Medicine Center, Shimane University Hospital, Izumo, Japan.
JMIR Med Educ. 2023 Jun 29;9:e48002. doi: 10.2196/48002.
BACKGROUND: The competence of ChatGPT (Chat Generative Pre-Trained Transformer) in non-English languages is not well studied. OBJECTIVE: This study compared the performances of GPT-3.5 (Generative Pre-trained Transformer) and GPT-4 on the Japanese Medical Licensing Examination (JMLE) to evaluate the reliability of these models for clinical reasoning and medical knowledge in non-English languages. METHODS: This study used the default mode of ChatGPT, which is based on GPT-3.5; the GPT-4 model of ChatGPT Plus; and the 117th JMLE in 2023. A total of 254 questions were included in the final analysis, which were categorized into 3 types, namely general, clinical, and clinical sentence questions. RESULTS: The results indicated that GPT-4 outperformed GPT-3.5 in terms of accuracy, particularly for general, clinical, and clinical sentence questions. GPT-4 also performed better on difficult questions and specific disease questions. Furthermore, GPT-4 achieved the passing criteria for the JMLE, indicating its reliability for clinical reasoning and medical knowledge in non-English languages. CONCLUSIONS: GPT-4 could become a valuable tool for medical education and clinical support in non-English-speaking regions, such as Japan.
PLOS Digit Health. 2024-1-23
Int Urogynecol J. 2025-6-18
Comput Methods Programs Biomed. 2024-3
J Med Syst. 2023-8-15
N Engl J Med. 2023-3-30
Ann Biomed Eng. 2023-5
Nature. 2023-3
PLOS Digit Health. 2023-2-9