ChatGPT (GPT-4) passed the Japanese National License Examination for Pharmacists in 2022, answering all items including those with diagrams: a descriptive study.

Affiliations

Department of Pharmacy, Abashiri Kosei General Hospital, Abashiri, Japan.

Graduate School of Health Sciences, Hokkaido University, Sapporo, Japan.

Publication information

J Educ Eval Health Prof. 2024;21:4. doi: 10.3352/jeehp.2024.21.4. Epub 2024 Feb 28.

DOI:10.3352/jeehp.2024.21.4

PMID:38413129

Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC10948916/

Abstract

PURPOSE

The objective of this study was to assess the performance of ChatGPT (GPT-4) on all items, including those with diagrams, in the Japanese National License Examination for Pharmacists (JNLEP) and compare it with the previous GPT-3.5 model’s performance.

METHODS

This study targeted the 107th JNLEP, conducted in 2022, from which 344 items were input into the GPT-4 model. Separately, 284 items, excluding those with diagrams, were entered into the GPT-3.5 model. The answers were categorized and analyzed to determine accuracy rates by category, subject, and presence or absence of diagrams. The accuracy rates were compared with the main passing criteria (overall accuracy rate ≥62.9%).
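The grouping and threshold comparison described above can be illustrated with a minimal Python sketch. It is not the authors' analysis code: the record layout, the field names (`subject`, `has_diagram`, `correct`), and the four toy records are assumptions made for illustration only. The 62.9% threshold is the one figure taken from the abstract, and the sketch checks only that overall criterion, not the exam's other passing criteria.

```python
from collections import defaultdict

# Main passing criterion quoted in the abstract (overall accuracy rate, percent).
PASSING_OVERALL_RATE = 62.9


def accuracy_rates(items, key):
    """Return {group: percent correct}, grouping items by the value of `key`."""
    totals = defaultdict(int)
    correct = defaultdict(int)
    for item in items:
        totals[item[key]] += 1
        correct[item[key]] += int(item["correct"])
    return {group: 100.0 * correct[group] / totals[group] for group in totals}


def overall_rate(items):
    """Return the percent of all items answered correctly."""
    return 100.0 * sum(int(item["correct"]) for item in items) / len(items)


# Toy records (hypothetical, NOT the study's data).
items = [
    {"subject": "pharmacology", "has_diagram": False, "correct": True},
    {"subject": "pharmacology", "has_diagram": True, "correct": False},
    {"subject": "chemistry", "has_diagram": False, "correct": True},
    {"subject": "chemistry", "has_diagram": True, "correct": True},
]

by_diagram = accuracy_rates(items, "has_diagram")
by_subject = accuracy_rates(items, "subject")
overall = overall_rate(items)
meets_main_criterion = overall >= PASSING_OVERALL_RATE
```

The same `accuracy_rates` helper serves every breakdown named in the methods (category, subject, diagram presence) by changing only the `key` argument.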

RESULTS

GPT-4's overall accuracy rate for all items in the 107th JNLEP was 72.5%, meeting all the passing criteria. For the set of items without diagrams, its accuracy rate was 80.0%, significantly higher than that of the GPT-3.5 model (43.5%). For items that included diagrams, the GPT-4 model's accuracy rate was 36.1%.

CONCLUSION

Advancements that allow GPT-4 to process images have made it possible for LLMs to answer all items in medical-related license examinations. This study’s findings confirm that ChatGPT (GPT-4) possesses sufficient knowledge to meet the passing criteria.
