Department of Medical Education, Ruijin Hospital Affifiliated to Shanghai Jiao Tong University School of Medicine, 197 Ruijin Rd. II, Shanghai, 200025, China.
WORK Medical Technology Group LTD, Hangzhou, China.
J Med Syst. 2023 Aug 15;47(1):86. doi: 10.1007/s10916-023-01961-0.
ChatGPT, a language model developed by OpenAI, uses a 175 billion parameter Transformer architecture for natural language processing tasks. This study aimed to compare the knowledge and interpretation ability of ChatGPT with those of medical students in China by administering the Chinese National Medical Licensing Examination (NMLE) to both ChatGPT and medical students. We evaluated the performance of ChatGPT in three years' worth of the NMLE, which consists of four units. At the same time, the exam results were compared to those of medical students who had studied for five years at medical colleges. ChatGPT's performance was lower than that of the medical students, and ChatGPT's correct answer rate was related to the year in which the exam questions were released. ChatGPT's knowledge and interpretation ability for the NMLE were not yet comparable to those of medical students in China. It is probable that these abilities will improve through deep learning.
ChatGPT 是由 OpenAI 开发的一种语言模型,它使用了 1750 亿个参数的 Transformer 架构来处理自然语言处理任务。本研究旨在通过对 ChatGPT 和中国医学生进行中国国家医师资格考试(NMLE)来比较 ChatGPT 的知识和解释能力与中国医学生的能力。我们评估了 ChatGPT 在三年 NMLE 中的表现,NMLE 由四个单元组成。同时,将考试结果与在医学院学习五年的医学生的成绩进行了比较。ChatGPT 的表现低于医学生,并且 ChatGPT 的正确答案率与考试问题发布的年份有关。ChatGPT 对 NMLE 的知识和解释能力还无法与中国医学生相媲美。通过深度学习,这些能力可能会得到提高。