Iftikhar Haris, Anjum Shahzad, Bhutta Zain A, Najam Mavia, Bashir Khalid
Emergency Medicine, Hamad General Hospital, Doha, Qatar *Email:
Department of Medical Education, Hamad Medical Corporation, Doha, Qatar.
Qatar Med J. 2024 Nov 11;2024(4):61. doi: 10.5339/qmj.2024.61. eCollection 2024.
The inclusion of artificial intelligence (AI) in the healthcare sector has transformed medical practices by introducing innovative techniques for medical education, diagnosis, and treatment strategies. In medical education, the potential of AI to enhance learning and assessment methods is being increasingly recognized. This study aims to evaluate the performance of OpenAI's Chat Generative Pre-Trained Transformer (ChatGPT) in emergency medicine (EM) residency examinations in Qatar and compare it with the performance of resident physicians.
A retrospective descriptive study with a mixed-methods design was conducted in August 2023. EM residents' examination scores were collected and compared with the performance of ChatGPT on the same examinations. The examinations consisted of multiple-choice questions (MCQs) from the same faculty responsible for Qatari Board EM examinations. ChatGPT's performance on these examinations was analyzed and compared with residents across various postgraduate years (PGY).
The study included 238 emergency department residents from PGY1 to PGY4 and compared their performances with ChatGPT. ChatGPT scored consistently higher than resident groups in all examination categories. However, a notable decline in passing rates was observed among senior residents, indicating a potential misalignment between examination performance and practical competencies. Another likely reason can be the impact of the COVID-19 pandemic on their learning experience, knowledge acquisition, and consolidation.
ChatGPT demonstrated significant proficiency in the theoretical knowledge of EM, outperforming resident physicians in examination settings. This finding suggests the potential of AI as a supplementary tool in medical education.
医疗保健领域引入人工智能(AI),通过引入医学教育、诊断和治疗策略的创新技术,改变了医疗实践。在医学教育中,人工智能提升学习和评估方法的潜力正日益得到认可。本研究旨在评估OpenAI的聊天生成预训练变换器(ChatGPT)在卡塔尔急诊医学(EM)住院医师考试中的表现,并将其与住院医师的表现进行比较。
2023年8月进行了一项采用混合方法设计的回顾性描述性研究。收集了急诊医学住院医师的考试成绩,并与ChatGPT在相同考试中的表现进行比较。考试由负责卡塔尔急诊医学委员会考试的同一教师团队提供的多项选择题(MCQ)组成。分析了ChatGPT在这些考试中的表现,并与不同研究生年级(PGY)的住院医师进行了比较。
该研究纳入了238名从PGY1到PGY4的急诊科住院医师,并将他们的表现与ChatGPT进行了比较。在所有考试类别中,ChatGPT的得分始终高于住院医师组。然而,观察到高年级住院医师的及格率显著下降,这表明考试成绩与实际能力之间可能存在脱节。另一个可能的原因是新冠疫情对他们的学习经历、知识获取和巩固产生了影响。
ChatGPT在急诊医学的理论知识方面表现出显著的熟练程度,在考试环境中优于住院医师。这一发现表明人工智能作为医学教育辅助工具的潜力。