Keshtkar Alireza, Atighi Farnaz, Reihani Hamid
Department of Medicine, Clinical Education Research Center, Shiraz University of Medical Sciences, Shiraz, Iran.
Department of Medicine, Student Research Committee, School of Medicine, Shiraz University of Medical Sciences, Shiraz, Iran.
J Educ Health Promot. 2024 Nov 29;13:421. doi: 10.4103/jehp.jehp_1210_24. eCollection 2024.
ChatGPT has demonstrated significant potential in various aspects of medicine, including its performance on licensing examinations. In this study, we systematically investigated ChatGPT's performance on Iranian medical exams and assessed the quality of the included studies using a previously published assessment checklist. Across the included studies, ChatGPT's accuracy ranged from 32% to 72% on basic science exams, 34% to 68.5% on pre-internship exams, and 32% to 84% on residency exams. Notably, its performance was generally higher with English input than with Persian input. One study reported 40% accuracy on an endodontic board exam. To establish ChatGPT as a supplementary tool in medical education and clinical practice, dedicated guidelines and checklists are needed to ensure high-quality and consistent research in this emerging field.