University Claude Bernard Lyon I, Lyon, France.
Lady Davis Institute for Cancer Research, Jewish General Hospital, McGill University, Montreal, Quebec, Canada.
Int J Gynaecol Obstet. 2024 Mar;164(3):959-963. doi: 10.1002/ijgo.15083. Epub 2023 Sep 1.
To evaluate the performance of ChatGPT in a French medical school entrance examination.
A cross-sectional study using a consecutive sample of text-based multiple-choice practice questions for the Parcours d'Accès Spécifique Santé. ChatGPT answered questions in French. We compared performance of ChatGPT in obstetrics and gynecology (OBGYN) and in the whole test.
Overall, 885 questions were evaluated. The mean test score was 34.0% (306; maximal score of 900). The performance of ChatGPT was 33.0% (292 correct answers, 885 questions). The performance of ChatGPT was lower in biostatistics (13.3% ± 19.7%) than in anatomy (34.2% ± 17.9%; P = 0.037) and also lower than in histology and embryology (40.0% ± 18.5%; P = 0.004). The OBGYN part had 290 questions. There was no difference in the test scores and the performance of ChatGPT in OBGYN versus the whole entrance test (P = 0.76 vs P = 0.10, respectively).
ChatGPT answered one-third of questions correctly in the French test preparation. The performance in OBGYN was similar.
评估 ChatGPT 在法国医学院入学考试中的表现。
采用横断面研究,使用 Parcours d'Accès Spécifique Santé 的基于文本的多项选择题练习样本进行连续抽样。ChatGPT 用法语回答问题。我们比较了 ChatGPT 在妇产科 (OBGYN) 和整个测试中的表现。
总体而言,评估了 885 个问题。平均考试成绩为 34.0%(306;满分 900)。ChatGPT 的表现为 33.0%(292 个正确答案,885 个问题)。ChatGPT 在生物统计学中的表现(13.3%±19.7%)低于解剖学(34.2%±17.9%;P=0.037),也低于组织学和胚胎学(40.0%±18.5%;P=0.004)。妇产科部分有 290 个问题。在妇产科和整个入学考试的考试成绩和 ChatGPT 的表现方面没有差异(P=0.76 与 P=0.10,分别)。
ChatGPT 在法国考试准备中正确回答了三分之一的问题。妇产科的表现相似。