• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

多次重考高风险研究生医学考试:MRCP(UK)考试表现的非线性多层级建模。

Resitting a high-stakes postgraduate medical examination on multiple occasions: nonlinear multilevel modelling of performance in the MRCP(UK) examinations.

机构信息

Academic Centre for Medical Education, Division of Medical Education, University College London, Gower Street, London, WC1E 6BT, UK.

出版信息

BMC Med. 2012 Jun 14;10:60. doi: 10.1186/1741-7015-10-60.

DOI:10.1186/1741-7015-10-60
PMID:22697599
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC3394208/
Abstract

BACKGROUND

Failure rates in postgraduate examinations are often high and many candidates therefore retake examinations on several or even many times. Little, however, is known about how candidates perform across those multiple attempts. A key theoretical question to be resolved is whether candidates pass at a resit because they have got better, having acquired more knowledge or skills, or whether they have got lucky, chance helping them to get over the pass mark. In the UK, the issue of resits has become of particular interest since the General Medical Council issued a consultation and is considering limiting the number of attempts candidates may make at examinations.

METHODS

Since 1999 the examination for Membership of the Royal Colleges of Physicians of the United Kingdom (MRCP(UK)) has imposed no limit on the number of attempts candidates can make at its Part 1, Part 2 or PACES (Clinical) examination. The present study examined the performance of candidates on the examinations from 2002/2003 to 2010, during which time the examination structure has been stable. Data were available for 70,856 attempts at Part 1 by 39,335 candidates, 37,654 attempts at Part 2 by 23,637 candidates and 40,303 attempts at PACES by 21,270 candidates, with the maximum number of attempts being 26, 21 and 14, respectively. The results were analyzed using multilevel modelling, fitting negative exponential growth curves to individual candidate performance.

RESULTS

The number of candidates taking the assessment falls exponentially at each attempt. Performance improves across attempts, with evidence in the Part 1 examination that candidates are still improving up to the tenth attempt, with a similar improvement up to the fourth attempt in Part 2 and the sixth attempt at PACES. Random effects modelling shows that candidates begin at a starting level, with performance increasing by a smaller amount at each attempt, with evidence of a maximum, asymptotic level for candidates, and candidates showing variation in starting level, rate of improvement and maximum level. Modelling longitudinal performance across the three diets (sittings) shows that the starting level at Part 1 predicts starting level at both Part 2 and PACES, and the rate of improvement at Part 1 also predicts the starting level at Part 2 and PACES.

CONCLUSION

Candidates continue to show evidence of true improvement in performance up to at least the tenth attempt at MRCP(UK) Part 1, although there are individual differences in the starting level, the rate of improvement and the maximum level that can be achieved. Such findings provide little support for arguments that candidates should only be allowed a fixed number of attempts at an examination. However, unlimited numbers of attempts are also difficult to justify because of the inevitable and ever increasing role that luck must play with increasing numbers of resits, so that the issue of multiple attempts might be better addressed by tackling the difficult question of how a pass mark should increase with each attempt at an exam.

摘要

背景

研究生考试的失败率往往很高,因此许多考生会多次参加考试。然而,人们对考生在多次尝试中的表现知之甚少。需要解决的一个关键理论问题是,考生是否因为获得了更多的知识或技能而通过了补考,或者他们是否只是运气好,碰巧帮助他们通过了及格线。在英国,自从医学总会发布咨询意见并考虑限制考生参加考试的次数以来,补考问题引起了特别关注。

方法

自 1999 年以来,英国皇家内科医师学会会员资格考试(MRCP(UK))对考生在其第一部分、第二部分或 PACES(临床)考试中的尝试次数没有限制。本研究检查了 2002/2003 年至 2010 年期间考生在考试中的表现,在此期间考试结构保持稳定。第一部分有 70856 次尝试,涉及 39335 名考生,第二部分有 37654 次尝试,涉及 23637 名考生,PACES 有 40303 次尝试,涉及 21270 名考生,最大尝试次数分别为 26、21 和 14。使用多层次建模分析数据,为每个考生的表现拟合负指数增长曲线。

结果

每次尝试,参加评估的考生人数呈指数下降。考生的表现随着尝试次数的增加而提高,第一部分的考试结果表明,考生在第十次尝试时仍在提高,第二部分和 PACES 的第四次尝试时也有类似的提高。在第二部分和 PACES 的第六次尝试时,考生的表现达到了最佳水平。随机效应模型显示,考生从一个起始水平开始,每次尝试的表现都会有所提高,考生的表现存在一个最大值,达到了最佳水平。考生的起始水平、提高速度和最大水平存在差异。对三个部分(场次)的纵向表现进行建模表明,第一部分的起始水平预测了第二部分和 PACES 的起始水平,第一部分的提高速度也预测了第二部分和 PACES 的起始水平。

结论

考生在至少第十次参加 MRCP(UK)第一部分考试时,仍表现出明显的真实提高,尽管在起始水平、提高速度和可达到的最大水平方面存在个体差异。这些发现几乎没有支持考生只能参加固定次数考试的论点。然而,由于随着补考次数的增加,运气的作用不可避免地越来越大,无限次的补考也很难证明其合理性,因此解决多次补考的问题可能更好的方法是解决如何根据考生在考试中的每次尝试来提高及格线这一难题。

相似文献

1
Resitting a high-stakes postgraduate medical examination on multiple occasions: nonlinear multilevel modelling of performance in the MRCP(UK) examinations.多次重考高风险研究生医学考试:MRCP(UK)考试表现的非线性多层级建模。
BMC Med. 2012 Jun 14;10:60. doi: 10.1186/1741-7015-10-60.
2
PLAB and UK graduates' performance on MRCP(UK) and MRCGP examinations: data linkage study.PLAB 和英国毕业生在 MRCP(UK) 和 MRCGP 考试中的表现:数据链接研究。
BMJ. 2014 Apr 17;348:g2621. doi: 10.1136/bmj.g2621.
3
Fitness to practise sanctions in UK doctors are predicted by poor performance at MRCGP and MRCP(UK) assessments: data linkage study.英国医生的行医能力制裁预测依据为 MRCGP 和 MRCP(UK)评估中的表现不佳:数据关联研究。
BMC Med. 2018 Dec 7;16(1):230. doi: 10.1186/s12916-018-1214-4.
4
Changes in standard of candidates taking the MRCP(UK) Part 1 examination, 1985 to 2002: analysis of marker questions.1985年至2002年参加英国皇家内科医师学会(MRCP)第一部分考试考生水平的变化:标记问题分析
BMC Med. 2005 Jul 18;3:13. doi: 10.1186/1741-7015-3-13.
5
Passing MRCP (UK) PACES: a cross-sectional study examining the performance of doctors by sex and country.通过 MRCP(英国)PACES:一项按性别和国家划分的医生表现的横断面研究。
BMC Med Educ. 2018 Apr 6;18(1):70. doi: 10.1186/s12909-018-1178-2.
6
Graduates of different UK medical schools show substantial differences in performance on MRCP(UK) Part 1, Part 2 and PACES examinations.英国不同医学院的毕业生在英国皇家内科医师学会(MRCP)第一部分、第二部分及实践技能评估(PACES)考试中的表现存在显著差异。
BMC Med. 2008 Feb 14;6:5. doi: 10.1186/1741-7015-6-5.
7
Assessment of examiner leniency and stringency ('hawk-dove effect') in the MRCP(UK) clinical examination (PACES) using multi-facet Rasch modelling.使用多维度Rasch模型评估英国皇家内科医师学会临床考试(PACES)中主考官的宽松和严格程度(“鹰鸽效应”)。
BMC Med Educ. 2006 Aug 18;6:42. doi: 10.1186/1472-6920-6-42.
8
Performance in the MRCP(UK) Examination 2003-4: analysis of pass rates of UK graduates in relation to self-declared ethnicity and gender.2003 - 2004年英国皇家内科医师学会会员资格考试成绩:英国毕业生及格率与自我申报的种族和性别的关系分析。
BMC Med. 2007 May 3;5:8. doi: 10.1186/1741-7015-5-8.
9
UK postgraduate medicine examinations: opportunities for international candidates.英国研究生医学考试:国际考生的机会。
Clin Med (Lond). 2014 Oct;14(5):500-5. doi: 10.7861/clinmedicine.14-5-500.
10
Investigating possible ethnicity and sex bias in clinical examiners: an analysis of data from the MRCP(UK) PACES and nPACES examinations.探讨临床考官中可能存在的种族和性别偏见:对 MRCP(UK)PACES 和 nPACES 考试数据的分析。
BMC Med Educ. 2013 Jul 30;13:103. doi: 10.1186/1472-6920-13-103.

引用本文的文献

1
Predictors of long-term knowledge retention in the driver theory test.驾驶理论考试中长期知识保持的预测因素。
Sci Rep. 2025 Jul 1;15(1):20948. doi: 10.1038/s41598-025-04796-6.
2
Differential attainment in UK postgraduate medical examinations: examining the relationship between sociodemographic differences and examination performance.英国研究生医学考试中的成绩差异:探究社会人口统计学差异与考试成绩之间的关系。
BMC Med. 2025 Apr 14;23(1):216. doi: 10.1186/s12916-025-04034-w.
3
Differential attainment at national selection for higher surgical training: a retrospective cohort study.

本文引用的文献

1
The Effect of Different Forms of Centering in Hierarchical Linear Models.分层线性模型中不同形式中心化的效应
Multivariate Behav Res. 1995 Jan 1;30(1):1-21. doi: 10.1207/s15327906mbr3001_1.
2
Non-linear Growth Models in M and SAS.M和SAS中的非线性增长模型
Struct Equ Modeling. 2009 Oct;16(4):676-701. doi: 10.1080/10705510903206055.
3
Measurement precision for repeat examinees on a standardized patient examination.标准化患者检查中重复受检者的测量精度。
高等外科培训全国选拔中的差异成就:一项回顾性队列研究。
BMJ Open. 2025 Jan 28;15(1):e091796. doi: 10.1136/bmjopen-2024-091796.
4
Big data analysis: examination of the relationship between candidates' sociodemographic characteristics and performance in the UK's Membership of the Royal College of Physicians Part 1 examination.大数据分析:英国皇家内科医师学院第一部分考试中考生社会人口学特征与考试成绩之间的关系研究
Adv Health Sci Educ Theory Pract. 2025 Feb;30(1):53-68. doi: 10.1007/s10459-024-10406-3. Epub 2024 Dec 20.
5
Language of primary medical qualification and differential MRCGP exam attainment: an observational study.初级医学资格语言与MRCGP考试成绩差异:一项观察性研究。
Br J Gen Pract. 2025 Mar 27;75(753):e285-e291. doi: 10.3399/BJGP.2024.0296. Print 2025 Apr.
6
Back to basics: reflective take on role of MCQs in undergraduate Malaysian dental professional qualifying exams.回归基础:对马来西亚本科牙科专业资格考试中多项选择题作用的反思
Front Med (Lausanne). 2023 Nov 30;10:1287924. doi: 10.3389/fmed.2023.1287924. eCollection 2023.
7
When another assessment attempt is bad for progress.当再次进行评估尝试不利于进展时。
MedEdPublish (2016). 2018 Jul 19;7:147. doi: 10.15694/mep.2018.0000147.1. eCollection 2018.
8
Can achievement at medical admission tests predict future performance in postgraduate clinical assessments? A UK-based national cohort study.医学入学考试成绩能否预测研究生临床评估的未来表现?一项基于英国的全国队列研究。
BMJ Open. 2022 Feb 8;12(2):e056129. doi: 10.1136/bmjopen-2021-056129.
9
Does performance at the intercollegiate Membership of the Royal Colleges of Surgeons (MRCS) examination vary according to UK medical school and course type? A retrospective cohort study.英国皇家外科学院联合会员(MRCS)考试成绩是否因英国医学院校和课程类型而异?一项回顾性队列研究。
BMJ Open. 2022 Jan 5;12(1):e054616. doi: 10.1136/bmjopen-2021-054616.
10
Does performance at medical school predict success at the Intercollegiate Membership of the Royal College of Surgeons (MRCS) examination? A retrospective cohort study.医学院表现能否预测皇家外科学院会员联合考试(MRCS)的成功?一项回顾性队列研究。
BMJ Open. 2021 Aug 16;11(8):e046615. doi: 10.1136/bmjopen-2020-046615.
Adv Health Sci Educ Theory Pract. 2012 Aug;17(3):325-37. doi: 10.1007/s10459-011-9309-0. Epub 2011 Oct 1.
4
Changing PACES: developments to the examination in 2009.考试变革:2009 年考试的新发展。
Clin Med (Lond). 2011 Jun;11(3):231-4. doi: 10.7861/clinmedicine.11-3-231.
5
Evaluating construct equivalence and criterion-related validity for repeat examinees on a standardized patient examination.评估标准化患者考试中重测考生的结构等效性和效标关联效度。
Acad Med. 2011 Oct;86(10):1253-9. doi: 10.1097/ACM.0b013e31822bc0a4.
6
The second time around: accounting for retest effects on oral examinations.第二次机会:考虑重测效应对口腔考试的影响。
Eval Health Prof. 2010 Sep;33(3):386-403. doi: 10.1177/0163278710374855.
7
The standard error of measurement is a more appropriate measure of quality for postgraduate medical assessments than is reliability: an analysis of MRCP(UK) examinations.测量标准误差是研究生医学评估中比可靠性更合适的质量衡量标准:对 MRCP(UK)考试的分析。
BMC Med Educ. 2010 Jun 2;10:40. doi: 10.1186/1472-6920-10-40.
8
Is case-specificity content-specificity? An analysis of data from extended-matching questions.病例特异性是否为特异性?扩展匹配问题数据的分析。
Adv Health Sci Educ Theory Pract. 2010 Mar;15(1):55-63. doi: 10.1007/s10459-009-9169-z. Epub 2009 Jun 4.
9
The practical value of the standard error of measurement in borderline pass/fail decisions.测量标准误在临界及格/不及格决策中的实用价值。
Med Educ. 2008 Aug;42(8):810-5. doi: 10.1111/j.1365-2923.2008.03103.x. Epub 2008 Jun 14.
10
Content specificity: is it the full story? Statistical modelling of a clinical skills examination.内容特异性:这就是全部情况吗?一项临床技能考试的统计建模
Med Educ. 2008 Jun;42(6):589-99. doi: 10.1111/j.1365-2923.2008.03020.x.