National Board of Medical Examiners, Philadelphia, PA, USA.
Eval Health Prof. 2010 Sep;33(3):386-403. doi: 10.1177/0163278710374855.
Years of research with high-stakes written tests indicates that although repeat examinees typically experience score gains between their first and subsequent attempts, their pass rates remain considerably lower than pass rates for first-time examinees. This outcome is consistent with expectations. Comparable studies of the performance of repeat examinees on oral examinations are lacking. The current research evaluated pass rates for more than 50,000 examinees on written and oral exams administered by six medical specialty boards for several recent years. Pass rates for first-time examinees were similar for both written and oral exams, averaging about 84% across all boards. Pass rates for repeat examinees on written exams were expectedly lower, ranging from 22% to 51%, with an average of 36%. However, pass rates for repeat examinees on oral exams were markedly higher than for written exams, ranging from 53% to 77%, with an average of 65%. Four explanations for the elevated repeat pass rates on oral exams are proposed, including an increase in examinee proficiency, construct-irrelevant variance, measurement error (score unreliability), and memorization of test content. Simulated data are used to demonstrate that roughly one third of the score increase can be explained by measurement error alone. The authors suggest that a substantial portion of the score increase can also likely be attributed to construct-irrelevant variance. Results are discussed in terms of their implications for making pass-fail decisions when retesting is allowed. The article concludes by identifying areas for future research.
多年来,针对高风险书面考试的研究表明,尽管重复考生通常在第一次和后续尝试之间经历分数提高,但他们的通过率仍然明显低于首次考生的通过率。这一结果符合预期。对于重复考生在口头考试中的表现,类似的研究尚缺乏。目前的研究评估了六个医学专业委员会在近几年中对书面和口头考试的 50,000 多名考生的通过率。首次考生的书面和口头考试通过率相似,所有委员会的平均通过率约为 84%。书面考试中重复考生的通过率预期较低,范围为 22%至 51%,平均为 36%。然而,重复考生在口头考试中的通过率明显高于书面考试,范围为 53%至 77%,平均为 65%。提出了四个解释口头考试重复考生通过率较高的原因,包括考生熟练程度的提高、与结构无关的差异、测量误差(分数不可靠)和测试内容的记忆。模拟数据用于表明,大约三分之一的分数提高可以仅用测量误差来解释。作者建议,分数提高的很大一部分也可能归因于与结构无关的差异。结果讨论了在允许重新测试时做出通过/失败决策的影响。文章最后确定了未来研究的领域。