Denney MeiLing, Wakeford Richard
a Royal College of General Practitioners , London , UK.
b NHS Education for Scotland (South East) , Edinburgh , UK.
Educ Prim Care. 2016 Jan;27(1):39-43. doi: 10.1080/14739879.2015.1113724. Epub 2015 Dec 20.
The MRCGP clinical skills assessment (CSA), a high-stakes OSCE examination whereby GPs exit their training, uses trained role-players to depict patients (though not to mark candidates). Just as subgroups of examiners can potentially affect the results of subgroups of candidates through biases expressed in discriminatory marking, so possibly could subgroups of role-players alter the difficulty for different candidate groups. This study looks at the contribution of role-players to such possible systematic unfairness in the assessment. Using multiple linear regression, we examined the data from all 52,702 case scores from the MRCGP CSA for the academic year 2012-2013. Candidates were dichotomised by sex, by ethnicity and by source of primary medical qualification (PMQ); role-players were dichotomised by sex and binary ethnicity; and the transaction of candidate/role-player encounters were classified as 'same' or 'different' in terms of the two parties' sex and of their ethnicity. Neither examiner nor role-player characteristics were found to predict any statistically significant portion of case score variance, where the significant (p < .001) predictors were source of PMQ (UK or elsewhere: 11% of case score variance), candidates' ethnicity (1%), candidates' sex (0.6%) and 'transactional' sex (0.1%). We did not therefore find any substantial degree of support for the proposition that role-player subgroups systematically influence candidate subgroups' scores.
英国皇家全科医师学院临床技能评估(CSA)是一项高风险的客观结构化临床考试,全科医生通过此项考试结束培训,该考试使用经过培训的角色扮演人员来扮演患者(但不对考生打分)。正如考官亚组可能通过歧视性评分中表现出的偏见影响考生亚组的成绩一样,角色扮演人员亚组也可能改变不同考生群体面临的考试难度。本研究探讨了角色扮演人员对评估中这种可能存在的系统性不公平现象的影响。我们使用多元线性回归分析了2012 - 2013学年英国皇家全科医师学院临床技能评估中所有52702个病例分数的数据。考生按性别、种族和初级医学资格(PMQ)来源进行二分法分类;角色扮演人员按性别和二元种族进行二分法分类;考生与角色扮演人员互动的情况根据双方的性别和种族分为“相同”或“不同”。研究发现,无论是考官还是角色扮演人员的特征,都无法预测病例分数方差的任何具有统计学意义的部分,其中具有显著意义(p <.001)的预测因素是初级医学资格来源(英国或其他地区:占病例分数方差的11%)、考生种族(1%)、考生性别(0.6%)和“互动”性别(0.1%)。因此,我们没有找到任何实质性证据支持角色扮演人员亚组会系统性影响考生亚组成绩这一观点。