a Keele University School of Medicine , UK.
Med Teach. 2018 Nov;40(11):1159-1165. doi: 10.1080/0142159X.2018.1463088. Epub 2018 Apr 27.
OSCE examiners' scores are variable and may discriminate domains of performance poorly. Examiners must hold their observations of OSCE performances in "episodic memory" until performances end. We investigated whether examiners vary in their recollection of performances; and whether this relates to their score variability or ability to separate disparate performance domains. Secondary analysis was performed on data where examiners had: 1/scored videos of OSCE performances showing disparate student ability in different domains; and 2/performed a measure of recollection for an OSCE performance. We calculated measures of "overall-score variance" (the degree individual examiners' overall scores varied from the group mean) and "domain separation" (the degree to which examiners separated different performance domains). We related these variables to the measure of examiners' recollection. Examiners varied considerably in their recollection accuracy (recognition beyond chance -5% to +75% for different examiners). Examiners' recollection accuracy was weakly inversely related to their overall score accuracy ( = -0.17, < 0.001) and related to their ability to separate domains of performance ( = 0.25, < 0.001). Examiners vary substantially in their memories for students' performances which may offer a useful point of difference to study processing and integration phases of judgment. Findings could have implication for the utility of feedback.
客观结构化临床考试(OSCE)考官的评分存在差异,并且可能无法准确区分考生表现的不同领域。考官必须将对 OSCE 表现的观察结果暂存在“情景记忆”中,直到考试结束。我们研究了考官在回忆表现时是否存在差异;以及这种差异是否与他们的评分变异性或区分不同表现领域的能力有关。我们对以下数据进行了二次分析:1. 考官对 OSCE 表现进行评分,这些表现展示了不同考生在不同领域的不同能力;2. 考官对 OSCE 表现进行了回忆测试。我们计算了“总体评分方差”(个体考官的总体评分与群体平均值的差异程度)和“领域分离”(考官区分不同表现领域的程度)的指标。我们将这些变量与考官回忆的测试结果相关联。考官在回忆准确性方面存在较大差异(不同考官的识别准确率在 5%-75%之间)。考官的回忆准确性与他们的总体评分准确性呈弱负相关( = -0.17, < 0.001),并且与他们区分表现领域的能力相关( = 0.25, < 0.001)。考官对学生表现的记忆存在较大差异,这可能为研究判断的处理和整合阶段提供了一个有用的差异点。研究结果可能对反馈的有效性产生影响。