Wallenstein Joshua, Ander Douglas
Emory University, Department of Emergency Medicine, Atlanta, Georgia.
West J Emerg Med. 2015 Jan;16(1):121-6. doi: 10.5811/westjem.2014.11.22440. Epub 2014 Dec 12.
Evaluation of emergency medicine (EM) learners based on observed performance in the emergency department (ED) is limited by factors such as reproducibility and patient safety. EM educators depend on standardized and reproducible assessments such as the objective structured clinical examination (OSCE). The validity of the OSCE as an evaluation tool in EM education has not been previously studied. The objective was to assess the validity of a novel management-focused OSCE as an evaluation instrument in EM education through demonstration of performance correlation with established assessment methods and case item analysis.
We conducted a prospective cohort study of fourth-year medical students enrolled in a required EM clerkship. Students enrolled in the clerkship completed a five-station EM OSCE. We used Pearson's coefficient to correlate OSCE performance with performance in the ED based on completed faculty evaluations. Indices of difficulty and discrimination were computed for each scoring item.
We found a moderate and statistically-significant correlation between OSCE score and ED performance score [r(239) =0.40, p<0.001]. Of the 34 OSCE testing items the mean index of difficulty was 63.0 (SD =23.0) and the mean index of discrimination was 0.52 (SD =0.21).
Student performance on the OSCE correlated with their observed performance in the ED, and indices of difficulty and differentiation demonstrated alignment with published best-practice testing standards. This evidence, along with other attributes of the OSCE, attest to its validity. Our OSCE can be further improved by modifying testing items that performed poorly and by examining and maximizing the inter-rater reliability of our evaluation instrument.
基于急诊科(ED)观察到的表现对急诊医学(EM)学习者进行评估,会受到可重复性和患者安全等因素的限制。急诊医学教育工作者依赖于标准化且可重复的评估,如客观结构化临床考试(OSCE)。此前尚未研究过OSCE作为急诊医学教育评估工具的有效性。目的是通过证明与既定评估方法的表现相关性以及病例项目分析,评估一种新型的以管理为重点的OSCE作为急诊医学教育评估工具的有效性。
我们对参加必修急诊医学实习的四年级医学生进行了一项前瞻性队列研究。参加实习的学生完成了一个包含五个站点的急诊医学OSCE。我们使用皮尔逊系数将OSCE表现与基于教师完成的评估得出的急诊科表现相关联。计算每个评分项目的难度指数和区分指数。
我们发现OSCE分数与急诊科表现分数之间存在中等且具有统计学意义的相关性[r(239) =0.40, p<0.001]。在34个OSCE测试项目中,平均难度指数为63.0(标准差=23.0),平均区分指数为0.52(标准差=0.21)。
学生在OSCE上的表现与他们在急诊科观察到的表现相关,难度指数和区分指数表明符合已发表的最佳实践测试标准。这一证据以及OSCE的其他属性证明了其有效性。我们的OSCE可以通过修改表现不佳的测试项目以及检查并最大化我们评估工具的评分者间信度来进一步改进。