Department of Emergency Medicine, Dong-A University, College of Medicine, Busan, Korea.
Department of Medical Education, Dong-A University, College of Medicine, Busan, Korea.
J Educ Eval Health Prof. 2021;18:25. doi: 10.3352/jeehp.2021.18.25. Epub 2021 Sep 27.
Setting standards is critical in health professions. However, appropriate standard setting methods do not always apply to the set cut score in performance assessment. The aim of this study was to compare the cut score when the standard setting is changed from the norm-referenced method to the borderline group method (BGM) and borderline regression method (BRM) in an objective structured clinical examination (OSCE) in medical school.
This was an explorative study to model of the BGM and BRM. A total of 107 fourth-year medical students attended the OSCE at seven stations with encountering standardized patients (SPs) and one station with performing skills on a manikin on 15 July 2021. Thirty-two physician examiners evaluated the performance by completing a checklist and global rating scales.
The cut score of the norm-referenced method was lower than that of the BGM (p<0.01) and BRM (p<0.02). There was no significant difference in the cut score between the BGM and BRM (p=0.40). The station with the highest standard deviation and the highest proportion of the borderline group showed the largest cut score difference in standard setting methods.
Prefixed cut scores by the norm-referenced method without considering station contents or examinee performance can vary due to station difficulty and content, affecting the appropriateness of standard setting decisions. If there is an adequate consensus on the criteria for the borderline group, standard setting with the BRM could be applied as a practical and defensible method to determine the cut score for OSCE.
在健康职业中制定标准至关重要。然而,适当的标准制定方法并不总是适用于表现评估中的设定截止分数。本研究的目的是比较医学学校中的客观结构化临床考试(OSCE)中,从参照标准方法改为边界组方法(BGM)和边界回归方法(BRM)时的截止分数。
这是一项探索性研究,用于构建 BGM 和 BRM 模型。共有 107 名四年级医学生于 2021 年 7 月 15 日在七个站接受 OSCE 考试,其中包括与标准化患者(SP)相遇的站和在人体模型上进行技能操作的站。32 名医师 examiner 通过填写检查表和总体评分量表来评估表现。
参照标准方法的截止分数低于 BGM(p<0.01)和 BRM(p<0.02)。BGM 和 BRM 的截止分数之间没有显著差异(p=0.40)。标准偏差最高且边界组比例最高的站的标准设定方法的截止分数差异最大。
如果不考虑站的内容或考生的表现,通过参照标准方法预先设定的截止分数可能会因站的难度和内容而有所不同,从而影响标准设定决策的适当性。如果对边界组的标准有足够的共识,那么使用 BRM 进行标准设定可以作为一种实用且有防御性的方法来确定 OSCE 的截止分数。