Medical Education, College of Medicine, King Saud bin Abdulaziz University for Health Sciences, Riyadh, Saudi Arabia.
BMC Med Educ. 2012 Dec 7;12:121. doi: 10.1186/1472-6920-12-121.
The objective structure clinical examination (OSCE) has been used since the early 1970s for assessing clinical competence. There are very few studies that have examined the psychometric stability of the stations that are used repeatedly with different samples. The purpose of the present study was to assess the stability of objective structured clinical exams (OSCEs) employing the same stations used over time but with a different sample of candidates, SPs, and examiners.
At Time 1, 191 candidates and at Time 2 (one year apart), 236 candidates participated in a 10-station OSCE; 6 of the same stations were used in both years. Generalizability analyses (Ep2) were conducted. Employing item response analyses, test characteristic curves (TCC) were derived for each of the 6 stations for a 2-parameter model. The TCCs were compared across the two years, Time 1 and 2.
The Ep2 of the OSCEs exceeded.70. Standardized thetas (θ) and discriminations were equivalent for the same station across the two year period indicating equivalent TCCs for a 2-parameter model.
The 6 OSCE stations used by the AIMG program over two years have adequate internal consistency reliability, stable generalizability (Ep2) and equivalent test characteristics. The process of assessment employed for IMG's are stable OSCE stations that may be used several times over without compromising psychometric properties.With careful security, high-stakes OSCEs may use the same stations that have high internal consistency and generalizability repeatedly as the psychometric properties are stable over several years with different samples of candidates.
客观结构临床考试(OSCE)自 20 世纪 70 年代初以来一直被用于评估临床能力。很少有研究检验过在不同样本中重复使用的站点的心理测量稳定性。本研究的目的是评估使用相同站点但具有不同候选人、SP 和考官样本的客观结构化临床考试(OSCE)的稳定性。
在时间 1,有 191 名候选人,在时间 2(相隔一年),有 236 名候选人参加了 10 站 OSCE;其中 6 个相同的站点在两年内都有使用。进行了可概括性分析(Ep2)。利用项目反应分析,为每个 6 个站点的 2 个参数模型得出了测试特征曲线(TCC)。在两年,即时间 1 和 2 之间,对 TCC 进行了比较。
OSCE 的 Ep2 超过 0.70。在两年期间,同一站点的标准化θ和区分度是等效的,表明 2 个参数模型的 TCC 等效。
AIMG 项目在两年内使用的 6 个 OSCE 站点具有足够的内部一致性可靠性、稳定的可概括性(Ep2)和等效的测试特征。为 IMG 采用的评估过程是稳定的 OSCE 站点,可以多次使用而不会影响心理测量特性。在精心的安全措施下,高风险的 OSCE 可以重复使用具有高内部一致性和可概括性的相同站点,因为其心理测量特性在几年内对不同的候选人群体都是稳定的。