• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

研究在多中心客观结构化临床考试(OSCE)中针对考官差异进行调整的准确性。一项基于视频的考官分数比较与调整(VESCA)模拟研究。

Investigating the accuracy of adjusting for examiner differences in multi-centre Objective Structured Clinical Exams (OSCEs). A simulation study of video-based Examiner Score Comparison and Adjustment (VESCA).

作者信息

Yeates Peter, McCray Gareth

机构信息

School of Medicine, Keele University, David Weatherall Building, Keele, ST5 5BG, UK.

出版信息

BMC Med Educ. 2024 Dec 18;24(1):1466. doi: 10.1186/s12909-024-06462-3.

DOI:10.1186/s12909-024-06462-3
PMID:39695612
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11654327/
Abstract

INTRODUCTION

Ensuring examiner equivalence across distributed assessment locations is a priority within distributed Objective Structured Clinical Exams (OSCEs) but is challenging as examiners are typically fully nested within locations (i.e. no overlap in performances seen by different groups of examiners). Video-based Examiner Score Comparison and Adjustment (VESCA) is a recently developed method which uses video-based linking to compare and (potentially) adjust for the effect of different groups of examiners within OSCEs. Whilst initial research on VESCA has been promising, the accuracy of the resulting adjusted scores is unknown. Given this, we aimed to investigate the accuracy of adjusted scores produced by VESCA under a range of plausible operational parameters.

METHODS

Using statistical simulation, we investigated how: 1/proportion of participating examiners, 2/ number of linking videos, 3/baseline differences in examiner stringency between schools (i.e. whether examiners in School A are, on average, more stringent than the examiners in School B), 4/number of OSCE stations and 5/different degrees of random error within examiners' judgements influenced accuracy of adjusted scores. We generated distributions of students' "true" performances across several stations, added examiner error, and simulated linking through crossed video-scoring (as occurs in VESCA). We then used Many Facet Rasch Modelling to produce an adjusted score for each student which we compared with their corresponding original "true" performance score. We replicated this 1000 times for each permutation to determine average error reduction and the proportion of students whose scores became more accurate. Simulation parameters were derived from a real, summative, whole curriculum undergraduate Year 3 OSCE at Keele University School of Medicine.

RESULTS

We found that in all conditions where no baseline difference existed between groups of examiners, score adjustment only minimally improved or even worsened score accuracy. Conversely, as the size of baseline differences between schools increased, adjustment accuracy increased, reducing error by up to 71% and making scores more accurate for up to 93% of students in the 20% baseline-difference condition.

CONCLUSIONS

Score adjustment through VESCA has the potential to substantially enhance equivalence for candidates in distributed OSCEs in some circumstances, whilst making scores less accurate in others. These findings will support judgements about when score adjustment may beneficially aid OSCE equivalence.

摘要

引言

在分布式客观结构化临床考试(OSCE)中,确保不同考试地点的考官评分等效是一个优先事项,但这具有挑战性,因为考官通常完全局限于各自所在的地点(即不同考官组看到的考生表现没有重叠)。基于视频的考官分数比较与调整(VESCA)是一种最近开发的方法,它利用基于视频的关联来比较并(可能)调整OSCE中不同考官组的影响。虽然对VESCA的初步研究很有前景,但调整后分数的准确性尚不清楚。鉴于此,我们旨在研究在一系列合理的操作参数下,VESCA产生的调整后分数的准确性。

方法

我们使用统计模拟来研究以下因素如何影响调整后分数的准确性:1/参与考官的比例、2/关联视频的数量、3/学校间考官严格程度的基线差异(即A校的考官平均而言是否比B校的考官更严格)、4/OSCE考站的数量以及5/考官判断中的不同程度随机误差。我们生成了学生在多个考站的“真实”表现分布,加入考官误差,并通过交叉视频评分模拟关联(如同VESCA中的情况)。然后,我们使用多面Rasch模型为每个学生生成一个调整后分数,并将其与相应的原始“真实”表现分数进行比较。我们对每个排列重复此过程1000次,以确定平均误差减少情况以及分数变得更准确的学生比例。模拟参数源自基尔大学医学院本科三年级真实的、总结性的全课程OSCE。

结果

我们发现,在考官组之间不存在基线差异的所有情况下,分数调整仅略微提高或甚至降低了分数准确性。相反,随着学校间基线差异的增大,调整准确性提高,在20%基线差异的情况下,误差最多可减少71%,并且高达93%的学生分数变得更准确。

结论

通过VESCA进行分数调整在某些情况下有可能显著提高分布式OSCE中考生的等效性,而在其他情况下会使分数准确性降低。这些发现将有助于判断分数调整何时可能对OSCE等效性有有益帮助。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c16a/11654327/c9551ff8f8bf/12909_2024_6462_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c16a/11654327/c9551ff8f8bf/12909_2024_6462_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c16a/11654327/c9551ff8f8bf/12909_2024_6462_Fig1_HTML.jpg

相似文献

1
Investigating the accuracy of adjusting for examiner differences in multi-centre Objective Structured Clinical Exams (OSCEs). A simulation study of video-based Examiner Score Comparison and Adjustment (VESCA).研究在多中心客观结构化临床考试(OSCE)中针对考官差异进行调整的准确性。一项基于视频的考官分数比较与调整(VESCA)模拟研究。
BMC Med Educ. 2024 Dec 18;24(1):1466. doi: 10.1186/s12909-024-06462-3.
2
Determining the influence of different linking patterns on the stability of students' score adjustments produced using Video-based Examiner Score Comparison and Adjustment (VESCA).确定不同链接模式对基于视频的考官评分比较和调整(VESCA)产生的学生分数调整稳定性的影响。
BMC Med Educ. 2022 Jan 17;22(1):41. doi: 10.1186/s12909-022-03115-1.
3
Developing a video-based method to compare and adjust examiner effects in fully nested OSCEs.开发一种基于视频的方法,以比较和调整完全嵌套 OSCE 中的考官效应。
Med Educ. 2019 Mar;53(3):250-263. doi: 10.1111/medu.13783. Epub 2018 Dec 21.
4
Using video-based examiner score comparison and adjustment (VESCA) to compare the influence of examiners at different sites in a distributed objective structured clinical exam (OSCE).利用基于视频的考官评分比较和调整(VESCA)比较分布式客观结构化临床考试(OSCE)中不同站点考官的影响。
BMC Med Educ. 2023 Oct 26;23(1):803. doi: 10.1186/s12909-023-04774-4.
5
Measuring the Effect of Examiner Variability in a Multiple-Circuit Objective Structured Clinical Examination (OSCE).测量多站式客观结构化临床考试(OSCE)中主考人变异性的效果。
Acad Med. 2021 Aug 1;96(8):1189-1196. doi: 10.1097/ACM.0000000000004028. Epub 2021 Mar 2.
6
Enhancing authenticity, diagnosticity and quivalence (AD-Equiv) in multicentre OSCE exams in health professionals education: protocol for a complex intervention study.提高医学专业多中心 OSCE 考试的真实性、诊断性和等效性(AD-Equiv):一项复杂干预研究方案。
BMJ Open. 2022 Dec 7;12(12):e064387. doi: 10.1136/bmjopen-2022-064387.
7
Inter-school variations in the standard of examiners' graduation-level OSCE judgements.考官对毕业水平客观结构化临床考试判断标准的校际差异。
Med Teach. 2025 Apr;47(4):735-743. doi: 10.1080/0142159X.2024.2372087. Epub 2024 Jul 8.
8
Standardized examinees: development of a new tool to evaluate factors influencing OSCE scores and to train examiners.标准化考生:开发一种新工具,以评估影响客观结构化临床考试分数的因素并培训考官。
GMS J Med Educ. 2020 Jun 15;37(4):Doc40. doi: 10.3205/zma001333. eCollection 2020.
9
Hawks, Doves and Rasch decisions: Understanding the influence of different cycles of an OSCE on students' scores using Many Facet Rasch Modeling.鹰派、鸽派与拉施决策:运用多面拉施模型理解客观结构化临床考试不同轮次对学生成绩的影响
Med Teach. 2017 Jan;39(1):92-99. doi: 10.1080/0142159X.2017.1248916. Epub 2016 Nov 29.
10
Examiners' Perceptions in Surgical Education: The Blind Spot in the Assessment of OSCEs.考官在外科教育中的看法:客观结构化临床考试评估中的盲点。
J Surg Educ. 2021 Mar-Apr;78(2):590-596. doi: 10.1016/j.jsurg.2020.07.024. Epub 2020 Aug 18.

本文引用的文献

1
Inter-school variations in the standard of examiners' graduation-level OSCE judgements.考官对毕业水平客观结构化临床考试判断标准的校际差异。
Med Teach. 2025 Apr;47(4):735-743. doi: 10.1080/0142159X.2024.2372087. Epub 2024 Jul 8.
2
Using video-based examiner score comparison and adjustment (VESCA) to compare the influence of examiners at different sites in a distributed objective structured clinical exam (OSCE).利用基于视频的考官评分比较和调整(VESCA)比较分布式客观结构化临床考试(OSCE)中不同站点考官的影响。
BMC Med Educ. 2023 Oct 26;23(1):803. doi: 10.1186/s12909-023-04774-4.
3
Determining the influence of different linking patterns on the stability of students' score adjustments produced using Video-based Examiner Score Comparison and Adjustment (VESCA).
确定不同链接模式对基于视频的考官评分比较和调整(VESCA)产生的学生分数调整稳定性的影响。
BMC Med Educ. 2022 Jan 17;22(1):41. doi: 10.1186/s12909-022-03115-1.
4
Determining influence, interaction and causality of contrast and sequence effects in objective structured clinical exams.确定客观结构化临床考试中对比和序列效应的影响、相互作用和因果关系。
Med Educ. 2022 Mar;56(3):292-302. doi: 10.1111/medu.14713. Epub 2022 Jan 11.
5
Measuring the Effect of Examiner Variability in a Multiple-Circuit Objective Structured Clinical Examination (OSCE).测量多站式客观结构化临床考试(OSCE)中主考人变异性的效果。
Acad Med. 2021 Aug 1;96(8):1189-1196. doi: 10.1097/ACM.0000000000004028. Epub 2021 Mar 2.
6
Exploring differences in individual and group judgements in standard setting.探讨标准制定中个体和群体判断的差异。
Med Educ. 2019 Sep;53(9):941-952. doi: 10.1111/medu.13915. Epub 2019 Jul 2.
7
DRIFT happens, sometimes: Examining time based rater variance in a high-stakes OSCE.有时会出现漂移现象:在高风险 OSCE 中考察基于时间的评分者差异。
Med Teach. 2019 Jul;41(7):819-823. doi: 10.1080/0142159X.2019.1585788. Epub 2019 Apr 6.
8
Developing a video-based method to compare and adjust examiner effects in fully nested OSCEs.开发一种基于视频的方法,以比较和调整完全嵌套 OSCE 中的考官效应。
Med Educ. 2019 Mar;53(3):250-263. doi: 10.1111/medu.13783. Epub 2018 Dec 21.
9
On Rating Angels: The Halo Effect and Straight Line Scoring.论天使评分:光环效应与直线评分法
J Grad Med Educ. 2017 Dec;9(6):721-723. doi: 10.4300/JGME-D-17-00644.1.
10
Variation in passing standards for graduation-level knowledge items at UK medical schools.英国医学院毕业水平知识项目通过标准的差异。
Med Educ. 2017 Jun;51(6):612-620. doi: 10.1111/medu.13240. Epub 2017 Mar 13.