Measuring the Effect of Examiner Variability in a Multiple-Circuit Objective Structured Clinical Examination (OSCE).

Author information

P. Yeates is a senior lecturer in medical education research, School of Medicine, Keele University, Keele, Staffordshire, and a consultant in acute and respiratory medicine, Fairfield General Hospital, Pennine Acute Hospitals, NHS Trust, Bury, Lancashire, United Kingdom; ORCID: https://orcid.org/0000-0001-6316-4051 .

A. Moult is a research assistant in medical education, School of Medicine, Keele University, Keele, Staffordshire, United Kingdom; ORCID: https://orcid.org/0000-0002-9424-5660 .

Publication information

Acad Med. 2021 Aug 1;96(8):1189-1196. doi: 10.1097/ACM.0000000000004028. Epub 2021 Mar 2.

DOI:10.1097/ACM.0000000000004028
PMID:33656012
Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC8300845/
Abstract

PURPOSE

Ensuring that examiners in different parallel circuits of objective structured clinical examinations (OSCEs) judge to the same standard is critical to the chain of validity. Recent work suggests examiner-cohort (i.e., the particular group of examiners) could significantly alter outcomes for some candidates. Despite this, examiner-cohort effects are rarely examined since fully nested data (i.e., no crossover between the students judged by different examiner groups) limit comparisons. In this study, the authors aim to replicate and further develop a novel method called Video-based Examiner Score Comparison and Adjustment (VESCA), so it can be used to enhance quality assurance of distributed or national OSCEs.

METHOD

In 2019, 6 volunteer students were filmed on 12 stations in a summative OSCE. In addition to examining live student performances, examiners from 8 separate examiner-cohorts scored the pool of video performances. Examiners scored videos specific to their station. Video scores linked otherwise fully nested data, enabling comparisons by Many Facet Rasch Modeling. Authors compared and adjusted for examiner-cohort effects. They also compared examiners' scores when videos were embedded (interspersed between live students during the OSCE) or judged later via the Internet.
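
The linking idea can be illustrated with a small connectivity check. This is only a sketch under stated assumptions: the cohort and performance labels are hypothetical, the number of live students per cohort is invented, and the per-station structure of the real exam is ignored. It is not the Many Facet Rasch analysis described here; it only shows that cohorts with no shared students cannot be compared until a common pool of performances connects them.

```python
def count_components(ratings):
    """Count connected groups in a cohort/performance rating graph.

    `ratings` is an iterable of (cohort, performance) pairs. Two cohorts
    can be placed on a common scale only if a chain of shared
    performances connects them.
    """
    parent = {}

    def find(node):
        # Union-find lookup with path halving.
        parent.setdefault(node, node)
        while parent[node] != node:
            parent[node] = parent[parent[node]]
            node = parent[node]
        return node

    for cohort, performance in ratings:
        # Merge the cohort's group with the performance's group.
        parent[find(("cohort", cohort))] = find(("perf", performance))

    return len({find(node) for node in list(parent)})


# Hypothetical fully nested design: each of 8 cohorts sees only its own students.
nested = [(c, f"live-{c}-{s}") for c in range(8) for s in range(3)]

# Hypothetical linking data: every cohort also scores the same 6 video performances.
videos = [(c, f"video-{v}") for c in range(8) for v in range(6)]

print(count_components(nested))           # 8 disconnected groups
print(count_components(nested + videos))  # 1 connected group
```

With only the nested ratings, each cohort forms its own isolated group; once the shared videos are added, all cohorts fall into one connected group, which is the precondition for estimating cohort effects on a common scale.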

RESULTS

Having accounted for differences in students' ability, different examiner-cohort scores for the same ability of student ranged from 18.57 of 27 (68.8%) to 20.49 (75.9%), Cohen's d = 1.3. Score adjustment changed the pass/fail classification for up to 16% of students depending on the modeled cut score. Internet and embedded video scoring showed no difference in mean scores or variability. Examiners' accuracy did not deteriorate over the 3-week Internet scoring period.
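
The reported figures can be checked with simple arithmetic. The sketch below recomputes the percentages from the raw scores and, assuming Cohen's d is the score difference divided by a standard deviation (the abstract does not say which standard deviation was used), backs out the standard deviation that the reported d would imply. The variable names are illustrative only.

```python
MAX_SCORE = 27
lowest, highest = 18.57, 20.49  # adjusted cohort scores from the abstract
reported_d = 1.3


def as_percent(score, max_score=MAX_SCORE):
    """Express a raw score as a percentage of the maximum, to 1 decimal place."""
    return round(100 * score / max_score, 1)


gap = highest - lowest         # difference in raw score points
implied_sd = gap / reported_d  # assumes d = difference / SD

print(as_percent(lowest), as_percent(highest))  # 68.8 75.9
print(round(gap, 2), round(implied_sd, 2))      # 1.92 1.48
```

Under that assumption, a gap of about 1.92 points between the most lenient and most stringent cohorts corresponds to a standard deviation of roughly 1.48 points.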

CONCLUSIONS

Examiner-cohorts produced a replicable, significant influence on OSCE scores that was unaccounted for by typical assessment psychometrics. VESCA offers a promising means to enhance validity and fairness in distributed OSCEs or national exams. Internet-based scoring may enhance VESCA's feasibility.

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fccb/8300845/359b315e371b/acm-96-1189-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fccb/8300845/05f46e653ff0/acm-96-1189-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fccb/8300845/ac441b8e9970/acm-96-1189-g003.jpg

Similar articles

1
Measuring the Effect of Examiner Variability in a Multiple-Circuit Objective Structured Clinical Examination (OSCE).
Acad Med. 2021 Aug 1;96(8):1189-1196. doi: 10.1097/ACM.0000000000004028. Epub 2021 Mar 2.
2
Developing a video-based method to compare and adjust examiner effects in fully nested OSCEs.
Med Educ. 2019 Mar;53(3):250-263. doi: 10.1111/medu.13783. Epub 2018 Dec 21.
3
Using video-based examiner score comparison and adjustment (VESCA) to compare the influence of examiners at different sites in a distributed objective structured clinical exam (OSCE).
BMC Med Educ. 2023 Oct 26;23(1):803. doi: 10.1186/s12909-023-04774-4.
4
Investigating the accuracy of adjusting for examiner differences in multi-centre Objective Structured Clinical Exams (OSCEs). A simulation study of video-based Examiner Score Comparison and Adjustment (VESCA).
BMC Med Educ. 2024 Dec 18;24(1):1466. doi: 10.1186/s12909-024-06462-3.
5
Determining the influence of different linking patterns on the stability of students' score adjustments produced using Video-based Examiner Score Comparison and Adjustment (VESCA).
BMC Med Educ. 2022 Jan 17;22(1):41. doi: 10.1186/s12909-022-03115-1.
6
Inter-school variations in the standard of examiners' graduation-level OSCE judgements.
Med Teach. 2025 Apr;47(4):735-743. doi: 10.1080/0142159X.2024.2372087. Epub 2024 Jul 8.
7
Standardized examinees: development of a new tool to evaluate factors influencing OSCE scores and to train examiners.
GMS J Med Educ. 2020 Jun 15;37(4):Doc40. doi: 10.3205/zma001333. eCollection 2020.
8
Determining influence, interaction and causality of contrast and sequence effects in objective structured clinical exams.
Med Educ. 2022 Mar;56(3):292-302. doi: 10.1111/medu.14713. Epub 2022 Jan 11.
9
Enhancing authenticity, diagnosticity and equivalence (AD-Equiv) in multicentre OSCE exams in health professionals education: protocol for a complex intervention study.
BMJ Open. 2022 Dec 7;12(12):e064387. doi: 10.1136/bmjopen-2022-064387.
10
Hawks, Doves and Rasch decisions: Understanding the influence of different cycles of an OSCE on students' scores using Many Facet Rasch Modeling.
Med Teach. 2017 Jan;39(1):92-99. doi: 10.1080/0142159X.2017.1248916. Epub 2016 Nov 29.

Cited by

1
Investigating the accuracy of adjusting for examiner differences in multi-centre Objective Structured Clinical Exams (OSCEs). A simulation study of video-based Examiner Score Comparison and Adjustment (VESCA).
BMC Med Educ. 2024 Dec 18;24(1):1466. doi: 10.1186/s12909-024-06462-3.
2
Development of a dynamic clinical assessment for finals.
Br Dent J. 2024 Nov;237(10):795-800. doi: 10.1038/s41415-024-8028-x. Epub 2024 Nov 22.
3
The equivalence of a high-stakes objective structured clinical exam adapted to suit a virtual delivery format.
J Eval Clin Pract. 2025 Feb;31(1):e14167. doi: 10.1111/jep.14167. Epub 2024 Oct 24.
4
Digital Subtraction Angiography of Cerebral Arteries: Influence of Cranial Dimensions on X-ray Tube Performance.
J Clin Med. 2024 May 20;13(10):3002. doi: 10.3390/jcm13103002.
5
Accuracy of Entrustment-Based Assessment: Implications for Programs and Patients.
J Grad Med Educ. 2024 Feb;16(1):30-36. doi: 10.4300/JGME-D-23-00275.1. Epub 2024 Feb 17.
6
Using video-based examiner score comparison and adjustment (VESCA) to compare the influence of examiners at different sites in a distributed objective structured clinical exam (OSCE).
BMC Med Educ. 2023 Oct 26;23(1):803. doi: 10.1186/s12909-023-04774-4.
7
Towards a more nuanced conceptualisation of differential examiner stringency in OSCEs.
Adv Health Sci Educ Theory Pract. 2024 Jul;29(3):919-934. doi: 10.1007/s10459-023-10289-w. Epub 2023 Oct 16.
8
The Evaluation of a High-Fidelity Simulation Model and Video Instruction Used to Teach Canine Dental Skills to Pre-Clinical Veterinary Students.
Vet Sci. 2023 Aug 16;10(8):526. doi: 10.3390/vetsci10080526.
9
Enhancing authenticity, diagnosticity and equivalence (AD-Equiv) in multicentre OSCE exams in health professionals education: protocol for a complex intervention study.
BMJ Open. 2022 Dec 7;12(12):e064387. doi: 10.1136/bmjopen-2022-064387.
10
Pass/fail decisions and standards: the impact of differential examiner stringency on OSCE outcomes.
Adv Health Sci Educ Theory Pract. 2022 May;27(2):457-473. doi: 10.1007/s10459-022-10096-9. Epub 2022 Mar 1.