• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

应用通用理论优化本科医学生高风险客观结构化临床考试的质量:来自法国医学院的经验

Applying generalized theory to optimize the quality of high-stakes objective structured clinical examinations for undergraduate medical students: experience from the French medical school.

作者信息

Feigerlova Eva

机构信息

Centre Universitaire d'Enseignement par Simulation - CUESim, Faculté de Médecine, Maïeutique et Métiers de la Santé, Vandoeuvre-lès-Nancy, 54505, France.

Université de Lorraine, Inserm, DCAC, Vandoeuvre-lès-Nancy, 54505, France.

出版信息

BMC Med Educ. 2025 May 2;25(1):643. doi: 10.1186/s12909-025-07255-y.

DOI:10.1186/s12909-025-07255-y
PMID:40317009
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC12046744/
Abstract

BACKGROUND

The national OSCE examination has recently been adopted in France as a prerequisite for medical students to enter accredited graduate education programs. However, the reliability and generalizability of OSCE scores are not well explored taking into account the national examination blueprint.

METHOD

To obtain complementary information for monitoring and improving the quality of the OSCE we performed a pilot study applying generalizability (G-)theory on a sample of 6th-year undergraduate medical students (n = 73) who were assessed by 24 examiner pairs at three stations. Based on the national blueprint, three different scoring subunits (a dichotomous task-specific checklist evaluating clinical skills and behaviorally anchored scales evaluating generic skills and a global performance scale) were used to evaluate students and combined into a station score. A variance component analysis was performed using mixed modelling to identify the impact of different facets (station, student and student x station interactions) on the scoring subunits. The generalizability and dependability statistics were calculated.

RESULTS

There was no significant difference between mean scores attributable to different examiner pairs across the data. The examiner variance component was greater for the clinical skills score (14.4%) than for the generic skills (5.6%) and global performance scores (5.1%). The station variance component was largest for the clinical skills score, accounting for 22.9% of the total score variance, compared to 3% for the generic skills and 13.9% for global performance scores. The variance component related to student represented 12% of the total variance for clinicals skills, 17.4% for generic skills and 14.3% for global performance ratings. The combined generalizability coefficients across all the data were 0.59 for the clinical skills score, 0.93 for the generic skills score and 0.75 for global performance.

CONCLUSIONS

The combined estimates of relative reliability across all data are greater for generic skills scores and global performance ratings than for clinical skills scores. This is likely explained by the fact that content-specific tasks evaluated using checklists produce greater variability in scores than scales evaluating broader competencies. This work can be valuable to other teaching institutions, as monitoring the sources of errors is a principal quality control strategy to ensure valid interpretations of the students' scores.

摘要

背景

法国最近采用国家客观结构化临床考试(OSCE)作为医学生进入认可的研究生教育项目的前提条件。然而,考虑到国家考试蓝图,OSCE分数的可靠性和普遍性尚未得到充分探讨。

方法

为了获取用于监测和提高OSCE质量的补充信息,我们进行了一项试点研究,对73名本科六年级医学生样本应用概化(G-)理论,这些学生在三个站点由24对考官进行评估。根据国家蓝图,使用三个不同的评分子单元(一个评估临床技能的二分任务特定检查表、评估通用技能的行为锚定量表和一个整体表现量表)对学生进行评估,并合并为站点分数。使用混合模型进行方差成分分析,以确定不同方面(站点、学生和学生×站点交互)对评分子单元的影响。计算概化性和可靠性统计量。

结果

不同考官对的数据的平均分数之间没有显著差异。临床技能分数的考官方差成分(14.4%)大于通用技能(5.6%)和整体表现分数(5.1%)。临床技能分数的站点方差成分最大占总分方差的22.9%,相比之下通用技能为3%,整体表现分数为13.9%。与学生相关的方差成分在临床技能总分方差中占12%,通用技能中占17.4%,整体表现评分中占14.3%。所有数据中临床技能分数的综合概化系数为0.59,通用技能分数为0.93,整体表现为0.75。

结论

所有数据中通用技能分数和整体表现评分的相对可靠性综合估计值高于临床技能分数。这可能是因为使用检查表评估的特定内容任务比分评估更广泛能力的量表产生的分数变异性更大。这项工作对其他教学机构可能有价值,因为监测误差来源是确保对学生分数进行有效解释的主要质量控制策略。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ecfd/12046744/8ee5663acf41/12909_2025_7255_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ecfd/12046744/8ee5663acf41/12909_2025_7255_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ecfd/12046744/8ee5663acf41/12909_2025_7255_Fig1_HTML.jpg

相似文献

1
Applying generalized theory to optimize the quality of high-stakes objective structured clinical examinations for undergraduate medical students: experience from the French medical school.应用通用理论优化本科医学生高风险客观结构化临床考试的质量:来自法国医学院的经验
BMC Med Educ. 2025 May 2;25(1):643. doi: 10.1186/s12909-025-07255-y.
2
The educational effects of portfolios on undergraduate student learning: a Best Evidence Medical Education (BEME) systematic review. BEME Guide No. 11.档案袋对本科学生学习的教育效果:最佳证据医学教育(BEME)系统评价。BEME指南第11号。
Med Teach. 2009 Apr;31(4):282-98. doi: 10.1080/01421590902889897.
3
Prescription of Controlled Substances: Benefits and Risks管制药品的处方:益处与风险
4
Correlation between task-based checklists and global rating scores in undergraduate objective structured clinical examinations in Saudi Arabia: a 1-year comparative study.沙特阿拉伯本科客观结构化临床考试中基于任务的检查表与整体评分分数之间的相关性:一项为期1年的比较研究。
J Educ Eval Health Prof. 2025;22:19. doi: 10.3352/jeehp.2025.22.19. Epub 2025 Jun 19.
5
Health professionals' experience of teamwork education in acute hospital settings: a systematic review of qualitative literature.医疗专业人员在急症医院环境中团队合作教育的经验:对定性文献的系统综述
JBI Database System Rev Implement Rep. 2016 Apr;14(4):96-137. doi: 10.11124/JBISRIR-2016-1843.
6
Validation of Checklists and Evaluation of Clinical Skills in Cases of Abdominal Pain With Simulation in Formative, Objective, Structured Clinical Examination With Audiovisual Content in Third-Year Medical Students' Surgical Clerkship.在第三年医学生外科实习中,使用形成性、客观、结构化临床考试中的模拟病例进行腹痛检查表验证和临床技能评估,同时具有视听内容。
J Surg Educ. 2024 Nov;81(11):1756-1763. doi: 10.1016/j.jsurg.2024.08.016. Epub 2024 Sep 20.
7
Competency evaluation using randomized testing: feasibility of a new structured assessment method.使用随机测试进行能力评估:一种新的结构化评估方法的可行性
Adv Physiol Educ. 2025 Sep 1;49(3):801-806. doi: 10.1152/advan.00111.2025. Epub 2025 Jul 3.
8
Whether case-based teaching combined with the flipped classroom is more valuable than traditional lecture-based teaching methods in clinical medical education: a systematic review and meta-analysis.在临床医学教育中,基于案例的教学与翻转课堂相结合是否比传统的基于讲座的教学方法更具价值:一项系统评价与荟萃分析。
BMC Med Educ. 2025 Jul 1;25(1):906. doi: 10.1186/s12909-025-07465-4.
9
The effectiveness of tools used to evaluate successful critical decision making skills for applicants to healthcare graduate educational programs: a systematic review.用于评估医疗保健研究生教育项目申请者成功关键决策技能的工具的有效性:一项系统综述。
JBI Database System Rev Implement Rep. 2015 May 15;13(4):231-75. doi: 10.11124/jbisrir-2015-2322.
10
A systematic review of the reliability of objective structured clinical examination scores.客观结构化临床考试成绩可靠性的系统评价。
Med Educ. 2011 Dec;45(12):1181-9. doi: 10.1111/j.1365-2923.2011.04075.x. Epub 2011 Oct 11.

本文引用的文献

1
Measuring and correcting staff variability in large-scale OSCEs.测量和纠正大规模客观结构化临床考试中的工作人员变异性。
BMC Med Educ. 2024 Jul 29;24(1):817. doi: 10.1186/s12909-024-05803-6.
2
Variance due to the examination conditions and factors associated with success in objective structured clinical examinations (OSCEs): first experiences at Paris-Saclay medical school.由于检查条件的差异以及与客观结构化临床考试(OSCE)成功相关的因素:巴黎萨克雷医学院的初步经验。
BMC Med Educ. 2024 Jul 2;24(1):716. doi: 10.1186/s12909-024-05688-5.
3
Pass/fail decisions and standards: the impact of differential examiner stringency on OSCE outcomes.
通过/失败决策和标准:不同主考人严格程度对客观结构化临床考试结果的影响。
Adv Health Sci Educ Theory Pract. 2022 May;27(2):457-473. doi: 10.1007/s10459-022-10096-9. Epub 2022 Mar 1.
4
Building reliable and generalizable clerkship competency assessments: Impact of 'hawk-dove' correction.构建可靠且可推广的临床实习能力评估:“鹰鸽”校正的影响
Med Teach. 2021 Dec;43(12):1374-1380. doi: 10.1080/0142159X.2021.1948519. Epub 2021 Sep 17.
5
Validation Evidence using Generalizability Theory for an Objective Structured Clinical Examination.使用概化理论对客观结构化临床考试进行效度验证的证据
Innov Pharm. 2021 Feb 26;12(1). doi: 10.24926/iip.v12i1.2110. eCollection 2021.
6
Use of Generalizability Theory for Exploring Reliability of and Sources of Variance in Assessment of Technical Skills: A Systematic Review and Meta-Analysis.运用概化理论探究技术技能评估中变异性的可靠性和来源:系统评价和荟萃分析。
Acad Med. 2021 Nov 1;96(11):1609-1619. doi: 10.1097/ACM.0000000000004150.
7
Inter-rater reliability in clinical assessments: do examiner pairings influence candidate ratings?临床评估中的评分者间信度: examiner pairings 是否会影响考生评分?
BMC Med Educ. 2020 May 11;20(1):147. doi: 10.1186/s12909-020-02009-4.
8
Examiner effect on the objective structured clinical exam - a study at five medical schools.考官对客观结构化临床考试的影响——一项在五所医学院校开展的研究
BMC Med Educ. 2017 Apr 24;17(1):71. doi: 10.1186/s12909-017-0908-1.
9
Making sense of Cronbach's alpha.理解克朗巴哈系数。
Int J Med Educ. 2011 Jun 27;2:53-55. doi: 10.5116/ijme.4dfb.8dfd.
10
Reliability analysis of the objective structured clinical examination using generalizability theory.基于概化理论的客观结构化临床考试信度分析
Med Educ Online. 2016 Aug 18;21:31650. doi: 10.3402/meo.v21.31650. eCollection 2016.