• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

客观结构化临床考试的时间稳定性:采用项目反应理论的纵向研究。

Temporal stability of objective structured clinical exams: a longitudinal study employing item response theory.

机构信息

Medical Education, College of Medicine, King Saud bin Abdulaziz University for Health Sciences, Riyadh, Saudi Arabia.

出版信息

BMC Med Educ. 2012 Dec 7;12:121. doi: 10.1186/1472-6920-12-121.

DOI:10.1186/1472-6920-12-121
PMID:23216816
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC3552978/
Abstract

BACKGROUND

The objective structure clinical examination (OSCE) has been used since the early 1970s for assessing clinical competence. There are very few studies that have examined the psychometric stability of the stations that are used repeatedly with different samples. The purpose of the present study was to assess the stability of objective structured clinical exams (OSCEs) employing the same stations used over time but with a different sample of candidates, SPs, and examiners.

METHODS

At Time 1, 191 candidates and at Time 2 (one year apart), 236 candidates participated in a 10-station OSCE; 6 of the same stations were used in both years. Generalizability analyses (Ep2) were conducted. Employing item response analyses, test characteristic curves (TCC) were derived for each of the 6 stations for a 2-parameter model. The TCCs were compared across the two years, Time 1 and 2.

RESULTS

The Ep2 of the OSCEs exceeded.70. Standardized thetas (θ) and discriminations were equivalent for the same station across the two year period indicating equivalent TCCs for a 2-parameter model.

CONCLUSION

The 6 OSCE stations used by the AIMG program over two years have adequate internal consistency reliability, stable generalizability (Ep2) and equivalent test characteristics. The process of assessment employed for IMG's are stable OSCE stations that may be used several times over without compromising psychometric properties.With careful security, high-stakes OSCEs may use the same stations that have high internal consistency and generalizability repeatedly as the psychometric properties are stable over several years with different samples of candidates.

摘要

背景

客观结构临床考试(OSCE)自 20 世纪 70 年代初以来一直被用于评估临床能力。很少有研究检验过在不同样本中重复使用的站点的心理测量稳定性。本研究的目的是评估使用相同站点但具有不同候选人、SP 和考官样本的客观结构化临床考试(OSCE)的稳定性。

方法

在时间 1,有 191 名候选人,在时间 2(相隔一年),有 236 名候选人参加了 10 站 OSCE;其中 6 个相同的站点在两年内都有使用。进行了可概括性分析(Ep2)。利用项目反应分析,为每个 6 个站点的 2 个参数模型得出了测试特征曲线(TCC)。在两年,即时间 1 和 2 之间,对 TCC 进行了比较。

结果

OSCE 的 Ep2 超过 0.70。在两年期间,同一站点的标准化θ和区分度是等效的,表明 2 个参数模型的 TCC 等效。

结论

AIMG 项目在两年内使用的 6 个 OSCE 站点具有足够的内部一致性可靠性、稳定的可概括性(Ep2)和等效的测试特征。为 IMG 采用的评估过程是稳定的 OSCE 站点,可以多次使用而不会影响心理测量特性。在精心的安全措施下,高风险的 OSCE 可以重复使用具有高内部一致性和可概括性的相同站点,因为其心理测量特性在几年内对不同的候选人群体都是稳定的。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0121/3552978/8eaf5aa194e2/1472-6920-12-121-1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0121/3552978/8eaf5aa194e2/1472-6920-12-121-1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0121/3552978/8eaf5aa194e2/1472-6920-12-121-1.jpg

相似文献

1
Temporal stability of objective structured clinical exams: a longitudinal study employing item response theory.客观结构化临床考试的时间稳定性:采用项目反应理论的纵向研究。
BMC Med Educ. 2012 Dec 7;12:121. doi: 10.1186/1472-6920-12-121.
2
Accuracy of portrayal by standardized patients: results from four OSCE stations conducted for high stakes examinations.标准化患者的描绘准确性:在四项高风险考试中进行的 OSCE 站的结果。
BMC Med Educ. 2014 May 19;14:97. doi: 10.1186/1472-6920-14-97.
3
Development and evaluation of a spiral model of assessing EBM competency using OSCEs in undergraduate medical education.运用 OSCE 评估本科医学教育中循证医学能力的螺旋式模型的开发与评估。
BMC Med Educ. 2021 Apr 10;21(1):204. doi: 10.1186/s12909-021-02650-7.
4
Item analysis to improve reliability for an internal medicine undergraduate OSCE.项目分析以提高内科本科客观结构化临床考试的可靠性。
Adv Health Sci Educ Theory Pract. 2005;10(2):105-13. doi: 10.1007/s10459-005-2315-3.
5
Assessment of first-year veterinary students' clinical skills using objective structured clinical examinations.使用客观结构化临床考试评估一年级兽医学生的临床技能。
J Vet Med Educ. 2010 Winter;37(4):395-402. doi: 10.3138/jvme.37.4.395.
6
The consistency and uncertainty in examiners' definitions of pass/fail performance on OSCE (objective structured clinical examination) stations.考官对客观结构化临床考试(OSCE)站点及格/不及格表现定义的一致性与不确定性。
Eval Health Prof. 1996 Mar;19(1):118-24. doi: 10.1177/016327879601900109.
7
A systematic review of the reliability of objective structured clinical examination scores.客观结构化临床考试成绩可靠性的系统评价。
Med Educ. 2011 Dec;45(12):1181-9. doi: 10.1111/j.1365-2923.2011.04075.x. Epub 2011 Oct 11.
8
Feasibility and reliability of the pandemic-adapted online-onsite hybrid graduation OSCE in Japan.日本大流行适应型线上-线下混合毕业客观结构化临床考试的可行性和可靠性。
Adv Health Sci Educ Theory Pract. 2024 Jul;29(3):949-965. doi: 10.1007/s10459-023-10290-3. Epub 2023 Oct 18.
9
CARECOS study: Medical students' empathy as assessed with the CARE measure by examiners versus standardized patients during a formative Objective and Structured Clinical Examination (OSCE) station.CARECOS 研究:在形成性客观结构化临床考试(OSCE)站中,通过考官和标准化患者使用 CARE 措施评估医学生的同理心。
Med Teach. 2024 Sep;46(9):1187-1195. doi: 10.1080/0142159X.2024.2306840. Epub 2024 Jan 29.
10
Standardized examinees: development of a new tool to evaluate factors influencing OSCE scores and to train examiners.标准化考生:开发一种新工具,以评估影响客观结构化临床考试分数的因素并培训考官。
GMS J Med Educ. 2020 Jun 15;37(4):Doc40. doi: 10.3205/zma001333. eCollection 2020.

引用本文的文献

1
Academic performance and anxiety in the evaluation of health science undergraduate students using osce: a pre- and post-covid-19 cohort study.使用客观结构化临床考试评估健康科学专业本科生的学业成绩与焦虑:一项新冠疫情前后的队列研究
BMC Med Educ. 2025 Aug 18;25(1):1171. doi: 10.1186/s12909-025-07780-w.
2
Applying generalized theory to optimize the quality of high-stakes objective structured clinical examinations for undergraduate medical students: experience from the French medical school.应用通用理论优化本科医学生高风险客观结构化临床考试的质量:来自法国医学院的经验
BMC Med Educ. 2025 May 2;25(1):643. doi: 10.1186/s12909-025-07255-y.
3

本文引用的文献

1
Assessment of clinical skills with standardized patients: state of the art revisited.使用标准化病人评估临床技能:重新审视当前的技术水平
Teach Learn Med. 2013;25 Suppl 1:S17-25. doi: 10.1080/10401334.2013.842916.
2
Assessing clinical communication skills in physicians: are the skills context specific or generalizable.评估医生的临床沟通技巧:这些技巧是因具体情境而异还是具有通用性?
BMC Med Educ. 2009 May 15;9:22. doi: 10.1186/1472-6920-9-22.
3
Quality control of an OSCE using generalizability theory and many-faceted Rasch measurement.
The equivalence of a high-stakes objective structured clinical exam adapted to suit a virtual delivery format.
适应虚拟交付形式的高风险客观结构化临床考试的等效性。
J Eval Clin Pract. 2025 Feb;31(1):e14167. doi: 10.1111/jep.14167. Epub 2024 Oct 24.
4
Item response theory model highlighting rating scale of a rubric and rater-rubric interaction in objective structured clinical examination.项目反应理论模型突出了客观结构化临床考试中等级量表的评分和评分者-等级量表的交互作用。
PLoS One. 2024 Sep 6;19(9):e0309887. doi: 10.1371/journal.pone.0309887. eCollection 2024.
5
A many-facet Rasch measurement model approach to investigating objective structured clinical examination item parameter drift.一种用于研究客观结构化临床考试项目参数漂移的多维度Rasch测量模型方法。
J Eval Clin Pract. 2025 Feb;31(1):e14114. doi: 10.1111/jep.14114. Epub 2024 Jul 29.
6
Feasibility and reliability of the pandemic-adapted online-onsite hybrid graduation OSCE in Japan.日本大流行适应型线上-线下混合毕业客观结构化临床考试的可行性和可靠性。
Adv Health Sci Educ Theory Pract. 2024 Jul;29(3):949-965. doi: 10.1007/s10459-023-10290-3. Epub 2023 Oct 18.
7
A Short Note on Optimizing Cost-Generalizability via a Machine-Learning Approach.关于通过机器学习方法优化成本通用性的简短说明。
Educ Psychol Meas. 2021 Dec;81(6):1221-1233. doi: 10.1177/0013164421992112. Epub 2021 Feb 8.
8
Use of Eye-Tracking Technology by Medical Students Taking the Objective Structured Clinical Examination: Descriptive Study.医学生在使用客观结构化临床考试时使用眼动追踪技术:描述性研究。
J Med Internet Res. 2020 Aug 21;22(8):e17719. doi: 10.2196/17719.
9
Reliability analysis of the objective structured clinical examination using generalizability theory.基于概化理论的客观结构化临床考试信度分析
Med Educ Online. 2016 Aug 18;21:31650. doi: 10.3402/meo.v21.31650. eCollection 2016.
10
Clinical assessment of transthoracic echocardiography skills: a generalizability study.经胸超声心动图技能的临床评估:一项可推广性研究。
BMC Med Educ. 2015 Feb 1;15:9. doi: 10.1186/s12909-015-0294-5.
使用概化理论和多面Rasch测量法对客观结构化临床考试进行质量控制。
Adv Health Sci Educ Theory Pract. 2008 Nov;13(4):479-93. doi: 10.1007/s10459-007-9060-8. Epub 2007 Feb 20.
4
The presence and impact of local item dependence on objective structured clinical examinations scores and the potential use of the polytomous, many-facet Rasch model.局部项目依赖对客观结构化临床考试分数的影响及其存在情况,以及多值、多维度Rasch模型的潜在应用。
J Manipulative Physiol Ther. 2006 Oct;29(8):651-7. doi: 10.1016/j.jmpt.2006.08.002.
5
Using item response theory to explore the psychometric properties of extended matching questions examination in undergraduate medical education.运用项目反应理论探索本科医学教育中扩展匹配题考试的心理测量特性。
BMC Med Educ. 2005 Mar 7;5(1):9. doi: 10.1186/1472-6920-5-9.
6
Reliability: on the reproducibility of assessment data.可靠性:关于评估数据的可重复性。
Med Educ. 2004 Sep;38(9):1006-12. doi: 10.1111/j.1365-2929.2004.01932.x.
7
Techniques for measuring clinical competence: objective structured clinical examinations.临床能力测量技术:客观结构化临床考试
Med Educ. 2004 Feb;38(2):199-203. doi: 10.1111/j.1365-2923.2004.01755.x.
8
Detecting score drift in a high-stakes performance-based assessment.在基于高风险表现的评估中检测分数漂移。
Adv Health Sci Educ Theory Pract. 2004;9(1):29-38. doi: 10.1023/B:AHSE.0000012214.40340.03.
9
Simulated and standardized patients in OSCEs: achievements and challenges 1992-2003.客观结构化临床考试中的模拟患者与标准化患者:1992 - 2003年的成就与挑战
Med Teach. 2003 May;25(3):262-70. doi: 10.1080/0142159031000100300.
10
Quality assurance methods for performance-based assessments.基于表现的评估的质量保证方法。
Adv Health Sci Educ Theory Pract. 2003;8(1):27-47. doi: 10.1023/a:1022639521218.