• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

使用概化理论对客观结构化临床考试进行效度验证的证据

Validation Evidence using Generalizability Theory for an Objective Structured Clinical Examination.

作者信息

Peeters Michael J, Cor M Kenneth, Petite Sarah E, Schroeder Michelle N

机构信息

University of Toledo College of Pharmacy & Pharmaceutical Sciences, Toledo, OH.

University of Alberta Faculty of Pharmacy & Pharmaceutical Sciences, Edmonton, AB.

出版信息

Innov Pharm. 2021 Feb 26;12(1). doi: 10.24926/iip.v12i1.2110. eCollection 2021.

DOI:10.24926/iip.v12i1.2110
PMID:34007675
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC8102968/
Abstract

OBJECTIVES

Performance-based assessments, including objective structured clinical examinations (OSCEs), are essential learning assessments within pharmacy education. Because important educational decisions can follow from performance-based assessment results, pharmacy colleges/schools should demonstrate acceptable rigor in validation of their learning assessments. Though G-Theory has rarely been reported in pharmacy education, it would behoove pharmacy educators to, using G-Theory, produce evidence demonstrating reliability as a part of their OSCE validation process. This investigation demonstrates the use of G-Theory to describes reliability for an OSCE, as well as to show methods for enhancement of the OSCE's reliability.

INNOVATION

To evaluate practice-readiness in the semester before final-year rotations, third-year PharmD students took an OSCE. This OSCE included 14 stations over three weeks. Each week had four or five stations; one or two stations were scored by faculty-raters while three stations required students' written responses. All stations were scored 1-4. For G-Theory analyses, we used G_Strings and then mGENOVA.

CRITICAL ANALYSIS

Ninety-seven students completed the OSCE; stations were scored independently. First, univariate G-Theory design of students crossed with stations nested in weeks (p × s:w) was used. The total-score g-coefficient (reliability) for this OSCE was 0.72. Variance components for test parameters were identified. Of note, students accounted for only some OSCE score variation. Second, a multivariate G-Theory design of students crossed with stations (p × s°) was used. This further analysis revealed which week(s) were weakest for the reliability of test-scores from this learning assessment. Moreover, decision-studies showed how reliability could change depending on the number of stations each week. For a g-coefficient >0.80, seven stations per week were needed. Additionally, targets for improvements were identified.

IMPLICATIONS

In test validation, evidence of reliability is vital for the inference of generalization; G-Theory provided this for our OSCE. Results indicated that the reliability of scores was mediocre and could be improved with more stations. Revision of problematic stations could help reliability as well. Within this need for more stations, one practical insight was to administer those stations over multiple weeks/occasions (instead of all stations in one occasion).

摘要

目标

基于表现的评估,包括客观结构化临床考试(OSCE),是药学教育中至关重要的学习评估方式。由于基于表现的评估结果可能会引发重要的教育决策,药学院校应在其学习评估的验证过程中展现出可接受的严谨性。尽管G理论在药学教育中的报道很少,但药学教育工作者使用G理论来提供证据证明可靠性,作为其OSCE验证过程的一部分是很有必要的。本研究展示了如何使用G理论来描述OSCE的可靠性,以及提高OSCE可靠性的方法。

创新点

为了评估在最后一年轮转前一个学期的实践准备情况,三年级药学博士学生参加了一次OSCE。这次OSCE在三周内设有14个考站。每周有四或五个考站;一两个考站由教师评分员评分,而三个考站要求学生进行书面回答。所有考站的评分都是1 - 4分。对于G理论分析,我们使用了G_Strings,然后是mGENOVA。

批判性分析

97名学生完成了OSCE;考站由不同的人独立评分。首先,采用了学生与嵌套在周内的考站的单变量G理论设计(p×s:w)。这次OSCE的总分g系数(可靠性)为0.72。确定了测试参数的方差成分。值得注意的是,学生只占OSCE分数变化的一部分。其次,采用了学生与考站的多变量G理论设计(p×s°)。进一步的分析揭示了从这次学习评估中测试分数可靠性最差的是哪一周。此外,决策研究表明可靠性如何根据每周考站的数量而变化。为了使g系数>0.80,每周需要七个考站。此外,还确定了改进的目标。

启示

在测试验证中,可靠性证据对于推广性推断至关重要;G理论为我们的OSCE提供了这一证据。结果表明分数的可靠性一般,可以通过增加考站数量来提高。对有问题的考站进行修订也有助于提高可靠性。在需要更多考站的情况下,一个实际的见解是在多个周/场次进行这些考站的测试(而不是在一次考试中设置所有考站)。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ec31/8102968/7d395ecb5f82/21550417-12-01-2110-g1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ec31/8102968/7d395ecb5f82/21550417-12-01-2110-g1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ec31/8102968/7d395ecb5f82/21550417-12-01-2110-g1.jpg

相似文献

1
Validation Evidence using Generalizability Theory for an Objective Structured Clinical Examination.使用概化理论对客观结构化临床考试进行效度验证的证据
Innov Pharm. 2021 Feb 26;12(1). doi: 10.24926/iip.v12i1.2110. eCollection 2021.
2
On the reliability of a dental OSCE, using SEM: effect of different days.使用结构方程模型评估口腔医学客观结构化临床考试的可靠性:不同日期的影响
Eur J Dent Educ. 2008 Aug;12(3):131-7. doi: 10.1111/j.1600-0579.2008.00507.x.
3
Effect of clinically discriminating, evidence-based checklist items on the reliability of scores from an Internal Medicine residency OSCE.临床鉴别性、基于证据的检查表项目对内科住院医师客观结构化临床考试分数可靠性的影响。
Adv Health Sci Educ Theory Pract. 2014 Oct;19(4):497-506. doi: 10.1007/s10459-013-9482-4. Epub 2014 Jan 22.
4
Item analysis to improve reliability for an internal medicine undergraduate OSCE.项目分析以提高内科本科客观结构化临床考试的可靠性。
Adv Health Sci Educ Theory Pract. 2005;10(2):105-13. doi: 10.1007/s10459-005-2315-3.
5
Evaluating the Role of Objective Structured Clinical Examination as a Summative Assessment Tool in Undergraduate and Postgraduate Psychiatry Residents.评估客观结构化临床考试作为本科及研究生精神科住院医师总结性评估工具的作用。
Cureus. 2024 Aug 23;16(8):e67640. doi: 10.7759/cureus.67640. eCollection 2024 Aug.
6
Development and evaluation of a spiral model of assessing EBM competency using OSCEs in undergraduate medical education.运用 OSCE 评估本科医学教育中循证医学能力的螺旋式模型的开发与评估。
BMC Med Educ. 2021 Apr 10;21(1):204. doi: 10.1186/s12909-021-02650-7.
7
Reliability analysis of the objective structured clinical examination using generalizability theory.基于概化理论的客观结构化临床考试信度分析
Med Educ Online. 2016 Aug 18;21:31650. doi: 10.3402/meo.v21.31650. eCollection 2016.
8
Assessment of first-year veterinary students' clinical skills using objective structured clinical examinations.使用客观结构化临床考试评估一年级兽医学生的临床技能。
J Vet Med Educ. 2010 Winter;37(4):395-402. doi: 10.3138/jvme.37.4.395.
9
Malaysian pharmacy students' assessment of an objective structured clinical examination (OSCE).马来西亚药学专业学生对客观结构化临床考试(OSCE)的评估。
Am J Pharm Educ. 2010 Mar 10;74(2):34. doi: 10.5688/aj740234.
10
Objective structured clinical examinations in doctor of pharmacy programs in the United States.美国药学博士项目中的客观结构化临床考试。
Am J Pharm Educ. 2010 Oct 11;74(8):148. doi: 10.5688/aj7408148.

引用本文的文献

1
Applying generalized theory to optimize the quality of high-stakes objective structured clinical examinations for undergraduate medical students: experience from the French medical school.应用通用理论优化本科医学生高风险客观结构化临床考试的质量:来自法国医学院的经验
BMC Med Educ. 2025 May 2;25(1):643. doi: 10.1186/s12909-025-07255-y.
2
Development of peer assessment rubrics in simulation-based learning for advanced cardiac life support skills among medical students.医学生高级心脏生命支持技能模拟学习中同伴评估量表的开发。
Adv Simul (Lond). 2024 Jun 24;9(1):25. doi: 10.1186/s41077-024-00301-7.
3
Moving beyond Cronbach's Alpha and Inter-Rater Reliability: A Primer on Generalizability Theory for Pharmacy Education.

本文引用的文献

1
Moving beyond Cronbach's Alpha and Inter-Rater Reliability: A Primer on Generalizability Theory for Pharmacy Education.超越克朗巴哈系数和评分者间信度:药学教育概化理论入门
Innov Pharm. 2021 Feb 26;12(1). doi: 10.24926/iip.v12i1.2131. eCollection 2021.
2
Guidance for high-stakes testing within pharmacy educational assessment.药学教育评估中高风险测试指南。
Curr Pharm Teach Learn. 2020 Jan;12(1):1-4. doi: 10.1016/j.cptl.2019.10.001. Epub 2019 Nov 22.
3
Validation of learning assessments: A primer.学习评估的验证:入门指南。
超越克朗巴哈系数和评分者间信度:药学教育概化理论入门
Innov Pharm. 2021 Feb 26;12(1). doi: 10.24926/iip.v12i1.2131. eCollection 2021.
Curr Pharm Teach Learn. 2017 Sep;9(5):925-933. doi: 10.1016/j.cptl.2017.06.001. Epub 2017 Jul 29.
4
The assessment of professional competence: Developments, research and practical implications.专业能力评估:发展、研究与实际影响。
Adv Health Sci Educ Theory Pract. 1996 Jan;1(1):41-67. doi: 10.1007/BF00596229.
5
A systematic review of the reliability of objective structured clinical examination scores.客观结构化临床考试成绩可靠性的系统评价。
Med Educ. 2011 Dec;45(12):1181-9. doi: 10.1111/j.1365-2923.2011.04075.x. Epub 2011 Oct 11.
6
Techniques for measuring clinical competence: objective structured clinical examinations.临床能力测量技术:客观结构化临床考试
Med Educ. 2004 Feb;38(2):199-203. doi: 10.1111/j.1365-2923.2004.01755.x.