• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

在基于高风险表现的评估中检测分数漂移。

Detecting score drift in a high-stakes performance-based assessment.

作者信息

McKinley Danette W, Boulet John R

机构信息

Research and Evaluation, Educational Commission for Foreign Medical Graduates, 3624 Market Street, 4th Floor, Philadelphia, PA 19104, USA.

出版信息

Adv Health Sci Educ Theory Pract. 2004;9(1):29-38. doi: 10.1023/B:AHSE.0000012214.40340.03.

DOI:10.1023/B:AHSE.0000012214.40340.03
PMID:14739759
Abstract

Although studies have been conducted to examine the effects of a variety of factors on the comparability of scores obtained from standardized patient examinations (SPE), little research has been conducted to specifically investigate the challenge of detecting drift in case difficulty estimates over time, particularly for large-scale, performance-based, assessments. The purpose of the current study was to investigate the use of a procedure to detect drift in the difficulty estimates for a large-scale, high stakes SPE. The results of this investigation suggest that, for particular performance tasks, there was some variation in mean scores over time. These findings indicate that, although it is feasible to create a bank of case-SP means and link scores back to these fixed estimates, special attention must be paid to the standardization of exam materials over time. This is essential to ensure comparability of scores and pass-fail decisions for candidates who are assessed on multiple test forms throughout the year.

摘要

尽管已经开展了多项研究来考察各种因素对标准化患者检查(SPE)所得分数可比性的影响,但针对随着时间推移检测病例难度估计值漂移这一挑战,尤其是在大规模、基于表现的评估中,开展的研究却很少。本研究的目的是探讨一种程序在大规模、高风险SPE难度估计中检测漂移的应用。该调查结果表明,对于特定的表现任务,平均分数随时间存在一些变化。这些发现表明,虽然创建一组病例-SP均值库并将分数与这些固定估计值关联起来是可行的,但必须特别关注考试材料随时间的标准化。这对于确保全年通过多种测试形式进行评估的考生分数的可比性以及及格/不及格判定至关重要。

相似文献

1
Detecting score drift in a high-stakes performance-based assessment.在基于高风险表现的评估中检测分数漂移。
Adv Health Sci Educ Theory Pract. 2004;9(1):29-38. doi: 10.1023/B:AHSE.0000012214.40340.03.
2
Setting defensible performance standards on OSCEs and standardized patient examinations.制定客观结构化临床考试(OSCE)和标准化患者检查的合理性能标准。
Med Teach. 2003 May;25(3):245-9. doi: 10.1080/0142159031000100274.
3
The effect of task exposure on repeat candidate scores in a high-stakes standardized patient assessment.在高风险标准化患者评估中任务暴露对重复候选者分数的影响。
Teach Learn Med. 2003 Fall;15(4):227-32. doi: 10.1207/S15328015TLM1504_02.
4
The impact of repeat information on examinee performance for a large-scale standardized-patient examination.重复信息对大规模标准化病人考试考生表现的影响。
Acad Med. 2010 Sep;85(9):1506-10. doi: 10.1097/ACM.0b013e3181eadb25.
5
A work-centered approach for setting passing scores on performance-based assessments.一种基于工作的方法来设定基于表现的评估的及格分数。
Eval Health Prof. 2005 Sep;28(3):349-69. doi: 10.1177/0163278705278282.
6
The use of standardized patient assessments for certification and licensure decisions.将标准化患者评估用于认证和执照颁发决策。
Simul Healthc. 2009 Spring;4(1):35-42. doi: 10.1097/SIH.0b013e318182fc6c.
7
Evaluating construct equivalence and criterion-related validity for repeat examinees on a standardized patient examination.评估标准化患者考试中重测考生的结构等效性和效标关联效度。
Acad Med. 2011 Oct;86(10):1253-9. doi: 10.1097/ACM.0b013e31822bc0a4.
8
Simulated and standardized patients in OSCEs: achievements and challenges 1992-2003.客观结构化临床考试中的模拟患者与标准化患者:1992 - 2003年的成就与挑战
Med Teach. 2003 May;25(3):262-70. doi: 10.1080/0142159031000100300.
9
A model for setting performance standards for standardized patient examinations.一种用于设定标准化患者检查绩效标准的模型。
Eval Health Prof. 2003 Dec;26(4):427-46. doi: 10.1177/0163278703258105.
10
The use of standardised patients to assess clinical competence: does practice make perfect?使用标准化病人评估临床能力:熟能生巧吗?
Med Educ. 2006 May;40(5):444-9. doi: 10.1111/j.1365-2929.2006.02446.x.

引用本文的文献

1
A many-facet Rasch measurement model approach to investigating objective structured clinical examination item parameter drift.一种用于研究客观结构化临床考试项目参数漂移的多维度Rasch测量模型方法。
J Eval Clin Pract. 2025 Feb;31(1):e14114. doi: 10.1111/jep.14114. Epub 2024 Jul 29.
2
Trends in Classroom Observation Scores.课堂观察分数趋势
Educ Psychol Meas. 2015 Apr;75(2):311-337. doi: 10.1177/0013164414539163. Epub 2014 Jun 22.
3
"On the same page"? The effect of GP examiner feedback on differences in rating severity in clinical assessments: a pre/post intervention study.
“在同一页上”?GP 考官反馈对临床评估中评分严重程度差异的影响:一项干预前后研究。
BMC Med Educ. 2017 Jun 6;17(1):101. doi: 10.1186/s12909-017-0929-9.
4
Accuracy of portrayal by standardized patients: results from four OSCE stations conducted for high stakes examinations.标准化患者的描绘准确性:在四项高风险考试中进行的 OSCE 站的结果。
BMC Med Educ. 2014 May 19;14:97. doi: 10.1186/1472-6920-14-97.
5
Temporal stability of objective structured clinical exams: a longitudinal study employing item response theory.客观结构化临床考试的时间稳定性:采用项目反应理论的纵向研究。
BMC Med Educ. 2012 Dec 7;12:121. doi: 10.1186/1472-6920-12-121.