• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

探索拉施模型在多站点和多年评估的“通用内容”项目中的应用。

Exploring the use of Rasch modelling in "common content" items for multi-site and multi-year assessment.

作者信息

Hope David, Kluth David, Homer Matthew, Dewar Avril, Goddard-Fuller Rikki, Jaap Alan, Cameron Helen

机构信息

Medical Education Unit, The Chancellor's Building, College of Medicine and Veterinary Medicine, The University of Edinburgh, 49 Little France Crescent, Edinburgh, Scotland, EH16 4SB, UK.

Leeds Institute of Medical Education, Leeds School of Medicine, Worsley Building, University of Leeds, Woodhouse, Leeds, LS2 9JT, UK.

出版信息

Adv Health Sci Educ Theory Pract. 2025 Apr;30(2):427-438. doi: 10.1007/s10459-024-10354-y. Epub 2024 Jul 8.

DOI:10.1007/s10459-024-10354-y
PMID:38977526
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11965148/
Abstract

Rasch modelling is a powerful tool for evaluating item performance, measuring drift in difficulty over time, and comparing students who sat assessments at different times or at different sites. Here, we use data from thirty UK medical schools to describe the benefits of Rasch modelling in quality assurance and the barriers to using it. Sixty "common content" multiple choice items were offered to all UK medical schools in 2016-17, and a further sixty in 2017-18, with five available in both years. Thirty medical schools participated, for sixty total datasets across two sessions, and 14,342 individual sittings. Schools selected items to embed in written assessment near the end of their programmes. We applied Rasch modelling to evaluate unidimensionality, model fit statistics and item quality, horizontal equating to compare performance across schools, and vertical equating to compare item performance across time. Of the sixty sittings, three provided non-unidimensional data, and eight violated goodness of fit measures. Item-level statistics identified potential improvements in item construction and provided quality assurance. Horizontal equating demonstrated large differences in scores across schools, while vertical equating showed item characteristics were stable across sessions. Rasch modelling provides significant advantages in model- and item- level reporting compared to classical approaches. However, the complexity of the analysis and the smaller number of educators familiar with Rasch must be addressed locally for a programme to benefit. Furthermore, due to the comparative novelty of Rasch modelling, there is greater ambiguity on how to proceed when a Rasch model identifies misfitting or problematic data.

摘要

拉施模型是评估试题表现、衡量难度随时间变化的漂移情况以及比较在不同时间或不同地点参加评估的学生的有力工具。在此,我们使用来自英国30所医学院的数据来描述拉施模型在质量保证方面的益处以及使用该模型的障碍。2016 - 2017年向所有英国医学院提供了60道“通用内容”多项选择题,2017 - 2018年又提供了另外60道,其中有5道在两年中都有。30所医学院参与其中,在两个阶段共产生60个数据集,以及14342次个人考试。学校在其课程接近尾声时选择试题嵌入书面评估中。我们应用拉施模型来评估单维性、模型拟合统计量和试题质量,通过水平等值来比较各学校间的表现,通过垂直等值来比较不同时间的试题表现。在这60次考试中,有3次提供了非单维数据,8次违反了拟合优度指标。试题层面的统计数据指出了试题编制方面潜在的改进之处,并提供了质量保证。水平等值显示各学校间的分数存在很大差异,而垂直等值表明各阶段的试题特征是稳定的。与传统方法相比时,拉施模型在模型和试题层面的报告方面具有显著优势。然而,分析的复杂性以及熟悉拉施模型的教育工作者数量较少,必须在当地加以解决,以便一个项目能够从中受益。此外,由于拉施模型相对新颖,当拉施模型识别出不拟合或有问题的数据时,对于如何继续进行存在更大的模糊性。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a368/11965148/e45743bef978/10459_2024_10354_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a368/11965148/d7b251cde395/10459_2024_10354_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a368/11965148/74329515e244/10459_2024_10354_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a368/11965148/0cc77e9bfeea/10459_2024_10354_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a368/11965148/e45743bef978/10459_2024_10354_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a368/11965148/d7b251cde395/10459_2024_10354_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a368/11965148/74329515e244/10459_2024_10354_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a368/11965148/0cc77e9bfeea/10459_2024_10354_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a368/11965148/e45743bef978/10459_2024_10354_Fig4_HTML.jpg

相似文献

1
Exploring the use of Rasch modelling in "common content" items for multi-site and multi-year assessment.探索拉施模型在多站点和多年评估的“通用内容”项目中的应用。
Adv Health Sci Educ Theory Pract. 2025 Apr;30(2):427-438. doi: 10.1007/s10459-024-10354-y. Epub 2024 Jul 8.
2
Comparison of proficiency in an anesthesiology course across distinct medical student cohorts: psychometric approaches to test equating.不同医学学生群体在麻醉学课程中的熟练程度比较:用于测试等值性的心理测量方法。
J Chin Med Assoc. 2014 Mar;77(3):150-4. doi: 10.1016/j.jcma.2013.10.011. Epub 2013 Nov 28.
3
Establishing the HLS-Q12 short version of the European Health Literacy Survey Questionnaire: latent trait analyses applying Rasch modelling and confirmatory factor analysis.建立欧洲健康素养调查问卷的HLS-Q12简版:应用拉施模型和验证性因素分析的潜在特质分析
BMC Health Serv Res. 2018 Jun 28;18(1):506. doi: 10.1186/s12913-018-3275-7.
4
Application of Rasch Analysis to the Evaluation of the Measurement Properties of the Hearing Handicap Inventory for the Elderly.应用 Rasch 分析评价老年听力障碍量表的测量性能。
Ear Hear. 2020 Sep/Oct;41(5):1125-1134. doi: 10.1097/AUD.0000000000000832.
5
Using the dichotomous Rasch model to analyze polytomous items.使用二分法Rasch模型分析多分类项目。
J Appl Meas. 2013;14(1):44-56.
6
Variation in performance on common content items at UK medical schools.英国医学院校常见内容项目表现的差异。
BMC Med Educ. 2021 Jun 5;21(1):323. doi: 10.1186/s12909-021-02761-1.
7
The Foot Posture Index: Rasch analysis of a novel, foot-specific outcome measure.足部姿势指数:一种新型足部特异性结局指标的拉施分析
Arch Phys Med Rehabil. 2007 Jan;88(1):88-93. doi: 10.1016/j.apmr.2006.10.005.
8
Rasch fit statistics as a test of the invariance of item parameter estimates.拉施拟合统计作为项目参数估计不变性的一种检验。
J Appl Meas. 2003;4(2):153-63.
9
Measuring disability in Medicare home care patients: application of Rasch modeling to the outcome and assessment information set.衡量医疗保险家庭护理患者的残疾情况:拉施模型在结局与评估信息集中的应用
Med Care. 2003 May;41(5):601-15. doi: 10.1097/01.MLR.0000062553.63745.7A.
10
Using item response theory to explore the psychometric properties of extended matching questions examination in undergraduate medical education.运用项目反应理论探索本科医学教育中扩展匹配题考试的心理测量特性。
BMC Med Educ. 2005 Mar 7;5(1):9. doi: 10.1186/1472-6920-5-9.

本文引用的文献

1
Variation in performance on common content items at UK medical schools.英国医学院校常见内容项目表现的差异。
BMC Med Educ. 2021 Jun 5;21(1):323. doi: 10.1186/s12909-021-02761-1.
2
Comparison of the validity of bookmark and Angoff standard setting methods in medical performance tests.书签法和 Angoff 标准设定法在医学绩效测试中有效性的比较。
BMC Med Educ. 2021 Jan 2;21(1):1. doi: 10.1186/s12909-020-02436-3.
3
Re-conceptualising and accounting for examiner (cut-score) stringency in a 'high frequency, small cohort' performance test.
在“高频、小群体”表现测试中重新概念化并解释主考人(cut-score)严格程度。
Adv Health Sci Educ Theory Pract. 2021 May;26(2):369-383. doi: 10.1007/s10459-020-09990-x. Epub 2020 Sep 2.
4
Exploring differences in individual and group judgements in standard setting.探讨标准制定中个体和群体判断的差异。
Med Educ. 2019 Sep;53(9):941-952. doi: 10.1111/medu.13915. Epub 2019 Jul 2.
5
Using differential item functioning to evaluate potential bias in a high stakes postgraduate knowledge based assessment.使用差异项目功能评估高风险研究生基于知识的评估中的潜在偏差。
BMC Med Educ. 2018 Apr 3;18(1):64. doi: 10.1186/s12909-018-1143-0.
6
Exploring the Relationships Between USMLE Performance and Disciplinary Action in Practice: A Validity Study of Score Inferences From a Licensure Examination.探索美国医师执照考试成绩与实际执业中纪律处分之间的关系:一项关于执照考试分数推断的效度研究。
Acad Med. 2017 Dec;92(12):1780-1785. doi: 10.1097/ACM.0000000000001747.
7
Variation in passing standards for graduation-level knowledge items at UK medical schools.英国医学院毕业水平知识项目通过标准的差异。
Med Educ. 2017 Jun;51(6):612-620. doi: 10.1111/medu.13240. Epub 2017 Mar 13.
8
Setting standards in knowledge assessments: Comparing Ebel and Cohen via Rasch.
Med Teach. 2016 Dec;38(12):1267-1277. doi: 10.1080/0142159X.2016.1230184. Epub 2016 Sep 20.
9
UKMLA: American dream or nightmare?英国医学执照评估:美国梦还是噩梦?
Med Teach. 2016;38(3):320. doi: 10.3109/0142159X.2015.1105948. Epub 2015 Nov 11.
10
Implementing statistical equating for MRCP(UK) Parts 1 and 2.为英国皇家内科医学院会员考试第1部分和第2部分实施统计等值法。
BMC Med Educ. 2014 Sep 26;14:204. doi: 10.1186/1472-6920-14-204.