Babcock Ben, Hodge Kari J
The American Registry of Radiologic Technologists, Saint Paul, MN, USA.
NACE International, Houston, TX, USA.
Educ Psychol Meas. 2020 Jun;80(3):499-521. doi: 10.1177/0013164419878483. Epub 2019 Sep 30.
Equating and scaling in the context of small sample exams, such as credentialing exams for highly specialized professions, has received increased attention in recent research. Investigators have proposed a variety of both classical and Rasch-based approaches to the problem. This study attempts to extend past research by (1) directly comparing classical and Rasch techniques of equating exam scores when sample sizes are small (≤ 100 per exam form) and (2) attempting to pool multiple forms' worth of data to improve estimation in the Rasch framework. We simulated multiple years of a small-sample exam program by resampling from a larger certification exam program's real data. Results showed that combining multiple administrations' worth of data via the Rasch model can lead to more accurate equating compared to classical methods designed to work well in small samples. WINSTEPS-based Rasch methods that used multiple exam forms' data worked better than Bayesian Markov Chain Monte Carlo methods, as the prior distribution used to estimate the item difficulty parameters biased predicted scores when there were difficulty differences between exam forms.
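The core idea in aim (2), pooling several forms' data for Rasch estimation, is commonly done by concurrent calibration: responses from all forms are stacked into one sparse person-by-item matrix (with gaps for items a person never saw) and item difficulties are estimated on a single scale through the common items. The sketch below illustrates that idea under stated assumptions; it is not the authors' code. It uses a simple gradient-based joint maximum likelihood fit (WINSTEPS uses its own JMLE implementation), two hypothetical 50-item forms sharing 20 anchor items, 100 examinees per form, and a deliberate 0.3-logit ability difference between groups, mirroring the small-sample design described in the abstract.

```python
import numpy as np

rng = np.random.default_rng(7)

def rasch_prob(theta, b):
    # P(correct response) under the Rasch model for ability theta, difficulty b
    return 1.0 / (1.0 + np.exp(-(theta - b)))

def fit_rasch_jml(resp, n_iter=500, lr=0.5):
    """Pooled (concurrent) Rasch calibration via joint maximum likelihood.

    resp: persons x items response matrix; np.nan marks items a person
    was never administered (i.e., items unique to the other form).
    """
    mask = ~np.isnan(resp)
    x = np.where(mask, resp, 0.0)
    theta = np.zeros(resp.shape[0])   # person abilities
    b = np.zeros(resp.shape[1])       # item difficulties
    for _ in range(n_iter):
        p = rasch_prob(theta[:, None], b[None, :])
        resid = (x - p) * mask        # score residuals, only where administered
        theta += lr * resid.sum(axis=1) / mask.sum(axis=1)
        b -= lr * resid.sum(axis=0) / mask.sum(axis=0)
        b -= b.mean()                 # fix the scale: mean item difficulty = 0
    return theta, b

# Two 50-item forms sharing 20 anchor items (items 30-49), 100 examinees each.
n_items = 80
true_b = rng.normal(0.0, 1.0, n_items)
theta_a = rng.normal(0.0, 1.0, 100)   # Form A group
theta_b = rng.normal(0.3, 1.0, 100)   # Form B group is slightly more able

resp = np.full((200, n_items), np.nan)
resp[:100, :50] = rng.random((100, 50)) < rasch_prob(theta_a[:, None], true_b[None, :50])
resp[100:, 30:] = rng.random((100, 50)) < rasch_prob(theta_b[:, None], true_b[None, 30:])

# One concurrent run places both forms' items on a common difficulty scale,
# so scores on either form can be compared directly.
est_theta, est_b = fit_rasch_jml(resp)
recovery_r = np.corrcoef(true_b, est_b)[0, 1]
```

Because the anchor items tie the two forms together, the group ability difference is absorbed into `est_theta` rather than contaminating the item difficulties, which is the mechanism that lets the pooled Rasch approach equate forms of unequal difficulty; `recovery_r` (the correlation between true and estimated difficulties) gauges how well the small-sample calibration recovers the generating parameters.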