单独IRT校准的多重等值

Multiple Equating of Separate IRT Calibrations.

作者信息

Battauz Michela

机构信息

Department of Economics and Statistics, University of Udine, Udine, Italy.

出版信息

Psychometrika. 2016 Oct 3. doi: 10.1007/s11336-016-9517-x.

DOI:10.1007/s11336-016-9517-x

PMID:27699559

Abstract

When test forms are calibrated separately, item response theory parameters are not comparable because they are expressed on different measurement scales. The equating process includes the conversion of item parameter estimates on a common scale and the determination of comparable test scores. Various statistical methods have been proposed to perform equating between two test forms. This paper provides a generalization to multiple test forms of the mean-geometric mean, the mean-mean, the Haebara, and the Stocking-Lord methods. The proposed methods estimate simultaneously the equating coefficients that permit the scale transformation of the parameters of all forms to the scale of the base form. Asymptotic standard errors of the equating coefficients are derived. A simulation study is presented to illustrate the performance of the methods.

摘要

当分别校准测试形式时，项目反应理论参数不可比，因为它们是在不同的测量尺度上表示的。等值化过程包括将项目参数估计值转换到一个共同的尺度上，并确定可比的测试分数。已经提出了各种统计方法来在两种测试形式之间进行等值化。本文将均值 - 几何均值法、均值 - 均值法、Haebara法和Stocking - Lord法推广到多种测试形式。所提出的方法同时估计等值化系数，这些系数允许将所有形式的参数尺度转换为基础形式的尺度。推导了等值化系数的渐近标准误差。进行了一项模拟研究以说明这些方法的性能。

相似文献

Multiple Equating of Separate IRT Calibrations.

Psychometrika. 2016 Oct 3. doi: 10.1007/s11336-016-9517-x.

Asymptotic Variance of Linking Coefficient Estimators for Polytomous IRT Models.

Appl Psychol Meas. 2018 May;42(3):192-205. doi: 10.1177/0146621617721249. Epub 2017 Aug 24.

Evaluating Different Equating Setups in the Continuous Item Pool Calibration for Computerized Adaptive Testing.

Front Psychol. 2019 Jun 6;10:1277. doi: 10.3389/fpsyg.2019.01277. eCollection 2019.

Reading Comprehension Tests for Children: Test Equating and Specific Age-Interval Reports.

Front Psychol. 2021 Sep 10;12:662192. doi: 10.3389/fpsyg.2021.662192. eCollection 2021.

Asymptotic Standard Errors of Generalized Partial Credit Model True Score Equating Using Characteristic Curve Methods.

Appl Psychol Meas. 2021 Jul;45(5):331-345. doi: 10.1177/01466216211013101. Epub 2021 May 12.

Practical Consequences of Item Response Theory Model Misfit in the Context of Test Equating with Mixed-Format Test Data.

Front Psychol. 2017 Apr 4;8:484. doi: 10.3389/fpsyg.2017.00484. eCollection 2017.

A Likelihood Approach to Item Response Theory Equating of Multiple Forms.

Appl Psychol Meas. 2023 May;47(3):200-220. doi: 10.1177/01466216231151702. Epub 2023 Jan 24.

New Robust Scale Transformation Methods in the Presence of Outlying Common Items.

Appl Psychol Meas. 2015 Nov;39(8):613-626. doi: 10.1177/0146621615587003. Epub 2015 May 18.

IRT test equating in complex linkage plans.

Psychometrika. 2013 Jul;78(3):464-80. doi: 10.1007/s11336-012-9316-y. Epub 2013 Jan 4.

Item Response Theory Observed-Score Kernel Equating.

Psychometrika. 2017 Mar;82(1):48-66. doi: 10.1007/s11336-016-9528-7. Epub 2016 Oct 14.

引用本文的文献

What Affects the Quality of Score Transformations? Potential Issues in True-Score Equating Using the Partial Credit Model.

Educ Psychol Meas. 2023 Dec;83(6):1249-1290. doi: 10.1177/00131644221143051. Epub 2023 Jan 13.

A Likelihood Approach to Item Response Theory Equating of Multiple Forms.

Appl Psychol Meas. 2023 May;47(3):200-220. doi: 10.1177/01466216231151702. Epub 2023 Jan 24.

Development and Validation of the Open Matrices Item Bank.

J Intell. 2022 Jul 13;10(3):41. doi: 10.3390/jintelligence10030041.

On the Treatment of Missing Item Responses in Educational Large-Scale Assessment Data: An Illustrative Simulation Study and a Case Study Using PISA 2018 Mathematics Data.

Eur J Investig Health Psychol Educ. 2021 Dec 14;11(4):1653-1687. doi: 10.3390/ejihpe11040117.

本文引用的文献

IRT test equating in complex linkage plans.

Psychometrika. 2013 Jul;78(3):464-80. doi: 10.1007/s11336-012-9316-y. Epub 2013 Jan 4.

Harmonic regression and scale stability.

Psychometrika. 2013 Oct;78(4):815-29. doi: 10.1007/s11336-013-9337-1. Epub 2013 Apr 20.

On mean-sigma estimators and bias.

Br J Math Stat Psychol. 2013 May;66(2):277-89. doi: 10.1111/j.2044-8317.2012.02048.x. Epub 2012 May 9.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

单独IRT校准的多重等值

Multiple Equating of Separate IRT Calibrations.

作者信息

机构信息

出版信息

相似文献

引用本文的文献

本文引用的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

相似文献

引用本文的文献

本文引用的文献