Andersson Björn
Collaborative Innovation Center of Assessment toward Basic Education Quality, Beijing Normal University, China.
Appl Psychol Meas. 2018 May;42(3):192-205. doi: 10.1177/0146621617721249. Epub 2017 Aug 24.
In item response theory (IRT), when two groups from different populations take two separate tests, there is a need to link the two ability scales so that the item parameters of the tests are comparable across the groups. To link the two scales, information from common items are utilized to estimate linking coefficients which place the item parameters on the same scale. For polytomous IRT models, the Haebara and Stocking-Lord methods for estimating the linking coefficients have commonly been recommended. However, estimates of the variance for these methods are not available in the literature. In this article, the asymptotic variance of linking coefficients for polytomous IRT models with the Haebara and Stocking-Lord methods are derived. The results are presented in a general form and specific results are given for the generalized partial credit model. Simulations which investigate the accuracy of the derivations under various settings of model complexity and sample size are provided, showing that the derivations are accurate under the conditions considered and that the Haebara and Stocking-Lord methods have superior performance to several moment methods with performance close to that of concurrent calibration.
在项目反应理论(IRT)中,当来自不同总体的两组人进行两项独立测试时,需要将两个能力量表联系起来,以便测试的项目参数在两组之间具有可比性。为了联系这两个量表,利用共同项目的信息来估计联系系数,这些系数将项目参数置于同一量表上。对于多分类IRT模型,通常推荐使用海原法和斯托金-洛德法来估计联系系数。然而,文献中没有这些方法的方差估计。在本文中,推导了使用海原法和斯托金-洛德法的多分类IRT模型联系系数的渐近方差。结果以一般形式呈现,并给出了广义部分计分模型的具体结果。提供了模拟,研究在各种模型复杂度和样本量设置下推导的准确性,结果表明在考虑的条件下推导是准确的,并且海原法和斯托金-洛德法比几种矩法具有更好的性能,其性能接近同时校准。