Hagell Peter, Westergren Albert
Peter Hagell, School of Health and Society, Kristianstad University, SE-291 88 Kristianstad, Sweden,
J Appl Meas. 2016;17(4):416-431.
Sample size is a major factor in statistical null hypothesis testing, which is the basis for many approaches to testing Rasch model fit. Few sample size recommendations for testing fit to the Rasch model concern the Rasch Unidimensional Measurement Models (RUMM) software, which features chi-square and ANOVA/F-ratio based fit statistics, including Bonferroni and algebraic sample size adjustments. This paper explores the occurrence of Type I errors with RUMM fit statistics, and the effects of algebraic sample size adjustments. Data with simulated Rasch model fitting 25-item dichotomous scales and sample sizes ranging from N = 50 to N = 2500 were analysed with and without algebraically adjusted sample sizes. Results suggest the occurrence of Type I errors with N less then or equal to 500, and that Bonferroni correction as well as downward algebraic sample size adjustment are useful to avoid such errors, whereas upward adjustment of smaller samples falsely signal misfit. Our observations suggest that sample sizes around N = 250 to N = 500 may provide a good balance for the statistical interpretation of the RUMM fit statistics studied here with respect to Type I errors and under the assumption of Rasch model fit within the examined frame of reference (i.e., about 25 item parameters well targeted to the sample).
样本量是统计零假设检验中的一个主要因素,而零假设检验是许多检验Rasch模型拟合度方法的基础。很少有关于检验Rasch模型拟合度的样本量建议涉及Rasch单维测量模型(RUMM)软件,该软件具有基于卡方和方差分析/F比率的拟合统计量,包括Bonferroni校正和代数样本量调整。本文探讨了RUMM拟合统计量中I型错误的发生情况以及代数样本量调整的影响。对模拟Rasch模型拟合25项二分制量表且样本量从N = 50到N = 2500的数据进行了分析,分析时采用了代数调整样本量和未采用代数调整样本量两种情况。结果表明,当N小于或等于500时会出现I型错误,并且Bonferroni校正以及向下的代数样本量调整有助于避免此类错误,而对较小样本进行向上调整会错误地显示不拟合。我们的观察结果表明,在本文研究的RUMM拟合统计量的统计解释方面,就I型错误而言,并且在所研究的参考框架内假设Rasch模型拟合(即大约25个项目参数与样本匹配良好)的情况下,样本量在N = 250到N = 500之间可能会达到较好的平衡。