Kohli Nidhi, Koran Jennifer, Henn Lisa
University of Minnesota, Minneapolis, MN, USA.
Southern Illinois University, Carbondale, IL, USA.
Educ Psychol Meas. 2015 Jun;75(3):389-405. doi: 10.1177/0013164414559071. Epub 2014 Nov 20.
There are well-defined theoretical differences between the classical test theory (CTT) and item response theory (IRT) frameworks. It is understood that in the CTT framework, person and item statistics are test- and sample-dependent. This is not the perception with IRT. For this reason, the IRT framework is considered to be theoretically superior to the CTT framework for the purpose of estimating person and item parameters. In previous simulation studies, IRT models were used both as generating and as fitting models. Hence, results favoring the IRT framework could be attributed to IRT being the data-generation framework. Moreover, previous studies only considered the traditional CTT framework for the comparison, yet there is considerable literature suggesting that it may be more appropriate to use CTT statistics based on an underlying normal variable (UNV) assumption. The current study relates the class of CTT-based models with the UNV assumption to that of IRT, using confirmatory factor analysis to delineate the connections. A small Monte Carlo study was carried out to assess the comparability between the item and person statistics obtained from the frameworks of IRT and CTT with UNV assumption. Results show the frameworks of IRT and CTT with UNV assumption to be quite comparable, with neither framework showing an advantage over the other.
经典测验理论(CTT)和项目反应理论(IRT)框架之间存在明确的理论差异。据了解,在CTT框架中,个体和项目统计数据依赖于测验和样本。而IRT并非如此。因此,就估计个体和项目参数而言,IRT框架在理论上被认为优于CTT框架。在以往的模拟研究中,IRT模型既被用作生成模型,也被用作拟合模型。因此,支持IRT框架的结果可能归因于IRT是数据生成框架。此外,以往的研究在比较时仅考虑了传统的CTT框架,但有大量文献表明,基于潜在正态变量(UNV)假设使用CTT统计数据可能更合适。本研究使用验证性因素分析来描述基于UNV假设的CTT模型类别与IRT模型类别之间的联系。进行了一项小型蒙特卡罗研究,以评估从IRT框架和基于UNV假设的CTT框架获得的项目和个体统计数据之间的可比性。结果表明,基于UNV假设的IRT框架和CTT框架具有相当的可比性,两个框架均未显示出优于对方的优势。