Kim Kyung Yong
Department of Educational Research Methodology, University of North Carolina at Greensboro, Greensboro, NC, USA.
Appl Psychol Meas. 2022 Sep;46(6):479-493. doi: 10.1177/01466216221108995. Epub 2022 Jun 17.
Applying item response theory (IRT) true score equating to multidimensional IRT models is not straightforward due to the one-to-many relationship between a true score and latent variables. Under the common-item nonequivalent groups design, the purpose of the current study was to introduce two IRT true score equating procedures that adopted different dimension reduction strategies for the bifactor model. The first procedure, which was referred to as the integration procedure, linked the latent variable scales for the bifactor model and integrated out the specific factors from the item response function of the bifactor model. Then, IRT true score equating was applied to the marginalized bifactor model. The second procedure, which was referred to as the PIRT-based procedure, projected the specific dimensions onto the general dimension to obtain a locally dependent unidimensional IRT (UIRT) model and linked the scales of the UIRT model, followed by the application of IRT true score equating to the locally dependent UIRT model. Equating results obtained with the two equating procedures along with those obtained with the unidimensional three-parameter logistic (3PL) model were compared using both simulated and real data. In general, the integration and PIRT-based procedures provided equating results that were not practically different. Furthermore, the equating results produced by the two bifactor-based procedures became more accurate than the results returned by the 3PL model as tests became more multidimensional.
由于真分数与潜在变量之间存在一对多的关系,将项目反应理论(IRT)真分数等值应用于多维IRT模型并非易事。在共同项目非等组设计下,本研究的目的是引入两种IRT真分数等值程序,它们对双因素模型采用了不同的降维策略。第一种程序称为整合程序,它将双因素模型的潜在变量量表联系起来,并从双因素模型的项目反应函数中整合出特定因素。然后,将IRT真分数等值应用于边缘化的双因素模型。第二种程序称为基于PIRT的程序,它将特定维度投影到一般维度上,以获得局部相依的单维IRT(UIRT)模型,并将UIRT模型的量表联系起来,随后将IRT真分数等值应用于局部相依的UIRT模型。使用模拟数据和真实数据比较了两种等值程序以及单维三参数逻辑斯蒂(3PL)模型获得的等值结果。一般来说,整合程序和基于PIRT的程序提供的等值结果在实际应用中没有差异。此外,随着测验维度的增加,两种基于双因素模型的程序产生的等值结果比3PL模型返回的结果更准确。