Department of Statistical Science, The Graduate University for Advanced Studies, SOKENDAI, 10-3 Midori-cho, Tachikawa, Tokyo, 190-8562, Japan.
The Institute of Statistical Mathematics, Tachikawa, Japan.
Psychometrika. 2023 Jun;88(2):487-526. doi: 10.1007/s11336-023-09903-y. Epub 2023 Mar 6.
Multidimensional item response theory (MIRT) is a statistical test theory that precisely estimates multiple latent skills of learners from the responses in a test. Both compensatory and non-compensatory models have been proposed for MIRT: the former assumes that each skill can complement other skills, whereas the latter assumes they cannot. This non-compensatory assumption is convincing in many tests that measure multiple skills; therefore, applying non-compensatory models to such data is crucial for achieving unbiased and accurate estimation. In contrast to tests, latent skills will change over time in daily learning. To monitor the growth of skills, dynamical extensions of MIRT models have been investigated. However, most of them assumed compensatory models, and a model that can reproduce continuous latent states of skills under the non-compensatory assumption has not been proposed thus far. To enable accurate skill tracing under the non-compensatory assumption, we propose a dynamical extension of non-compensatory MIRT models by combining a linear dynamical system and a non-compensatory model. This results in a complicated posterior of skills, which we approximate with a Gaussian distribution by minimizing the Kullback-Leibler divergence between the approximated posterior and the true posterior. The learning algorithm for the model parameters is derived through Monte Carlo expectation maximization. Simulation studies verify that the proposed method is able to reproduce latent skills accurately, whereas the dynamical compensatory model suffers from significant underestimation errors. Furthermore, experiments on an actual data set demonstrate that our dynamical non-compensatory model can infer practical skill tracing and clarify differences in skill tracing between non-compensatory and compensatory models.
多维项目反应理论(MIRT)是一种统计测试理论,它可以从测试中的反应中精确估计学习者的多个潜在技能。已经提出了补偿和非补偿模型用于 MIRT:前者假设每个技能可以补充其他技能,而后者假设它们不能。在许多衡量多项技能的测试中,这种非补偿假设是令人信服的;因此,将非补偿模型应用于此类数据对于实现无偏和准确的估计至关重要。与测试不同,在日常学习中,潜在技能会随时间变化。为了监测技能的增长,已经研究了 MIRT 模型的动态扩展。然而,它们大多数都假设了补偿模型,而且到目前为止,还没有提出可以在非补偿假设下再现连续潜在技能状态的模型。为了在非补偿假设下实现准确的技能跟踪,我们通过将线性动力系统和非补偿模型相结合,提出了一种非补偿 MIRT 模型的动态扩展。这导致了技能的复杂后验,我们通过最小化近似后验和真实后验之间的 Kullback-Leibler 散度来用高斯分布来近似该后验。模型参数的学习算法是通过蒙特卡罗期望最大化推导出来的。模拟研究验证了所提出的方法能够准确地再现潜在技能,而动态补偿模型则存在显著的低估误差。此外,在实际数据集上的实验表明,我们的动态非补偿模型可以推断出实际的技能跟踪,并澄清非补偿和补偿模型之间的技能跟踪差异。