Wang Chun, Xu Gongjun
Department of Psychology, University of Minnesota, Minneapolis, Minnesota, USA.
School of Statistics, University of Minnesota, Minneapolis, Minnesota, USA.
Br J Math Stat Psychol. 2015 Nov;68(3):456-77. doi: 10.1111/bmsp.12054. Epub 2015 Apr 15.
In real testing, examinees may manifest different types of test-taking behaviours. In this paper we focus on two types that appear to be among the more frequently occurring behaviours – solution behaviour and rapid guessing behaviour. Rapid guessing usually happens in high-stakes tests when there is insufficient time, and in low-stakes tests when there is lack of effort. These two qualitatively different test-taking behaviours, if ignored, will lead to violation of the local independence assumption and, as a result, yield biased item/person parameter estimation. We propose a mixture hierarchical model to account for differences among item responses and response time patterns arising from these two behaviours. The model is also able to identify the specific behaviour an examinee engages in when answering an item. A Monte Carlo expectation maximization algorithm is proposed for model calibration. A simulation study shows that the new model yields more accurate item and person parameter estimates than a non-mixture model when the data indeed come from two types of behaviour. The model also fits real, high-stakes test data better than a non-mixture model, and therefore the new model can better identify the underlying test-taking behaviour an examinee engages in on a certain item.
在实际测试中,考生可能会表现出不同类型的应试行为。在本文中,我们关注两种似乎较为常见的行为类型——解题行为和快速猜测行为。快速猜测通常发生在高风险测试中时间不足时,以及低风险测试中缺乏努力时。这两种性质不同的应试行为,如果被忽视,将导致违反局部独立性假设,结果会产生有偏差的项目/人员参数估计。我们提出一种混合层次模型,以解释由这两种行为产生的项目反应和反应时间模式之间的差异。该模型还能够识别考生在回答一个项目时所采用的具体行为。提出了一种蒙特卡罗期望最大化算法用于模型校准。一项模拟研究表明,当数据确实来自两种行为类型时,新模型比非混合模型能产生更准确的项目和人员参数估计。该模型对真实的高风险测试数据的拟合也优于非混合模型,因此新模型能够更好地识别考生在某一项目上所采用的潜在应试行为。