Rios Joseph A
Educational Psychology, University of Minnesota, Minneapolis, MN, USA.
Appl Psychol Meas. 2022 May;46(3):236-249. doi: 10.1177/01466216221084371. Epub 2022 Apr 4.
Rapid guessing (RG) behavior can undermine measurement property and score-based inferences. To mitigate this potential bias, practitioners have relied on response time information to identify and filter RG responses. However, response times may be unavailable in many testing contexts, such as paper-and-pencil administrations. When this is the case, self-report measures of effort and person-fit statistics have been used. These methods are limited in that inferences concerning motivation and aberrant responding are made at the examinee level. As test takers can engage in a mixture of solution and RG behavior throughout a test administration, there is a need to limit the influence of potential aberrant responses at the item level. This can be done by employing robust estimation procedures. Since these estimators have received limited attention in the RG literature, the objective of this simulation study was to evaluate ability parameter estimation accuracy in the presence of RG by comparing maximum likelihood estimation (MLE) to two robust variants, the bisquare and Huber estimators. Two RG conditions were manipulated, RG percentage (10%, 20%, and 40%) and pattern (difficulty-based and changing state). Contrasted to the MLE procedure, results demonstrated that both the bisquare and Huber estimators reduced bias in ability parameter estimates by as much as 94%. Given that the Huber estimator showed smaller standard deviations of error and performed equally as well as the bisquare approach under most conditions, it is recommended as a promising approach to mitigating bias from RG when response time information is unavailable.
快速猜测(RG)行为可能会破坏测量属性和基于分数的推断。为了减轻这种潜在偏差,从业者依赖于反应时间信息来识别和筛选RG反应。然而,在许多测试情境中,如纸笔测试中,反应时间可能无法获取。在这种情况下,人们使用了努力程度的自我报告测量方法和个体拟合统计方法。这些方法的局限性在于,关于动机和异常反应的推断是在考生层面进行的。由于考生在整个测试过程中可能会同时采用解答和RG行为,因此有必要在题目层面限制潜在异常反应的影响。这可以通过采用稳健估计程序来实现。由于这些估计器在RG文献中受到的关注有限,本模拟研究的目的是通过将最大似然估计(MLE)与两种稳健变体——双平方估计器和休伯估计器进行比较,来评估在存在RG的情况下能力参数估计的准确性。操纵了两种RG条件,RG百分比(10%、20%和40%)和模式(基于难度和变化状态)。与MLE程序相比,结果表明双平方估计器和休伯估计器都将能力参数估计中的偏差降低了多达94%。鉴于休伯估计器在大多数情况下显示出较小的误差标准差,并且表现与双平方方法相当,因此建议在无法获取反应时间信息时,将其作为减轻RG偏差的一种有前景的方法。