Finch W Holmes
Ball State University, Muncie, IN, USA.
Educ Psychol Meas. 2023 Oct;83(5):929-952. doi: 10.1177/00131644221111993. Epub 2022 Jul 21.
Psychometricians have devoted much research and attention to categorical item responses, leading to the development and widespread use of item response theory for the estimation of model parameters and identification of items that do not perform in the same way for examinees from different population subgroups (i.e., differential item functioning [DIF]). With the increasing use of computer-based measurement, items with a continuous response modality are becoming more common. Models for use with these items have been developed and refined in recent years, but less attention has been devoted to investigating DIF for these continuous response models (CRMs). Therefore, the purpose of this simulation study was to compare the performance of three potential methods for assessing DIF for CRMs: regression, the MIMIC model, and factor invariance testing. Study results revealed that the MIMIC model provided a combination of Type I error control and relatively high power for detecting DIF. Implications of these findings are discussed.
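The abstract names regression as one of the three DIF methods but does not say how it was specified in the study. As an illustration only, the following is a minimal sketch of one common regression formulation for a continuous item: the item response is regressed on a matching score, a group indicator, and their interaction, where the group coefficient reflects uniform DIF and the interaction coefficient reflects nonuniform DIF. The function name `regression_dif`, the simulated data, the use of the mean of the remaining items as the matching score, and all parameter values are assumptions made for this sketch, not details taken from the article.

```python
import numpy as np


def regression_dif(item, matching, group):
    """Fit item ~ 1 + matching + group + matching*group by ordinary least squares.

    Hypothetical helper for illustration. Returns the coefficients, their
    t statistics, and the residual degrees of freedom. Index 2 is the group
    term (uniform DIF); index 3 is the interaction term (nonuniform DIF).
    """
    item = np.asarray(item, dtype=float)
    matching = np.asarray(matching, dtype=float)
    group = np.asarray(group, dtype=float)

    # Design matrix: intercept, matching score, group, interaction.
    X = np.column_stack([np.ones_like(item), matching, group, matching * group])
    beta, *_ = np.linalg.lstsq(X, item, rcond=None)

    # Classical OLS standard errors from the residual variance.
    resid = item - X @ beta
    dof = len(item) - X.shape[1]
    sigma2 = resid @ resid / dof
    cov = sigma2 * np.linalg.inv(X.T @ X)
    t = beta / np.sqrt(np.diag(cov))
    return {"beta": beta, "t": t, "dof": dof}


# Simulated example (all values are assumptions chosen for the sketch).
rng = np.random.default_rng(0)
n = 2000
group = rng.integers(0, 2, size=n)   # 0 = reference group, 1 = focal group
theta = rng.normal(size=n)           # latent trait, same distribution in both groups

# Five DIF-free items whose mean serves as the matching score.
anchors = 0.8 * theta[:, None] + rng.normal(scale=0.6, size=(n, 5))
matching = anchors.mean(axis=1)

# Studied item with a uniform DIF shift of 0.5 for the focal group.
studied = 0.8 * theta + 0.5 * group + rng.normal(scale=0.6, size=n)

result = regression_dif(studied, matching, group)
print("group coefficient:", result["beta"][2], "t:", result["t"][2])
print("interaction coefficient:", result["beta"][3], "t:", result["t"][3])
```

In this sketch a large t statistic on the group term would flag the studied item for uniform DIF. The MIMIC and factor invariance approaches compared in the study instead model the latent trait directly, which requires a structural equation modeling package and is not shown here.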