Finch W Holmes, Demir Cihan, French Brian F, Vo Thao
Ball State University, Muncie, IN, USA.
Washington State University, Pullman, WA, USA.
Appl Psychol Meas. 2025 Mar 24:01466216251325644. doi: 10.1177/01466216251325644.
Applied and simulation studies document model convergence and accuracy issues in differential item functioning detection with multilevel models, hindering detection. This study aimed to evaluate the effectiveness of various estimation techniques in addressing these issues and ensure robust DIF detection. We conducted a simulation study to investigate the performance of multilevel logistic regression models with predictors at level 2 across different estimation procedures, including maximum likelihood estimation (MLE), Bayesian estimation, and generalized estimating equations (GEE). The simulation results demonstrated that all maintained control over the Type I error rate across conditions. In most cases, GEE had comparable or higher power compared to MLE for identifying DIF, with Bayes having the lowest power. When potentially important covariates at levels-1 and 2 were included in the model, power for all methods was higher. These results suggest that in many cases where multilevel logistic regression is used for DIF detection, GEE offers a viable option for researchers and that including important contextual variables at all levels of the data is desirable. Implications for practice are discussed.
应用研究和模拟研究记录了在使用多级模型进行差异项目功能检测时模型收敛和准确性问题,这阻碍了检测。本研究旨在评估各种估计技术在解决这些问题方面的有效性,并确保稳健的差异项目功能(DIF)检测。我们进行了一项模拟研究,以调查具有二级预测变量的多级逻辑回归模型在不同估计程序下的性能,包括最大似然估计(MLE)、贝叶斯估计和广义估计方程(GEE)。模拟结果表明,所有方法在各种条件下都能控制第一类错误率。在大多数情况下,与MLE相比,GEE在识别DIF方面具有相当或更高的功效,而贝叶斯方法的功效最低。当模型中纳入一级和二级潜在重要协变量时,所有方法的功效都更高。这些结果表明,在许多使用多级逻辑回归进行DIF检测的情况下,GEE为研究人员提供了一个可行的选择,并且在数据的所有级别纳入重要的背景变量是可取的。文中还讨论了对实践的启示。