Huo Yan, de la Torre Jimmy, Mun Eun-Young, Kim Su-Young, Ray Anne E, Jiao Yang, White Helene R
Graduate School of Education, Rutgers, The State University of New Jersey, New Brunswick, NJ, USA,
Psychometrika. 2015 Sep;80(3):834-55. doi: 10.1007/s11336-014-9420-2. Epub 2014 Sep 30.
The present paper proposes a hierarchical, multi-unidimensional two-parameter logistic item response theory (2PL-MUIRT) model extended for a large number of groups. The proposed model was motivated by a large-scale integrative data analysis (IDA) study which combined data (N = 24,336) from 24 independent alcohol intervention studies. IDA projects face unique challenges that are different from those encountered in individual studies, such as the need to establish a common scoring metric across studies and to handle missingness in the pooled data. To address these challenges, we developed a Markov chain Monte Carlo (MCMC) algorithm for a hierarchical 2PL-MUIRT model for multiple groups in which not only were the item parameters and latent traits estimated, but the means and covariance structures for multiple dimensions were also estimated across different groups. Compared to a few existing MCMC algorithms for multidimensional IRT models that constrain the item parameters to facilitate estimation of the covariance matrix, we adapted an MCMC algorithm so that we could directly estimate the correlation matrix for the anchor group without any constraints on the item parameters. The feasibility of the MCMC algorithm and the validity of the basic calibration procedure were examined using a simulation study. Results showed that model parameters could be adequately recovered, and estimated latent trait scores closely approximated true latent trait scores. The algorithm was then applied to analyze real data (69 items across 20 studies for 22,608 participants). The posterior predictive model check showed that the model fit all items well, and the correlations between the MCMC scores and original scores were overall quite high. An additional simulation study demonstrated robustness of the MCMC procedures in the context of the high proportion of missingness in data. The Bayesian hierarchical IRT model using the MCMC algorithms developed in the current study has the potential to be widely implemented for IDA studies or multi-site studies, and can be further refined to meet more complicated needs in applied research.
本文提出了一种为大量群组扩展的分层、多维度双参数逻辑斯蒂项目反应理论(2PL-MUIRT)模型。该模型是由一项大规模整合数据分析(IDA)研究推动的,该研究结合了来自24项独立酒精干预研究的数据(N = 24,336)。IDA项目面临着与个体研究中不同的独特挑战,比如需要在各项研究中建立共同的评分指标,以及处理汇总数据中的缺失值。为应对这些挑战,我们为多组的分层2PL-MUIRT模型开发了一种马尔可夫链蒙特卡罗(MCMC)算法,在该模型中不仅要估计项目参数和潜在特质,还要在不同组间估计多维度的均值和协方差结构。与一些现有的用于多维IRT模型的MCMC算法相比,这些算法通过约束项目参数来便于协方差矩阵的估计,我们改编了一种MCMC算法,这样我们就可以直接估计锚定组的相关矩阵,而不对项目参数施加任何约束。通过一项模拟研究检验了MCMC算法的可行性和基本校准程序的有效性。结果表明,模型参数能够得到充分恢复,并且估计的潜在特质分数与真实潜在特质分数非常接近。然后将该算法应用于分析实际数据(来自20项研究的69个项目,涉及22,608名参与者)。后验预测模型检验表明该模型对所有项目拟合良好,并且MCMC分数与原始分数之间的相关性总体上相当高。另一项模拟研究证明了MCMC程序在数据缺失比例较高情况下的稳健性。使用本研究开发的MCMC算法的贝叶斯分层IRT模型有潜力在IDA研究或多地点研究中得到广泛应用,并且可以进一步完善以满足应用研究中更复杂的需求。