Sen Sedat, Cohen Allan S
Harran University, Sanliurfa, Turkey.
University of Georgia, Athens, USA.
Educ Psychol Meas. 2024 Jun;84(3):481-509. doi: 10.1177/00131644231180529. Epub 2023 Jun 26.
A Monte Carlo simulation study was conducted to compare fit indices used for detecting the correct latent class in three dichotomous mixture item response theory (IRT) models. Ten indices were considered: Akaike's information criterion (AIC), the corrected AIC (AICc), Bayesian information criterion (BIC), consistent AIC (CAIC), Draper's information criterion (DIC), sample size adjusted BIC (SABIC), relative entropy, the integrated classification likelihood criterion (ICL-BIC), the adjusted Lo-Mendell-Rubin (LMR), and Vuong-Lo-Mendell-Rubin (VLMR). The accuracy of the fit indices was assessed for correct detection of the number of latent classes for different simulation conditions including sample size (2,500 and 5,000), test length (15, 30, and 45), mixture proportions (equal and unequal), number of latent classes (2, 3, and 4), and latent class separation (no-separation and small separation). Simulation study results indicated that as the number of examinees or number of items increased, correct identification rates also increased for most of the indices. Correct identification rates by the different fit indices, however, decreased as the number of estimated latent classes or parameters (i.e., model complexity) increased. Results were good for BIC, CAIC, DIC, SABIC, ICL-BIC, LMR, and VLMR, and the relative entropy index tended to select correct models most of the time. Consistent with previous studies, AIC and AICc showed poor performance. Most of these indices had limited utility for three-class and four-class mixture 3PL model conditions.
进行了一项蒙特卡罗模拟研究,以比较用于在三种二分混合项目反应理论(IRT)模型中检测正确潜在类别的拟合指数。考虑了十个指数:赤池信息准则(AIC)、校正后的AIC(AICc)、贝叶斯信息准则(BIC)、一致AIC(CAIC)、德雷珀信息准则(DIC)、样本量调整后的BIC(SABIC)、相对熵、综合分类似然准则(ICL-BIC)、调整后的洛-门德尔-鲁宾(LMR)和Vuong-洛-门德尔-鲁宾(VLMR)。针对不同模拟条件下潜在类别的数量的正确检测,评估了拟合指数的准确性,这些条件包括样本量(2500和5000)、测试长度(15、30和45)、混合比例(相等和不相等)、潜在类别的数量(2、3和4)以及潜在类别的分离程度(无分离和小分离)。模拟研究结果表明,随着考生数量或项目数量的增加,大多数指数的正确识别率也会提高。然而,随着估计的潜在类别或参数数量(即模型复杂性)的增加,不同拟合指数的正确识别率会降低。BIC、CAIC、DIC、SABIC、ICL-BIC、LMR和VLMR的结果较好,相对熵指数在大多数情况下倾向于选择正确的模型。与先前的研究一致,AIC和AICc表现不佳。这些指数中的大多数在三类和四类混合3PL模型条件下的效用有限。