Yuan Ke-Hai, Tian Yubin, Yanagihara Hirokazu
Department of Psychology, University of Notre Dame, Notre Dame, IN, 46556, USA,
Psychometrika. 2015 Jun;80(2):379-405. doi: 10.1007/s11336-013-9386-5. Epub 2013 Dec 11.
Survey data typically contain many variables. Structural equation modeling (SEM) is commonly used in analyzing such data. The most widely used statistic for evaluating the adequacy of a SEM model is T ML, a slight modification to the likelihood ratio statistic. Under normality assumption, T ML approximately follows a chi-square distribution when the number of observations (N) is large and the number of items or variables (p) is small. However, in practice, p can be rather large while N is always limited due to not having enough participants. Even with a relatively large N, empirical results show that T ML rejects the correct model too often when p is not too small. Various corrections to T ML have been proposed, but they are mostly heuristic. Following the principle of the Bartlett correction, this paper proposes an empirical approach to correct T ML so that the mean of the resulting statistic approximately equals the degrees of freedom of the nominal chi-square distribution. Results show that empirically corrected statistics follow the nominal chi-square distribution much more closely than previously proposed corrections to T ML, and they control type I errors reasonably well whenever N ≥ max(50,2p). The formulations of the empirically corrected statistics are further used to predict type I errors of T ML as reported in the literature, and they perform well.
调查数据通常包含许多变量。结构方程模型(SEM)常用于分析此类数据。用于评估SEM模型适配性的最广泛使用的统计量是T ML,它是似然比统计量的一种轻微修改。在正态性假设下,当观测值数量(N)较大且项目或变量数量(p)较小时,T ML近似服从卡方分布。然而,在实际中,由于没有足够的参与者,p可能会相当大而N总是有限的。即使N相对较大,实证结果表明当p不太小时,T ML也经常拒绝正确的模型。已经提出了对T ML的各种修正,但大多是启发式的。遵循巴特利特修正的原则,本文提出一种实证方法来修正T ML,以使所得统计量的均值近似等于名义卡方分布的自由度。结果表明,经实证修正的统计量比先前提出的对T ML的修正更紧密地遵循名义卡方分布,并且只要N≥max(50,2p),它们就能合理地控制I型错误。经实证修正的统计量的公式进一步用于预测文献中报道的T ML的I型错误,并且表现良好。