Yuan Ke-Hai, Bentler Peter M
University of Notre Dame.
Sociol Methodol. 2010 Aug;40(1):191-245. doi: 10.1111/j.1467-9531.2010.01224.x.
This paper proposes a two-stage maximum likelihood (ML) approach to normal mixture structural equation modeling (SEM), and develops statistical inference that allows distributional misspecification. Saturated means and covariances are estimated at stage-1 together with a sandwich-type covariance matrix. These are used to evaluate structural models at stage-2. Techniques accumulated in the conventional SEM literature for model diagnosis and evaluation can be used to study the model structure for each component. Examples show that the two-stage ML approach leads to correct or nearly correct models even when the normal mixture assumptions are violated and initial models are misspecified. Compared to single-stage ML, two-stage ML avoids the confounding effect of model specification and the number of components, and is computationally more efficient. Monte-Carlo results indicate that two-stage ML loses only minimal efficiency under the condition where single-stage ML performs best. Monte-Carlo results also indicate that the commonly used model selection criterion BIC is more robust to distribution violations for the saturated model than that for a structural model at moderate sample sizes. The proposed two-stage ML approach is also extremely flexible in modeling different components with different models. Potential new developments in the mixture modeling literature can be easily adapted to study issues with normal mixture SEM.
本文提出了一种用于正态混合结构方程模型(SEM)的两阶段最大似然(ML)方法,并开展了允许分布错误设定的统计推断。在第一阶段估计饱和均值和协方差以及一个三明治型协方差矩阵。这些用于在第二阶段评估结构模型。传统SEM文献中积累的用于模型诊断和评估的技术可用于研究每个成分的模型结构。示例表明,即使正态混合假设被违反且初始模型被错误设定,两阶段ML方法也能得出正确或近乎正确的模型。与单阶段ML相比,两阶段ML避免了模型设定和成分数量的混杂效应,并且计算效率更高。蒙特卡罗结果表明,在单阶段ML表现最佳的条件下,两阶段ML仅损失极小的效率。蒙特卡罗结果还表明,在中等样本量下,常用的模型选择标准BIC对饱和模型的分布违反情况比结构模型更具鲁棒性。所提出的两阶段ML方法在使用不同模型对不同成分进行建模时也极具灵活性。混合建模文献中的潜在新进展可轻松适用于研究正态混合SEM的问题。