Department of Population Health Sciences.
Psychol Methods. 2023 Feb;28(1):39-60. doi: 10.1037/met0000413. Epub 2021 Oct 25.
Individuals routinely differ in how they present with psychiatric illnesses and in how they respond to treatment. This heterogeneity, when overlooked in data analysis, can lead to misspecified models and distorted inferences. While several methods exist to handle various forms of heterogeneity in latent variable models, their implementation in applied research requires additional layers of model crafting, which might be a reason for their underutilization. In response, we present a robust estimation approach based on the expectation-maximization (EM) algorithm. Our method makes minor adjustments to EM to enable automatic detection of population heterogeneity and to recognize individuals who are inadequately explained by the assumed model. Each individual is associated with a probability that reflects how likely their data were to have been generated from the assumed model. The individual-level probabilities are simultaneously estimated and used to weight each individual's contribution in parameter estimation. We examine the utility of our approach for Gaussian mixture models and linear factor models through several simulation studies, drawing contrasts with the EM algorithm. We demonstrate that our method yields inferences more robust to population heterogeneity or other model misspecifications than EM does. We hope that the proposed approach can be incorporated into the model-building process to improve population-level estimates and to shed light on subsets of the population that demand further attention. (PsycInfo Database Record (c) 2023 APA, all rights reserved).
个体在呈现精神疾病的方式和对治疗的反应方面通常存在差异。这种异质性如果在数据分析中被忽视,可能会导致模型指定不当和推断失真。虽然有几种方法可以处理潜在变量模型中的各种形式的异质性,但在应用研究中实施这些方法需要额外的模型制作层次,这可能是它们未被充分利用的原因之一。为了应对这一问题,我们提出了一种基于期望最大化(EM)算法的稳健估计方法。我们的方法对 EM 进行了微小的调整,以实现自动检测群体异质性,并识别出那些不能被假设模型充分解释的个体。每个个体都与一个概率相关联,该概率反映了他们的数据从假设模型生成的可能性。个体水平的概率同时进行估计,并用于加权每个个体在参数估计中的贡献。我们通过几项模拟研究,考察了我们的方法在高斯混合模型和线性因子模型中的效用,与 EM 算法进行了对比。我们证明,与 EM 相比,我们的方法在推断方面对群体异质性或其他模型失配更稳健。我们希望所提出的方法可以被纳入模型构建过程中,以提高群体水平的估计,并揭示需要进一步关注的人群子集。(PsycInfo 数据库记录(c)2023 APA,保留所有权利)。