通过稳健估计解决潜在变量设置中的异质人群。

Addressing heterogeneous populations in latent variable settings through robust estimation.

机构信息

Department of Population Health Sciences.

出版信息

Psychol Methods. 2023 Feb;28(1):39-60. doi: 10.1037/met0000413. Epub 2021 Oct 25.

DOI:10.1037/met0000413

PMID:34694831

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC9035483/

Abstract

Individuals routinely differ in how they present with psychiatric illnesses and in how they respond to treatment. This heterogeneity, when overlooked in data analysis, can lead to misspecified models and distorted inferences. While several methods exist to handle various forms of heterogeneity in latent variable models, their implementation in applied research requires additional layers of model crafting, which might be a reason for their underutilization. In response, we present a robust estimation approach based on the expectation-maximization (EM) algorithm. Our method makes minor adjustments to EM to enable automatic detection of population heterogeneity and to recognize individuals who are inadequately explained by the assumed model. Each individual is associated with a probability that reflects how likely their data were to have been generated from the assumed model. The individual-level probabilities are simultaneously estimated and used to weight each individual's contribution in parameter estimation. We examine the utility of our approach for Gaussian mixture models and linear factor models through several simulation studies, drawing contrasts with the EM algorithm. We demonstrate that our method yields inferences more robust to population heterogeneity or other model misspecifications than EM does. We hope that the proposed approach can be incorporated into the model-building process to improve population-level estimates and to shed light on subsets of the population that demand further attention. (PsycInfo Database Record (c) 2023 APA, all rights reserved).

摘要

个体在呈现精神疾病的方式和对治疗的反应方面通常存在差异。这种异质性如果在数据分析中被忽视，可能会导致模型指定不当和推断失真。虽然有几种方法可以处理潜在变量模型中的各种形式的异质性，但在应用研究中实施这些方法需要额外的模型制作层次，这可能是它们未被充分利用的原因之一。为了应对这一问题，我们提出了一种基于期望最大化（EM）算法的稳健估计方法。我们的方法对 EM 进行了微小的调整，以实现自动检测群体异质性，并识别出那些不能被假设模型充分解释的个体。每个个体都与一个概率相关联，该概率反映了他们的数据从假设模型生成的可能性。个体水平的概率同时进行估计，并用于加权每个个体在参数估计中的贡献。我们通过几项模拟研究，考察了我们的方法在高斯混合模型和线性因子模型中的效用，与 EM 算法进行了对比。我们证明，与 EM 相比，我们的方法在推断方面对群体异质性或其他模型失配更稳健。我们希望所提出的方法可以被纳入模型构建过程中，以提高群体水平的估计，并揭示需要进一步关注的人群子集。（PsycInfo 数据库记录（c）2023 APA，保留所有权利）。

相似文献

Addressing heterogeneous populations in latent variable settings through robust estimation.

Psychol Methods. 2023 Feb;28(1):39-60. doi: 10.1037/met0000413. Epub 2021 Oct 25.

An EM algorithm to improve the estimation of the probability of clonal relatedness of pairs of tumors in cancer patients.

BMC Bioinformatics. 2019 Nov 8;20(1):555. doi: 10.1186/s12859-019-3148-z.

Learning mixture models with the regularized latent maximum entropy principle.

IEEE Trans Neural Netw. 2004 Jul;15(4):903-16. doi: 10.1109/TNN.2004.828755.

Estimating Finite Mixtures of Ordinal Graphical Models.

Psychometrika. 2022 Mar;87(1):83-106. doi: 10.1007/s11336-021-09781-2. Epub 2021 Jun 30.

Addressing patient heterogeneity in disease predictive model development.

Biometrics. 2022 Sep;78(3):1045-1055. doi: 10.1111/biom.13514. Epub 2021 Aug 1.

Gaussian variational estimation for multidimensional item response theory.

Br J Math Stat Psychol. 2021 Jul;74 Suppl 1:52-85. doi: 10.1111/bmsp.12219. Epub 2020 Oct 16.

An expectation-maximization algorithm for the Lasso estimation of quantitative trait locus effects.

Heredity (Edinb). 2010 Nov;105(5):483-94. doi: 10.1038/hdy.2009.180. Epub 2010 Jan 6.

Causal mediation analysis with latent subgroups.

Stat Med. 2021 Nov 10;40(25):5628-5641. doi: 10.1002/sim.9144. Epub 2021 Jul 15.

Marginalized maximum a posteriori estimation for the four-parameter logistic model under a mixture modelling framework.

Br J Math Stat Psychol. 2020 Nov;73 Suppl 1:51-82. doi: 10.1111/bmsp.12185. Epub 2019 Sep 25.

Validation of an approximate REML algorithm for parameter estimation in a multitrait, multiple across-country evaluation model: a simulation study.

J Dairy Sci. 2007 Oct;90(10):4846-55. doi: 10.3168/jds.2007-0072.

引用本文的文献

Detection of differential depressive symptom patterns in a cohort of perinatal women: an exploratory factor analysis using a robust statistics approach.

EClinicalMedicine. 2023 Feb 1;57:101830. doi: 10.1016/j.eclinm.2023.101830. eCollection 2023 Mar.

本文引用的文献

Heterogeneity in psychiatric diagnostic classification.

Psychiatry Res. 2019 Sep;279:15-22. doi: 10.1016/j.psychres.2019.07.005. Epub 2019 Jul 2.

Sex differences in antidepressant efficacy.

Neuropsychopharmacology. 2019 Jan;44(1):140-154. doi: 10.1038/s41386-018-0156-z. Epub 2018 Jul 20.

Parsing the heterogeneity of depression: An exploratory factor analysis across commonly used depression rating scales.

J Affect Disord. 2018 Apr 15;231:51-57. doi: 10.1016/j.jad.2018.01.027. Epub 2018 Feb 5.

The Problem with Having Two Watches: Assessment of Fit When RMSEA and CFI Disagree.

Multivariate Behav Res. 2016 Mar-Jun;51(2-3):220-39. doi: 10.1080/00273171.2015.1134306. Epub 2016 Mar 25.

Choosing the Optimal Number of Factors in Exploratory Factor Analysis: A Model Selection Perspective.

Multivariate Behav Res. 2013 Jan;48(1):28-56. doi: 10.1080/00273171.2012.710386.

Detecting Outliers in Factor Analysis Using the Forward Search Algorithm.

Multivariate Behav Res. 2008 Jul-Sep;43(3):453-75. doi: 10.1080/00273170802285909.

A Moderated Nonlinear Factor Model for the Development of Commensurate Measures in Integrative Data Analysis.

Multivariate Behav Res. 2014 Jun;49(3):214-231. doi: 10.1080/00273171.2014.889594.

How many different ways do patients meet the diagnostic criteria for major depressive disorder?

Compr Psychiatry. 2015 Jan;56:29-34. doi: 10.1016/j.comppsych.2014.09.007. Epub 2014 Sep 6.

Models and Strategies for Factor Mixture Analysis: An Example Concerning the Structure Underlying Psychological Disorders.

Struct Equ Modeling. 2013 Oct 1;20(4). doi: 10.1080/10705511.2013.824786.

Identifying careless responses in survey data.

Psychol Methods. 2012 Sep;17(3):437-55. doi: 10.1037/a0028085. Epub 2012 Apr 16.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

通过稳健估计解决潜在变量设置中的异质人群。

Addressing heterogeneous populations in latent variable settings through robust estimation.

机构信息

出版信息

相似文献

引用本文的文献

本文引用的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献