Multiple imputation of longitudinal categorical data through Bayesian mixture latent Markov models.

Authors

Vidotto Davide, Vermunt Jeroen K, Van Deun Katrijn

Affiliation

Department of Methodology and Statistics, Tilburg University, Tilburg, Netherlands.

Publication information

J Appl Stat. 2019 Nov 24;47(10):1720-1738. doi: 10.1080/02664763.2019.1692794. eCollection 2020.

DOI: 10.1080/02664763.2019.1692794

PMID: 35707130

Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC9041790/

Abstract

Standard latent class modeling has recently been shown to provide a flexible tool for the multiple imputation (MI) of missing categorical covariates in cross-sectional studies. This article introduces an analogous tool for longitudinal studies: MI using Bayesian mixture Latent Markov (BMLM) models. Besides retaining the benefits of latent class models, i.e. respecting the (categorical) measurement scale of the variables and preserving possibly complex relationships between variables within a measurement occasion, the Markov dependence structure of the proposed BMLM model allows capturing lagged dependencies between adjacent time points, while the time-constant mixture structure allows capturing dependencies across all time points, as well as retrieving associations between time-varying and time-constant variables. The performance of the BMLM model for MI is evaluated by means of a simulation study and an empirical experiment, in which it is compared with complete case analysis and MICE. Results show good performance of the proposed method in retrieving the parameters of the analysis model. In contrast, competing methods could provide correct estimates only for some aspects of the data.
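
The dependence structure described in the abstract can be illustrated with a small generative sketch. This is not the authors' implementation or their exact parameterization: the function name `simulate_mixture_latent_markov`, the array layout, and all probability values are hypothetical, and the sketch assumes that item responses depend only on the current latent state. A time-constant class `w` is drawn once per person (the mixture part), latent states follow a first-order Markov chain whose probabilities depend on `w`, and the categorical items at each occasion are drawn given the current state.

```python
import numpy as np

def simulate_mixture_latent_markov(n, T, mix, init, trans, emit, seed=0):
    """Draw categorical panel data from a toy mixture latent Markov model.

    mix:   (L,)       probabilities of the time-constant class w
    init:  (L, K)     initial-state probabilities given w
    trans: (L, K, K)  transition probabilities given w (each row sums to 1)
    emit:  (K, J, C)  category probabilities of item j given latent state k
    """
    rng = np.random.default_rng(seed)
    L, K = init.shape
    _, J, C = emit.shape
    w = rng.choice(L, size=n, p=mix)          # time-constant mixture class
    s = np.empty((n, T), dtype=int)           # time-varying latent states
    y = np.empty((n, T, J), dtype=int)        # observed categorical items
    for i in range(n):
        for t in range(T):
            # First occasion uses the initial distribution; later occasions
            # depend only on the previous state (lag-1 Markov dependence).
            p = init[w[i]] if t == 0 else trans[w[i], s[i, t - 1]]
            s[i, t] = rng.choice(K, p=p)
            for j in range(J):
                y[i, t, j] = rng.choice(C, p=emit[s[i, t], j])
    return w, s, y

# Hypothetical parameters: L = 2 classes, K = 2 states, J = 2 items, C = 3 categories.
mix = np.array([0.5, 0.5])
init = np.array([[0.8, 0.2], [0.3, 0.7]])
trans = np.array([[[0.9, 0.1], [0.2, 0.8]],
                  [[0.6, 0.4], [0.5, 0.5]]])
emit = np.array([[[0.7, 0.2, 0.1], [0.6, 0.3, 0.1]],
                 [[0.1, 0.3, 0.6], [0.2, 0.2, 0.6]]])

w, s, y = simulate_mixture_latent_markov(200, 4, mix, init, trans, emit)
print(y[0])  # item responses of the first person across the 4 occasions
```

In an imputation setting, missing entries of `y` would be filled with draws from the item distributions given the sampled latent states, repeated across posterior draws to obtain multiple completed data sets. The sketch shows only the data-generating side.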

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7f77/9041790/f47002846119/CJAS_A_1692794_F0001_OC.jpg

Similar articles

Bayesian Multilevel Latent Class Models for the Multiple Imputation of Nested Categorical Data.

J Educ Behav Stat. 2018 Oct;43(5):511-539. doi: 10.3102/1076998618769871. Epub 2018 Apr 30.

Multiple imputation for discrete data: Evaluation of the joint latent normal model.

Biom J. 2019 Jul;61(4):1003-1019. doi: 10.1002/bimj.201800222. Epub 2019 Mar 14.

A comparison of multiple imputation methods for missing data in longitudinal studies.

BMC Med Res Methodol. 2018 Dec 12;18(1):168. doi: 10.1186/s12874-018-0615-6.

Missing data strategies for time-varying confounders in comparative effectiveness studies of non-missing time-varying exposures and right-censored outcomes.

Stat Med. 2019 Jul 30;38(17):3204-3220. doi: 10.1002/sim.8174. Epub 2019 May 17.

Missing data in longitudinal studies: cross-sectional multiple imputation provides similar estimates to full-information maximum likelihood.

Ann Epidemiol. 2014 Jan;24(1):75-7. doi: 10.1016/j.annepidem.2013.10.007. Epub 2013 Oct 18.

Dynamic Latent Trait Models with Mixed Hidden Markov Structure for Mixed Longitudinal Outcomes.

J Appl Stat. 2016;43(4):704-720. doi: 10.1080/02664763.2015.1077373. Epub 2015 Oct 2.

Multiple Imputation with Factor Scores: A Practical Approach for Handling Simultaneous Missingness Across Items in Longitudinal Designs.

Multivariate Behav Res. 2025 Jan-Feb;60(1):61-89. doi: 10.1080/00273171.2024.2371816. Epub 2024 Jul 12.

A comparison of multiple imputation methods for handling missing values in longitudinal data in the presence of a time-varying covariate with a non-linear association with time: a simulation study.

BMC Med Res Methodol. 2017 Jul 25;17(1):114. doi: 10.1186/s12874-017-0372-y.

Latent class based multiple imputation approach for missing categorical data.

J Stat Plan Inference. 2010 Nov;140(11):3252-3262. doi: 10.1016/j.jspi.2010.04.020.

Cited by

Associations of social media and health content use with sexual risk behaviours among adolescents in South Africa.

Sex Reprod Health Matters. 2023 Dec;31(1):2267893. doi: 10.1080/26410397.2023.2267893. Epub 2023 Nov 10.

References

Bayesian Multilevel Latent Class Models for the Multiple Imputation of Nested Categorical Data.

J Educ Behav Stat. 2018 Oct;43(5):511-539. doi: 10.3102/1076998618769871. Epub 2018 Apr 30.

A discrete time event-history approach to informative drop-out in mixed latent Markov models with covariates.

Biometrics. 2015 Mar;71(1):80-89. doi: 10.1111/biom.12224. Epub 2014 Sep 16.

Stochastic relaxation, Gibbs distributions, and the Bayesian restoration of images.

IEEE Trans Pattern Anal Mach Intell. 1984 Jun;6(6):721-41. doi: 10.1109/tpami.1984.4767596.

Multiple imputation using chained equations: Issues and guidance for practice.

Stat Med. 2011 Feb 20;30(4):377-99. doi: 10.1002/sim.4067. Epub 2010 Nov 30.

Missing data: our view of the state of the art.

Psychol Methods. 2002 Jun;7(2):147-77.
