• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

纵向数据的多重填补:教程

Multiple Imputation for Longitudinal Data: A Tutorial.

作者信息

Wijesuriya Rushani, Moreno-Betancur Margarita, Carlin John B, White Ian R, Quartagno Matteo, Lee Katherine J

机构信息

Clinical Epidemiology & Biostatistics (CEBU), Murdoch Children's Research Institute, Parkville, Australia.

Department of Paediatrics, University of Melbourne, Melbourne, Australia.

出版信息

Stat Med. 2025 Feb 10;44(3-4):e10274. doi: 10.1002/sim.10274.

DOI:10.1002/sim.10274
PMID:39846338
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11755704/
Abstract

Longitudinal studies are frequently used in medical research and involve collecting repeated measures on individuals over time. Observations from the same individual are invariably correlated and thus an analytic approach that accounts for this clustering by individual is required. While almost all research suffers from missing data, this can be particularly problematic in longitudinal studies as participation often becomes harder to maintain over time. Multiple imputation (MI) is widely used to handle missing data in such studies. When using MI, it is important that the imputation model is compatible with the proposed analysis model. In a longitudinal analysis, this implies that the clustering considered in the analysis model should be reflected in the imputation process. Several MI approaches have been proposed to impute incomplete longitudinal data, such as treating repeated measurements of the same variable as distinct variables or using generalized linear mixed imputation models. However, the uptake of these methods has been limited, as they require additional data manipulation and use of advanced imputation procedures. In this tutorial, we review the available MI approaches that can be used for handling incomplete longitudinal data, including where individuals are clustered within higher-level clusters. We illustrate implementation with replicable R and Stata code using a case study from the Childhood to Adolescence Transition Study.

摘要

纵向研究在医学研究中经常被使用,涉及对个体随时间进行重复测量。来自同一个体的观察结果总是相关的,因此需要一种考虑个体聚类的分析方法。虽然几乎所有研究都存在数据缺失问题,但在纵向研究中这可能尤其成问题,因为随着时间推移,参与往往变得更难维持。多重填补(MI)被广泛用于处理此类研究中的缺失数据。使用MI时,重要的是填补模型要与所提出的分析模型兼容。在纵向分析中,这意味着分析模型中考虑的聚类应在填补过程中得到体现。已经提出了几种MI方法来填补不完整的纵向数据,例如将同一变量的重复测量视为不同变量,或使用广义线性混合填补模型。然而,这些方法的采用受到限制,因为它们需要额外的数据处理和使用先进的填补程序。在本教程中,我们回顾了可用于处理不完整纵向数据的现有MI方法,包括个体在更高级别聚类中的情况。我们使用来自儿童到青少年过渡研究的案例研究,用可复制的R和Stata代码说明实施过程。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7949/11755704/934d84b885bd/SIM-44-0-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7949/11755704/06309634e6a8/SIM-44-0-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7949/11755704/934d84b885bd/SIM-44-0-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7949/11755704/06309634e6a8/SIM-44-0-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7949/11755704/934d84b885bd/SIM-44-0-g003.jpg

相似文献

1
Multiple Imputation for Longitudinal Data: A Tutorial.纵向数据的多重填补:教程
Stat Med. 2025 Feb 10;44(3-4):e10274. doi: 10.1002/sim.10274.
2
Evaluation of approaches for multiple imputation of three-level data.三水平数据的多重插补方法评价。
BMC Med Res Methodol. 2020 Aug 12;20(1):207. doi: 10.1186/s12874-020-01079-8.
3
A comparison of multiple imputation methods for missing data in longitudinal studies.纵向研究中缺失数据的多种插补方法比较。
BMC Med Res Methodol. 2018 Dec 12;18(1):168. doi: 10.1186/s12874-018-0615-6.
4
Multiple imputation methods for handling incomplete longitudinal and clustered data where the target analysis is a linear mixed effects model.用于处理目标分析为线性混合效应模型的不完全纵向和聚类数据的多重填补方法。
Biom J. 2020 Mar;62(2):444-466. doi: 10.1002/bimj.201900051. Epub 2020 Jan 9.
5
A comparison of multiple imputation methods for handling missing values in longitudinal data in the presence of a time-varying covariate with a non-linear association with time: a simulation study.存在与时间呈非线性关联的时变协变量时,用于处理纵向数据中缺失值的多种多重填补方法的比较:一项模拟研究。
BMC Med Res Methodol. 2017 Jul 25;17(1):114. doi: 10.1186/s12874-017-0372-y.
6
Review and evaluation of imputation methods for multivariate longitudinal data with mixed-type incomplete variables.多元纵向混合缺失数据插补方法的评价与研究
Stat Med. 2022 Dec 30;41(30):5844-5876. doi: 10.1002/sim.9592. Epub 2022 Oct 11.
7
Multiple imputation methods for handling missing values in a longitudinal categorical variable with restrictions on transitions over time: a simulation study.多种插补方法处理具有时间过渡限制的纵向分类变量中的缺失值:一项模拟研究。
BMC Med Res Methodol. 2019 Jan 10;19(1):14. doi: 10.1186/s12874-018-0653-0.
8
Two-stage multiple imputation with a longitudinal composite variable.使用纵向复合变量的两阶段多重填补法。
BMC Med Res Methodol. 2025 May 6;25(1):124. doi: 10.1186/s12874-025-02555-9.
9
Multiple imputation approaches for handling incomplete three-level data with time-varying cluster-memberships.处理具有时变聚类成员关系的不完全三级数据的多种插补方法。
Stat Med. 2022 Sep 30;41(22):4385-4402. doi: 10.1002/sim.9515. Epub 2022 Jul 27.
10
Evaluation of two-fold fully conditional specification multiple imputation for longitudinal electronic health record data.纵向电子健康记录数据的双重完全条件指定多重填补法评估
Stat Med. 2014 Sep 20;33(21):3725-37. doi: 10.1002/sim.6184. Epub 2014 Apr 30.

引用本文的文献

1
Bayesian Analysis of Longitudinal Ordinal Data with Missing Values Using Multivariate Probit Models.使用多元概率单位模型对具有缺失值的纵向有序数据进行贝叶斯分析。
J Stat Appl Probab. 2025 May;14(3):337-352. doi: 10.18576/jsap/140302. Epub 2025 May 1.
2
Alcohol Consumption, Risk of Periodontitis and Change of Periodontal Parameters in a Population-Based Cohort Study.一项基于人群的队列研究中的饮酒量、牙周炎风险及牙周参数变化
J Clin Periodontol. 2025 Jul;52(7):1024-1031. doi: 10.1111/jcpe.14154. Epub 2025 Mar 13.

本文引用的文献

1
Review and evaluation of imputation methods for multivariate longitudinal data with mixed-type incomplete variables.多元纵向混合缺失数据插补方法的评价与研究
Stat Med. 2022 Dec 30;41(30):5844-5876. doi: 10.1002/sim.9592. Epub 2022 Oct 11.
2
Multiple imputation approaches for handling incomplete three-level data with time-varying cluster-memberships.处理具有时变聚类成员关系的不完全三级数据的多种插补方法。
Stat Med. 2022 Sep 30;41(22):4385-4402. doi: 10.1002/sim.9515. Epub 2022 Jul 27.
3
Evaluation of approaches for accommodating interactions and non-linear terms in multiple imputation of incomplete three-level data.
评价在不完全三级数据的多重插补中处理交互作用和非线性项的方法。
Biom J. 2022 Dec;64(8):1404-1425. doi: 10.1002/bimj.202000343. Epub 2021 Dec 16.
4
Framework for the treatment and reporting of missing data in observational studies: The Treatment And Reporting of Missing data in Observational Studies framework.观察性研究中缺失数据的处理和报告框架:观察性研究中缺失数据的处理和报告框架。
J Clin Epidemiol. 2021 Jun;134:79-88. doi: 10.1016/j.jclinepi.2021.01.008. Epub 2021 Feb 2.
5
Evaluation of approaches for multiple imputation of three-level data.三水平数据的多重插补方法评价。
BMC Med Res Methodol. 2020 Aug 12;20(1):207. doi: 10.1186/s12874-020-01079-8.
6
Multiple imputation methods for handling incomplete longitudinal and clustered data where the target analysis is a linear mixed effects model.用于处理目标分析为线性混合效应模型的不完全纵向和聚类数据的多重填补方法。
Biom J. 2020 Mar;62(2):444-466. doi: 10.1002/bimj.201900051. Epub 2020 Jan 9.
7
Regression models involving nonlinear effects with missing data: A sequential modeling approach using Bayesian estimation.涉及缺失数据的非线性效应的回归模型:使用贝叶斯估计的序贯建模方法。
Psychol Methods. 2020 Apr;25(2):157-181. doi: 10.1037/met0000233. Epub 2019 Sep 2.
8
Multiple imputation for discrete data: Evaluation of the joint latent normal model.离散数据的多重填补:联合潜在正态模型的评估
Biom J. 2019 Jul;61(4):1003-1019. doi: 10.1002/bimj.201800222. Epub 2019 Mar 14.
9
A Comparison of Multilevel Imputation Schemes for Random Coefficient Models: Fully Conditional Specification and Joint Model Imputation with Random Covariance Matrices.多水平随机系数模型插补方案的比较:完全条件化指定和带有随机协方差矩阵的联合模型插补。
Multivariate Behav Res. 2018 Sep-Oct;53(5):695-713. doi: 10.1080/00273171.2018.1477040. Epub 2019 Jan 29.
10
A comparison of multiple imputation methods for missing data in longitudinal studies.纵向研究中缺失数据的多种插补方法比较。
BMC Med Res Methodol. 2018 Dec 12;18(1):168. doi: 10.1186/s12874-018-0615-6.