Multiple imputation approaches for handling incomplete three-level data with time-varying cluster-memberships.

Affiliations

Department of Pediatrics, Faculty of Medicine Dentistry and Health Sciences, The University of Melbourne, Melbourne, Victoria, Australia.

Clinical Epidemiology and Biostatistics Unit, Murdoch Children's Research Institute, Melbourne, Victoria, Australia.

Publication information

Stat Med. 2022 Sep 30;41(22):4385-4402. doi: 10.1002/sim.9515. Epub 2022 Jul 27.

DOI:10.1002/sim.9515
PMID:35893317
Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC9540355/
Abstract

Three-level data arising from repeated measures on individuals clustered within higher-level units are common in medical research. A complexity arises when individuals change clusters over time, resulting in a cross-classified data structure. Missing values in these studies are commonly handled via multiple imputation (MI). If the three-level, cross-classified structure is modeled in the analysis, it also needs to be accommodated in the imputation model to ensure valid results. While incomplete three-level data can be handled using various approaches within MI, the performance of these approaches in the cross-classified data setting remains unclear. We conducted simulations under a range of scenarios to compare these approaches in the context of an acute-effects cross-classified random effects substantive model, which models the time-varying cluster membership via simple additive random effects. The simulation study was based on a case study in a longitudinal cohort of students clustered within schools. We evaluated methods that ignore the time-varying cluster memberships by taking the first or most common cluster for each individual; pragmatic extensions of single- and two-level MI approaches within the joint modeling (JM) and the fully conditional specification (FCS) frameworks, using dummy indicators (DI) and/or imputing repeated measures in wide format to account for the cross-classified structure; and a three-level FCS MI approach developed specifically for cross-classified data. Results indicated that the FCS implementations performed well in terms of bias and precision, while JM approaches performed poorly. Under both frameworks, approaches using the DI extension should be used with caution in the presence of sparse data.
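The data-preparation steps named in the abstract (taking the first or most common cluster, building dummy indicators, and reshaping repeated measures to wide format) can be illustrated with a minimal sketch. It is not the authors' code and performs no imputation; the toy records, the variable names (`records`, `first_school`, `modal_school`, `dummy_rows`, `wide`) and the choice of reference school are all assumptions made for illustration.

```python
from collections import Counter, defaultdict

# Hypothetical long-format records: (student, wave, school, outcome).
# None marks a missing outcome value. All values are made up.
records = [
    (1, 1, "A", 2.1), (1, 2, "A", None), (1, 3, "B", 2.8),
    (2, 1, "B", 1.5), (2, 2, "B", 1.9), (2, 3, "B", None),
    (3, 1, "C", 3.0), (3, 2, "A", 3.2), (3, 3, "A", None),
]

# Collect each student's school sequence in wave order.
schools_by_student = defaultdict(list)
for student, wave, school, _ in sorted(records, key=lambda r: (r[0], r[1])):
    schools_by_student[student].append(school)

# Approaches that ignore time-varying membership:
# (a) first cluster, (b) most common cluster per individual.
first_school = {s: seq[0] for s, seq in schools_by_student.items()}
modal_school = {
    s: Counter(seq).most_common(1)[0][0] for s, seq in schools_by_student.items()
}

# Dummy-indicator (DI) extension: one 0/1 column per school except a
# reference level, so a single- or two-level imputation model can carry
# school effects as fixed effects.
levels = sorted({r[2] for r in records})
reference, *others = levels  # assumption: first level is the reference
dummy_rows = [
    {f"school_{lvl}": int(school == lvl) for lvl in others}
    for _, _, school, _ in records
]

# Wide format: one row per student, one outcome column per wave, so the
# repeated measures are imputed as distinct variables.
waves = sorted({r[1] for r in records})
wide = {}
for student, wave, _, y in records:
    row = wide.setdefault(student, {f"y_w{w}": None for w in waves})
    row[f"y_w{wave}"] = y

print(first_school, modal_school, wide, sep="\n")
```

With many schools and few students per school, the DI columns become sparse, which is the setting in which the abstract advises caution.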


https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c9ab/9540355/bf45bd251246/SIM-41-4385-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c9ab/9540355/b0c9b14fa78e/SIM-41-4385-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c9ab/9540355/47deeba67766/SIM-41-4385-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c9ab/9540355/6011035e479e/SIM-41-4385-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c9ab/9540355/3f64f1f9c477/SIM-41-4385-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c9ab/9540355/7b28c1089836/SIM-41-4385-g004.jpg
