• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

关于数据聚类对多重填补方差估计器影响的一则注释:对[文献名称1]、[文献名称2]的理论附录

A Note on the Effect of Data Clustering on the Multiple-Imputation Variance Estimator: A Theoretical Addendum to , .

作者信息

He Yulei, Shimizu Iris, Schappert Susan, Xu Jianmin, Beresovsky Vladislav, Khan Diba, Valverde Roberto, Schenker Nathaniel

机构信息

National Center for Health Statistics, Centers for Disease Control and Prevention, Hyattsville, MD, 20782, U.S.A.

出版信息

J Off Stat. 2016;32(1):147-164. doi: 10.1515/jos-2016-0007. Epub 2016 Mar 10.

DOI:10.1515/jos-2016-0007
PMID:30948863
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC6444354/
Abstract

Multiple imputation is a popular approach to handling missing data. Although it was originally motivated by survey nonresponse problems, it has been readily applied to other data settings. However, its general behavior still remains unclear when applied to survey data with complex sample designs, including clustering. Recently, Lewis et al. (2014) compared single- and multiple-imputation analyses for certain incomplete variables in the 2008 National Ambulatory Medicare Care Survey, which has a nationally representative, multistage, and clustered sampling design. Their study results suggested that the increase of the variance estimate due to multiple imputation compared with single imputation largely disappears for estimates with large design effects. We complement their empirical research by providing some theoretical reasoning. We consider data sampled from an equally weighted, single-stage cluster design and characterize the process using a balanced, one-way normal random-effects model. Assuming that the missingness is completely at random, we derive analytic expressions for the within- and between-multiple-imputation variance estimators for the mean estimator, and thus conveniently reveal the impact of design effects on these variance estimators. We propose approximations for the fraction of missing information in clustered samples, extending previous results for simple random samples. We discuss some generalizations of this research and its practical implications for data release by statistical agencies.

摘要

多重填补是处理缺失数据的一种常用方法。尽管它最初是由调查无回答问题引发的,但已被广泛应用于其他数据设置。然而,当应用于具有复杂样本设计(包括聚类)的调查数据时,其一般行为仍不明确。最近,刘易斯等人(2014年)在2008年全国门诊医疗保险护理调查中,对某些不完整变量的单重填补分析和多重填补分析进行了比较,该调查采用了具有全国代表性的多阶段聚类抽样设计。他们的研究结果表明,与单重填补相比,多重填补导致的方差估计增加在设计效应较大的估计中基本消失。我们通过提供一些理论推理来补充他们的实证研究。我们考虑从等权重单阶段聚类设计中抽样的数据,并使用平衡的单向正态随机效应模型来描述该过程。假设缺失是完全随机的,我们推导出均值估计量的多重填补内方差估计量和多重填补间方差估计量的解析表达式,从而方便地揭示设计效应对这些方差估计量的影响。我们提出了聚类样本中缺失信息比例的近似值,扩展了简单随机样本的先前结果。我们讨论了这项研究的一些推广及其对统计机构数据发布的实际意义。

相似文献

1
A Note on the Effect of Data Clustering on the Multiple-Imputation Variance Estimator: A Theoretical Addendum to , .关于数据聚类对多重填补方差估计器影响的一则注释:对[文献名称1]、[文献名称2]的理论附录
J Off Stat. 2016;32(1):147-164. doi: 10.1515/jos-2016-0007. Epub 2016 Mar 10.
2
Multiple imputation with missing data indicators.带有缺失数据指标的多重插补。
Stat Methods Med Res. 2021 Dec;30(12):2685-2700. doi: 10.1177/09622802211047346. Epub 2021 Oct 13.
3
Folic acid supplementation and malaria susceptibility and severity among people taking antifolate antimalarial drugs in endemic areas.在流行地区,服用抗叶酸抗疟药物的人群中,叶酸补充剂与疟疾易感性和严重程度的关系。
Cochrane Database Syst Rev. 2022 Feb 1;2(2022):CD014217. doi: 10.1002/14651858.CD014217.
4
Synthetic Multiple-Imputation Procedure for Multistage Complex Samples.多阶段复杂样本的合成多重填补程序
J Off Stat. 2016 Mar;32(1):231-256. doi: 10.1515/JOS-2016-0011. Epub 2016 Mar 10.
5
A two-step semiparametric method to accommodate sampling weights in multiple imputation.一种用于在多重填补中纳入抽样权重的两步半参数方法。
Biometrics. 2016 Mar;72(1):242-52. doi: 10.1111/biom.12413. Epub 2015 Sep 22.
6
Empirical Comparison of Imputation Methods for Multivariate Missing Data in Public Health.公共卫生中多元缺失数据插补方法的实证比较。
Int J Environ Res Public Health. 2023 Jan 14;20(2):1524. doi: 10.3390/ijerph20021524.
7
Multiple imputation methods for handling incomplete longitudinal and clustered data where the target analysis is a linear mixed effects model.用于处理目标分析为线性混合效应模型的不完全纵向和聚类数据的多重填补方法。
Biom J. 2020 Mar;62(2):444-466. doi: 10.1002/bimj.201900051. Epub 2020 Jan 9.
8
Multiple imputation for non-response when estimating HIV prevalence using survey data.使用调查数据估计艾滋病毒流行率时对无应答情况的多重填补法
BMC Public Health. 2015 Oct 16;15:1059. doi: 10.1186/s12889-015-2390-1.
9
Multiple Imputation in Two-Stage Cluster Samples Using The Weighted Finite Population Bayesian Bootstrap.使用加权有限总体贝叶斯自助法对两阶段整群样本进行多重填补
J Surv Stat Methodol. 2016 Jun 1;4(2):139-170. doi: 10.1093/jssam/smv031. Epub 2016 Jan 31.
10
Multiple imputation methods for bivariate outcomes in cluster randomised trials.整群随机试验中双变量结局的多重填补方法。
Stat Med. 2016 Sep 10;35(20):3482-96. doi: 10.1002/sim.6935. Epub 2016 Mar 14.

引用本文的文献

1
Improved methods for estimating fraction of missing information in multiple imputation.多重填补中缺失信息比例估计的改进方法。
Cogent Math Stat. 2018;5:1551504. doi: 10.1080/25742558.2018.1551504. Epub 2018 Nov 23.

本文引用的文献

1
Quantifying the impact of fixed effects modeling of clusters in multiple imputation for cluster randomized trials.量化整群随机试验多重填补中整群固定效应建模的影响。
Biom J. 2011 Feb;53(1):57-74. doi: 10.1002/bimj.201000140.
2
How many imputations are really needed? Some practical clarifications of multiple imputation theory.究竟需要多少次插补?多重插补理论的一些实际阐释。
Prev Sci. 2007 Sep;8(3):206-13. doi: 10.1007/s11121-007-0070-9. Epub 2007 Jun 5.
3
Multiple imputation: review of theory, implementation and software.多重填补:理论、实施与软件综述
Stat Med. 2007 Jul 20;26(16):3057-77. doi: 10.1002/sim.2787.