How many imputations are really needed? Some practical clarifications of multiple imputation theory.

Authors

Graham John W, Olchowski Allison E, Gilreath Tamika D

Affiliation

Department of Biobehavioral Health, Penn State University, E-315 Health & Human Development Bldg., University Park, PA 16802, USA.

Publication information

Prev Sci. 2007 Sep;8(3):206-13. doi: 10.1007/s11121-007-0070-9. Epub 2007 Jun 5.

DOI: 10.1007/s11121-007-0070-9
PMID: 17549635

Abstract

Multiple imputation (MI) and full information maximum likelihood (FIML) are the two most common approaches to missing data analysis. In theory, MI and FIML are equivalent when identical models are tested using the same variables, and when m, the number of imputations performed with MI, approaches infinity. However, it is important to know how many imputations are necessary before MI and FIML are sufficiently equivalent in ways that are important to prevention scientists. MI theory suggests that small values of m, even on the order of three to five imputations, yield excellent results. Previous guidelines for sufficient m are based on relative efficiency, which involves the fraction of missing information (gamma) for the parameter being estimated, and m. In the present study, we used a Monte Carlo simulation to test MI models across several scenarios in which gamma and m were varied. Standard errors and p-values for the regression coefficient of interest varied as a function of m, but not at the same rate as relative efficiency. Most importantly, statistical power for small effect sizes diminished as m became smaller, and the rate of this power falloff was much greater than predicted by changes in relative efficiency. Based on our findings, we recommend that researchers using MI should perform many more imputations than previously considered sufficient. These recommendations are based on gamma, and take into consideration one's tolerance for a preventable power falloff (compared to FIML) due to using too few imputations.
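
The abstract refers to relative efficiency as a function of gamma and m without stating the formula. As a rough illustration, the sketch below assumes the standard expression from Rubin's MI theory, RE = 1 / (1 + gamma / m), on the variance scale. The function names and the gamma and m values shown are illustrative choices, not figures taken from the study.

```python
def relative_efficiency(gamma: float, m: int) -> float:
    """Relative efficiency of m imputations versus infinitely many.

    Assumes Rubin's formula RE = 1 / (1 + gamma / m), where gamma is the
    fraction of missing information for the parameter of interest.
    """
    if not 0.0 <= gamma <= 1.0:
        raise ValueError("gamma must lie in [0, 1]")
    if m < 1:
        raise ValueError("m must be a positive integer")
    return 1.0 / (1.0 + gamma / m)


def efficiency_table(gammas, ms):
    """Return {gamma: {m: RE}} for every combination of the inputs."""
    return {g: {m: relative_efficiency(g, m) for m in ms} for g in gammas}


if __name__ == "__main__":
    # Illustrative values only; they are not the scenarios used in the paper.
    table = efficiency_table(gammas=[0.1, 0.5, 0.9], ms=[3, 5, 20, 100])
    for g, row in table.items():
        cells = ", ".join(f"m={m}: {re:.4f}" for m, re in row.items())
        print(f"gamma={g}: {cells}")
```

Under this formula, relative efficiency is already above 0.90 at m = 5 when gamma = 0.5, which is why small m looked adequate under the earlier guidelines. The abstract's point is that statistical power falls off faster than this quantity suggests, so the formula alone should not be used to choose m.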

Similar articles

1
How many imputations are really needed? Some practical clarifications of multiple imputation theory.
Prev Sci. 2007 Sep;8(3):206-13. doi: 10.1007/s11121-007-0070-9. Epub 2007 Jun 5.
2
Number of imputations needed to stabilize estimated treatment difference in longitudinal data analysis.
Stat Methods Med Res. 2017 Apr;26(2):674-690. doi: 10.1177/0962280214554439. Epub 2014 Oct 10.
3
A comparison of full information maximum likelihood and multiple imputation in structural equation modeling with missing data.
Psychol Methods. 2021 Aug;26(4):466-485. doi: 10.1037/met0000381. Epub 2021 Jan 28.
4
Maximum likelihood versus multiple imputation for missing data in small longitudinal samples with nonnormality.
Psychol Methods. 2017 Sep;22(3):426-449. doi: 10.1037/met0000094. Epub 2016 Oct 6.
5
Two-stage maximum likelihood approach for item-level missing data in regression.
Behav Res Methods. 2020 Dec;52(6):2306-2323. doi: 10.3758/s13428-020-01355-x.
6
A Comparison of Methods for Creating Multiple Imputations of Nominal Variables.
Multivariate Behav Res. 2017 May-Jun;52(3):290-304. doi: 10.1080/00273171.2017.1289360. Epub 2017 Mar 7.
7
Fraction of Missing Information (γ) at Different Missing Data Fractions in the 2012 NAMCS Physician Workflow Mail Survey.
Appl Math (Irvine). 2016 Jun;7(10):1057-1067. doi: 10.4236/am.2016.710093. Epub 2016 Jun 15.
8
Evaluating FIML and multiple imputation in joint ordinal-continuous measurements models with missing data.
Behav Res Methods. 2022 Jun;54(3):1063-1077. doi: 10.3758/s13428-021-01582-w. Epub 2021 Sep 20.
9
Full Information Maximum Likelihood Estimation for Latent Variable Interactions With Incomplete Indicators.
Multivariate Behav Res. 2017 Jan-Feb;52(1):12-30. doi: 10.1080/00273171.2016.1245600. Epub 2016 Nov 11.
10
Bias and Precision of the "Multiple Imputation, Then Deletion" Method for Dealing With Missing Outcome Data.
Am J Epidemiol. 2015 Sep 15;182(6):528-34. doi: 10.1093/aje/kwv100. Epub 2015 Sep 2.

Articles citing this article

1
Trajectories of Incarceration Over Six Years Among People with Mental Illness and Experiences of Homelessness: Predictive Factors Among Participants in a Randomized Trial of Housing First: Trajectoires d'incarcération sur six ans chez les personnes atteintes de maladie mentale et ayant connu l'itinérance-Facteurs prédictifs chez les participants à un essai à répartition aléatoire en marge du programme «Logement d'abord».
Can J Psychiatry. 2025 Sep 3:7067437251372188. doi: 10.1177/07067437251372188.
2
A Three-Year Longitudinal Study of Athlete Mental Health: A Cricket Case Study.
Scand J Med Sci Sports. 2025 Sep;35(9):e70125. doi: 10.1111/sms.70125.
3
Comparing Multiple Imputation Methods to Address Missing Patient Demographics in Immunization Information Systems: Retrospective Cohort Study.
JMIR Public Health Surveill. 2025 Aug 26;11:e73916. doi: 10.2196/73916.
4
A random forest dynamic threshold imputation method for handling missing data in cognitive diagnosis assessments.
Front Psychol. 2025 Aug 5;16:1487111. doi: 10.3389/fpsyg.2025.1487111. eCollection 2025.
5
Perceived blood glucose regulation after menopause: a cross-sectional survey in women with type 1 diabetes in the Netherlands.
Diabetologia. 2025 Aug 16. doi: 10.1007/s00125-025-06518-z.
6
Effectiveness of a telephone-based randomised clinical trial targeting obesity risk of preschool-aged children: An extension study during the COVID-19 pandemic.
Int J Obes (Lond). 2025 Aug 14. doi: 10.1038/s41366-025-01869-4.
7
Prevalence and gender-specific correlates of hazardous and binge drinking among Swedish and Finnish older adults.
Alcohol Clin Exp Res (Hoboken). 2025 Aug;49(8):1744-1758. doi: 10.1111/acer.70098. Epub 2025 Aug 1.
8
Prior Authorization of Medication and Its Influence on Provider Behavior: Latent Class Analysis.
J Med Internet Res. 2025 Jul 29;27:e75361. doi: 10.2196/75361.
9
Feasibility trial of an unguided ultra-brief online psychological intervention within an online mental health clinic: The "things you do" intervention.
Internet Interv. 2025 Jun 26;41:100852. doi: 10.1016/j.invent.2025.100852. eCollection 2025 Sep.
10
Neighbourhood deprivation and smoking cessation: a survival analysis using the French constances cohort.
BMC Public Health. 2025 Jul 3;25(1):2317. doi: 10.1186/s12889-025-23261-7.

References cited in this article

1
Multiple Imputation for Multivariate Missing-Data Problems: A Data Analyst's Perspective.
Multivariate Behav Res. 1998 Oct 1;33(4):545-71. doi: 10.1207/s15327906mbr3304_5.
2
Missing data: our view of the state of the art.
Psychol Methods. 2002 Jun;7(2):147-77.
3
A comparison of inclusive and restrictive strategies in modern missing data procedures.
Psychol Methods. 2001 Dec;6(4):330-51.