The proportion of missing data should not be used to guide decisions on multiple imputation.

Affiliations

Population Health Sciences, Bristol Medical School, University of Bristol, Oakfield House, Oakfield Grove, Bristol BS8 2BN, UK.

Population Health Sciences, Bristol Medical School, University of Bristol, Oakfield House, Oakfield Grove, Bristol BS8 2BN, UK; MRC Integrative Epidemiology Unit, University of Bristol, Oakfield House, Oakfield Grove, Bristol BS8 2BN, UK.

Publication information

J Clin Epidemiol. 2019 Jun;110:63-73. doi: 10.1016/j.jclinepi.2019.02.016. Epub 2019 Mar 13.

DOI:10.1016/j.jclinepi.2019.02.016
PMID:30878639
Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC6547017/
Abstract

OBJECTIVES

Researchers are concerned whether multiple imputation (MI) or complete case analysis should be used when a large proportion of data are missing. We aimed to provide guidance for drawing conclusions from data with a large proportion of missingness.

STUDY DESIGN AND SETTING

Via simulations, we investigated how the proportion of missing data, the fraction of missing information (FMI), and availability of auxiliary variables affected MI performance. Outcome data were missing completely at random or missing at random (MAR).

RESULTS

Provided sufficient auxiliary information was available, MI was beneficial in terms of bias and never detrimental in terms of efficiency. Models with similar FMI values, but differing proportions of missing data, also had similar precision for effect estimates. In the absence of bias, the FMI was a better guide to the efficiency gains using MI than the proportion of missing data.

CONCLUSION

We provide evidence that for MAR data, valid MI reduces bias even when the proportion of missingness is large. We advise researchers to use FMI to guide choice of auxiliary variables for efficiency gain in imputation analyses, and that sensitivity analyses including different imputation models may be needed if the number of complete cases is small.
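
As an illustration of the quantity the abstract recommends, the sketch below pools estimates from M imputed datasets with Rubin's rules and reports the FMI. It is a minimal example under stated assumptions, not the authors' simulation code: the function name `pool_rubin` and the input numbers are hypothetical, and the FMI uses Rubin's standard finite-M formula with the unadjusted degrees of freedom.

```python
from statistics import mean, variance


def pool_rubin(estimates, variances):
    """Pool M per-imputation estimates with Rubin's rules.

    estimates: point estimates, one per imputed dataset (hypothetical inputs)
    variances: squared standard errors, one per imputed dataset
    Returns (pooled_estimate, total_variance, fmi).
    """
    m = len(estimates)
    if m < 2 or m != len(variances):
        raise ValueError("need at least two imputations with matching variances")

    q_bar = mean(estimates)        # pooled point estimate
    w = mean(variances)            # within-imputation variance
    b = variance(estimates)        # between-imputation variance (divisor M - 1)
    t = w + (1 + 1 / m) * b        # total variance

    if b == 0:
        # No between-imputation variation: nothing is lost to missingness.
        return q_bar, t, 0.0

    r = (1 + 1 / m) * b / w        # relative increase in variance
    df = (m - 1) * (1 + 1 / r) ** 2  # Rubin's (unadjusted) degrees of freedom
    fmi = (r + 2 / (df + 3)) / (r + 1)
    return q_bar, t, fmi


# Hypothetical effect estimates and variances from three imputed datasets.
pooled, total_var, fmi = pool_rubin([1.0, 1.2, 0.8], [0.04, 0.04, 0.04])
print(f"pooled={pooled:.3f} total_var={total_var:.4f} FMI={fmi:.3f}")
```

Comparing the FMI from imputation models with and without a candidate auxiliary variable is one way to apply the advice above: a lower FMI for the same proportion of missing data suggests the auxiliary variable is recovering information.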

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1f72/6547017/ade4318d64cd/gr3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1f72/6547017/d2ee468eec0f/gr1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1f72/6547017/3c6d26707009/gr2.jpg
