• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

使用多重填补法进行分析时需要考虑辅助变量中的缺失数据。

Analyses Using Multiple Imputation Need to Consider Missing Data in Auxiliary Variables.

作者信息

Madley-Dowd Paul, Curnow Elinor, Hughes Rachael A, Cornish Rosie, Tilling Kate, Heron Jon

机构信息

Centre for Academic Mental Health, Population Health Sciences, Bristol Medical School, University of Bristol, United Kingdom.

MRC Integrative Epidemiology Unit at the University of Bristol, United Kingdom.

出版信息

Am J Epidemiol. 2024 Aug 27. doi: 10.1093/aje/kwae306.

DOI:10.1093/aje/kwae306
PMID:39191658
Abstract

Auxiliary variables are used in multiple imputation (MI) to reduce bias and increase efficiency. These variables may often themselves be incomplete. We explored how missing data in auxiliary variables influenced estimates obtained from MI. We implemented a simulation study with three different missing data mechanisms for the outcome. We then examined the impact of increasing proportions of missing data and different missingness mechanisms for the auxiliary variable on bias of an unadjusted linear regression coefficient and the fraction of missing information. We illustrate our findings with an applied example in the Avon Longitudinal Study of Parents and Children. We found that where complete records analyses were biased, increasing proportions of missing data in auxiliary variables, under any missing data mechanism, reduced the ability of MI including the auxiliary variable to mitigate this bias. Where there was no bias in the complete records analysis, inclusion of a missing not at random auxiliary variable in MI introduced bias of potentially important magnitude (up to 17% of the effect size in our simulation). Careful consideration of the quantity and nature of missing data in auxiliary variables needs to be made when selecting them for use in MI models.

摘要

辅助变量用于多重填补(MI)以减少偏差并提高效率。这些变量本身往往也不完整。我们探讨了辅助变量中的缺失数据如何影响从MI获得的估计值。我们针对结果实施了一项具有三种不同缺失数据机制的模拟研究。然后,我们研究了辅助变量中缺失数据比例的增加以及不同的缺失机制对未调整线性回归系数偏差和缺失信息比例的影响。我们用阿冯亲子纵向研究中的一个应用实例来说明我们的发现。我们发现,在完整记录分析存在偏差的情况下,在任何缺失数据机制下,辅助变量中缺失数据比例的增加都会降低包含该辅助变量的MI减轻这种偏差的能力。在完整记录分析无偏差的情况下,在MI中纳入一个非随机缺失的辅助变量会引入潜在重要程度的偏差(在我们的模拟中高达效应大小的17%)。在选择辅助变量用于MI模型时,需要仔细考虑其缺失数据的数量和性质。

相似文献

1
Analyses Using Multiple Imputation Need to Consider Missing Data in Auxiliary Variables.使用多重填补法进行分析时需要考虑辅助变量中的缺失数据。
Am J Epidemiol. 2024 Aug 27. doi: 10.1093/aje/kwae306.
2
Multiple imputation using auxiliary imputation variables that only predict missingness can increase bias due to data missing not at random.仅使用辅助预测缺失变量的多重插补可能会因数据缺失而增加偏差。
BMC Med Res Methodol. 2024 Oct 7;24(1):231. doi: 10.1186/s12874-024-02353-9.
3
Multiple imputation of missing data under missing at random: including a collider as an auxiliary variable in the imputation model can induce bias.随机缺失情况下缺失数据的多重填补:在填补模型中纳入一个对撞机作为辅助变量会导致偏差。
Front Epidemiol. 2023 Sep 15;3:1237447. doi: 10.3389/fepid.2023.1237447.
4
Multiple imputation using linked proxy outcome data resulted in important bias reduction and efficiency gains: a simulation study.使用关联替代结局数据的多重填补可显著减少偏差并提高效率:一项模拟研究。
Emerg Themes Epidemiol. 2017 Dec 19;14:14. doi: 10.1186/s12982-017-0068-0. eCollection 2017.
5
Accounting for bias due to outcome data missing not at random: comparison and illustration of two approaches to probabilistic bias analysis: a simulation study.考虑由于非随机缺失结局数据导致的偏倚:两种概率性偏倚分析方法的比较和说明:一项模拟研究。
BMC Med Res Methodol. 2024 Nov 13;24(1):278. doi: 10.1186/s12874-024-02382-4.
6
Bias and Precision of the "Multiple Imputation, Then Deletion" Method for Dealing With Missing Outcome Data.处理缺失结局数据的“多次插补,然后删除”方法的偏倚和精密度
Am J Epidemiol. 2015 Sep 15;182(6):528-34. doi: 10.1093/aje/kwv100. Epub 2015 Sep 2.
7
The proportion of missing data should not be used to guide decisions on multiple imputation.缺失数据的比例不应用于指导多重插补的决策。
J Clin Epidemiol. 2019 Jun;110:63-73. doi: 10.1016/j.jclinepi.2019.02.016. Epub 2019 Mar 13.
8
A Causal View on Bias in Missing Data Imputation: The Impact of Evil Auxiliary Variables on Norming of Test Scores.缺失数据插补中偏差的因果观点:不良辅助变量对测试分数归一化的影响。
Multivariate Behav Res. 2025 Mar-Apr;60(2):258-274. doi: 10.1080/00273171.2024.2412682. Epub 2024 Oct 20.
9
Correction of bias from non-random missing longitudinal data using auxiliary information.利用辅助信息纠正非随机缺失纵向数据的偏差。
Stat Med. 2010 Mar 15;29(6):671-9. doi: 10.1002/sim.3821.
10
Accounting for missing data in statistical analyses: multiple imputation is not always the answer.在统计分析中处理缺失数据:多重插补并不总是答案。
Int J Epidemiol. 2019 Aug 1;48(4):1294-1304. doi: 10.1093/ije/dyz032.

引用本文的文献

1
Pitfalls of imputing using incomplete auxiliary variables.使用不完整辅助变量进行插补的陷阱。
Am J Epidemiol. 2025 Jun 3;194(6):1801-1802. doi: 10.1093/aje/kwaf043.

本文引用的文献

1
Multiple imputation using auxiliary imputation variables that only predict missingness can increase bias due to data missing not at random.仅使用辅助预测缺失变量的多重插补可能会因数据缺失而增加偏差。
BMC Med Res Methodol. 2024 Oct 7;24(1):231. doi: 10.1186/s12874-024-02353-9.
2
A flexible approach to measure care coordination based on patient-sharing networks.基于患者共享网络的灵活的护理协调测量方法。
BMC Med Res Methodol. 2024 Jan 3;24(1):1. doi: 10.1186/s12874-023-02106-0.
3
Multiple imputation of missing data under missing at random: including a collider as an auxiliary variable in the imputation model can induce bias.
随机缺失情况下缺失数据的多重填补:在填补模型中纳入一个对撞机作为辅助变量会导致偏差。
Front Epidemiol. 2023 Sep 15;3:1237447. doi: 10.3389/fepid.2023.1237447.
4
The Avon Longitudinal Study of Parents and Children (ALSPAC): a 2022 update on the enrolled sample of mothers and the associated baseline data.雅芳亲子纵向研究(ALSPAC):2022年母亲入组样本及相关基线数据更新
Wellcome Open Res. 2023 Sep 6;7:283. doi: 10.12688/wellcomeopenres.18564.1. eCollection 2022.
5
Multiple imputation of missing data under missing at random: compatible imputation models are not sufficient to avoid bias if they are mis-specified.在随机缺失下对缺失数据进行多重插补:如果插补模型指定错误,即使相容的插补模型也不足以避免偏差。
J Clin Epidemiol. 2023 Aug;160:100-109. doi: 10.1016/j.jclinepi.2023.06.011. Epub 2023 Jun 19.
6
Assumptions and analysis planning in studies with missing data in multiple variables: moving beyond the MCAR/MAR/MNAR classification.多变量缺失数据研究中的假设和分析计划:超越 MCAR/MAR/MNAR 分类。
Int J Epidemiol. 2023 Aug 2;52(4):1268-1275. doi: 10.1093/ije/dyad008.
7
Immunohistochemistry localises myosin-7a to cochlear efferent boutons.免疫组织化学将肌球蛋白-7a定位到耳蜗传出终扣。
Wellcome Open Res. 2022 Feb 22;7:1. doi: 10.12688/wellcomeopenres.17428.2. eCollection 2022.
8
Bias due to Berkson error: issues when using predicted values in place of observed covariates.由于 Berkson 误差导致的偏倚:在使用预测值替代观测协变量时出现的问题。
Biostatistics. 2021 Oct 13;22(4):858-872. doi: 10.1093/biostatistics/kxaa002.
9
Accounting for missing data in statistical analyses: multiple imputation is not always the answer.在统计分析中处理缺失数据:多重插补并不总是答案。
Int J Epidemiol. 2019 Aug 1;48(4):1294-1304. doi: 10.1093/ije/dyz032.
10
The proportion of missing data should not be used to guide decisions on multiple imputation.缺失数据的比例不应用于指导多重插补的决策。
J Clin Epidemiol. 2019 Jun;110:63-73. doi: 10.1016/j.jclinepi.2019.02.016. Epub 2019 Mar 13.