• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

利用辅助信息纠正非随机缺失纵向数据的偏差。

Correction of bias from non-random missing longitudinal data using auxiliary information.

机构信息

Albert Einstein College of Medicine, 1300 Morris Park Ave, Bronx, NY 10461, USA.

出版信息

Stat Med. 2010 Mar 15;29(6):671-9. doi: 10.1002/sim.3821.

DOI:10.1002/sim.3821
PMID:20029935
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC4162134/
Abstract

Missing data are common in longitudinal studies due to drop-out, loss to follow-up, and death. Likelihood-based mixed effects models for longitudinal data give valid estimates when the data are missing at random (MAR). These assumptions, however, are not testable without further information. In some studies, there is additional information available in the form of an auxiliary variable known to be correlated with the missing outcome of interest. Availability of such auxiliary information provides us with an opportunity to test the MAR assumption. If the MAR assumption is violated, such information can be utilized to reduce or eliminate bias when the missing data process depends on the unobserved outcome through the auxiliary information. We compare two methods of utilizing the auxiliary information: joint modeling of the outcome of interest and the auxiliary variable, and multiple imputation (MI). Simulation studies are performed to examine the two methods. The likelihood-based joint modeling approach is consistent and most efficient when correctly specified. However, mis-specification of the joint distribution can lead to biased results. MI is slightly less efficient than a correct joint modeling approach and can also be biased when the imputation model is mis-specified, though it is more robust to mis-specification of the imputation distribution when all the variables affecting the missing data mechanism and the missing outcome are included in the imputation model. An example is presented from a dementia screening study.

摘要

由于脱落、失访和死亡,缺失数据在纵向研究中很常见。当数据是随机缺失(MAR)时,基于似然的混合效应模型可以对纵向数据进行有效估计。然而,在没有进一步信息的情况下,这些假设是无法检验的。在某些研究中,以辅助变量的形式提供了额外的信息,这些辅助变量与缺失的感兴趣结局相关。这种辅助信息的可用性为我们提供了一个检验 MAR 假设的机会。如果 MAR 假设被违反,那么当缺失数据过程通过辅助信息依赖于未观察到的结局时,这种信息可以被用来减少或消除偏差。我们比较了利用辅助信息的两种方法:感兴趣结局和辅助变量的联合建模,以及多重插补(MI)。进行了模拟研究来检验这两种方法。基于似然的联合建模方法在正确指定时是一致和最有效的。然而,联合分布的错误指定可能会导致有偏的结果。MI 比正确的联合建模方法效率略低,并且当插补模型错误指定时也可能有偏,尽管当所有影响缺失数据机制和缺失结局的变量都包含在插补模型中时,它对插补分布的错误指定更稳健。从一个痴呆症筛查研究中给出了一个示例。

相似文献

1
Correction of bias from non-random missing longitudinal data using auxiliary information.利用辅助信息纠正非随机缺失纵向数据的偏差。
Stat Med. 2010 Mar 15;29(6):671-9. doi: 10.1002/sim.3821.
2
Multiple imputation using auxiliary imputation variables that only predict missingness can increase bias due to data missing not at random.仅使用辅助预测缺失变量的多重插补可能会因数据缺失而增加偏差。
BMC Med Res Methodol. 2024 Oct 7;24(1):231. doi: 10.1186/s12874-024-02353-9.
3
Multiple imputation of missing data under missing at random: compatible imputation models are not sufficient to avoid bias if they are mis-specified.在随机缺失下对缺失数据进行多重插补:如果插补模型指定错误,即使相容的插补模型也不足以避免偏差。
J Clin Epidemiol. 2023 Aug;160:100-109. doi: 10.1016/j.jclinepi.2023.06.011. Epub 2023 Jun 19.
4
Does pattern mixture modelling reduce bias due to informative attrition compared to fitting a mixed effects model to the available cases or data imputed using multiple imputation?: a simulation study.与对可用病例或使用多重插补法插补的数据拟合混合效应模型相比,模式混合建模是否会减少因信息性缺失而产生的偏差?一项模拟研究。
BMC Med Res Methodol. 2018 Aug 29;18(1):89. doi: 10.1186/s12874-018-0548-0.
5
Analyses Using Multiple Imputation Need to Consider Missing Data in Auxiliary Variables.使用多重填补法进行分析时需要考虑辅助变量中的缺失数据。
Am J Epidemiol. 2024 Aug 27. doi: 10.1093/aje/kwae306.
6
Multiple imputation of missing data under missing at random: including a collider as an auxiliary variable in the imputation model can induce bias.随机缺失情况下缺失数据的多重填补:在填补模型中纳入一个对撞机作为辅助变量会导致偏差。
Front Epidemiol. 2023 Sep 15;3:1237447. doi: 10.3389/fepid.2023.1237447.
7
A comparison of multiple imputation methods for handling missing values in longitudinal data in the presence of a time-varying covariate with a non-linear association with time: a simulation study.存在与时间呈非线性关联的时变协变量时,用于处理纵向数据中缺失值的多种多重填补方法的比较:一项模拟研究。
BMC Med Res Methodol. 2017 Jul 25;17(1):114. doi: 10.1186/s12874-017-0372-y.
8
Multiple imputation using linked proxy outcome data resulted in important bias reduction and efficiency gains: a simulation study.使用关联替代结局数据的多重填补可显著减少偏差并提高效率:一项模拟研究。
Emerg Themes Epidemiol. 2017 Dec 19;14:14. doi: 10.1186/s12982-017-0068-0. eCollection 2017.
9
Joint Longitudinal Models for Dealing With Missing at Random Data in Trial-Based Economic Evaluations.基于试验的经济评估中处理随机缺失数据的联合纵向模型。
Value Health. 2021 May;24(5):699-706. doi: 10.1016/j.jval.2020.11.018. Epub 2021 Feb 18.
10
Multiple imputation for longitudinal data using Bayesian lasso imputation model.基于贝叶斯套索插补模型的纵向数据多重插补方法
Stat Med. 2022 Mar 15;41(6):1042-1058. doi: 10.1002/sim.9315. Epub 2022 Jan 21.

引用本文的文献

1
Impact of missing data on bias and precision when estimating change in patient-reported outcomes from a clinical registry.从临床注册研究中估计患者报告结局变化时缺失数据对偏差和精度的影响。
Health Qual Life Outcomes. 2019 Jun 20;17(1):106. doi: 10.1186/s12955-019-1181-2.
2
Does pattern mixture modelling reduce bias due to informative attrition compared to fitting a mixed effects model to the available cases or data imputed using multiple imputation?: a simulation study.与对可用病例或使用多重插补法插补的数据拟合混合效应模型相比,模式混合建模是否会减少因信息性缺失而产生的偏差?一项模拟研究。
BMC Med Res Methodol. 2018 Aug 29;18(1):89. doi: 10.1186/s12874-018-0548-0.
3
Multiple imputation using linked proxy outcome data resulted in important bias reduction and efficiency gains: a simulation study.使用关联替代结局数据的多重填补可显著减少偏差并提高效率:一项模拟研究。
Emerg Themes Epidemiol. 2017 Dec 19;14:14. doi: 10.1186/s12982-017-0068-0. eCollection 2017.
4
Should multiple imputation be the method of choice for handling missing data in randomized trials?在随机试验中,处理缺失数据时是否应选择多重插补法?
Stat Methods Med Res. 2018 Sep;27(9):2610-2626. doi: 10.1177/0962280216683570. Epub 2016 Dec 19.
5
Correcting Bias Caused by Missing Data in the Estimate of the Effect of Apolipoprotein ε4 on Cognitive Decline.校正载脂蛋白ε4对认知衰退影响估计中因数据缺失导致的偏差。
J Int Neuropsychol Soc. 2015 Jan;21(1):85-90. doi: 10.1017/S1355617714000952. Epub 2014 Nov 12.
6
Non-ignorable loss to follow-up: correcting mortality estimates based on additional outcome ascertainment.不可忽视的失访:基于额外结局确定来校正死亡率估计值。
Stat Med. 2014 Jan 15;33(1):129-42. doi: 10.1002/sim.5912. Epub 2013 Jul 22.
7
Brain MRI markers and dropout in a longitudinal study of cognitive aging: the Three-City Dijon Study.脑 MRI 标志物与认知老化纵向研究中的脱落:第戎三城研究。
Neurology. 2012 Sep 25;79(13):1340-8. doi: 10.1212/WNL.0b013e31826cd62a. Epub 2012 Sep 12.

本文引用的文献

1
Primary care screen for early dementia.早期痴呆症的初级保健筛查
J Am Geriatr Soc. 2008 Feb;56(2):206-13. doi: 10.1111/j.1532-5415.2007.01553.x. Epub 2007 Dec 27.
2
Multiple imputation of discrete and continuous data by fully conditional specification.通过完全条件设定对离散和连续数据进行多重填补
Stat Methods Med Res. 2007 Jun;16(3):219-42. doi: 10.1177/0962280206074463.
3
A comparison of inclusive and restrictive strategies in modern missing data procedures.现代缺失数据处理中包容性策略与限制性策略的比较。
Psychol Methods. 2001 Dec;6(4):330-51.
4
Multiple imputation and posterior simulation for multivariate missing data in longitudinal studies.纵向研究中多元缺失数据的多重填补与后验模拟
Biometrics. 2000 Dec;56(4):1157-63. doi: 10.1111/j.0006-341x.2000.01157.x.
5
The Clinical Dementia Rating (CDR): current version and scoring rules.临床痴呆评定量表(CDR):当前版本及评分规则。
Neurology. 1993 Nov;43(11):2412-4. doi: 10.1212/wnl.43.11.2412-a.
6
Random-effects models for longitudinal data.纵向数据的随机效应模型。
Biometrics. 1982 Dec;38(4):963-74.