• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

采用有向极大似然估计法估计因果效应时处理缺失数据。

Handling missing data when estimating causal effects with targeted maximum likelihood estimation.

出版信息

Am J Epidemiol. 2024 Jul 8;193(7):1019-1030. doi: 10.1093/aje/kwae012.

DOI:10.1093/aje/kwae012
PMID:38400653
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11228874/
Abstract

Targeted maximum likelihood estimation (TMLE) is increasingly used for doubly robust causal inference, but how missing data should be handled when using TMLE with data-adaptive approaches is unclear. Based on data (1992-1998) from the Victorian Adolescent Health Cohort Study, we conducted a simulation study to evaluate 8 missing-data methods in this context: complete-case analysis, extended TMLE incorporating an outcome-missingness model, the missing covariate missing indicator method, and 5 multiple imputation (MI) approaches using parametric or machine-learning models. We considered 6 scenarios that varied in terms of exposure/outcome generation models (presence of confounder-confounder interactions) and missingness mechanisms (whether outcome influenced missingness in other variables and presence of interaction/nonlinear terms in missingness models). Complete-case analysis and extended TMLE had small biases when outcome did not influence missingness in other variables. Parametric MI without interactions had large bias when exposure/outcome generation models included interactions. Parametric MI including interactions performed best in bias and variance reduction across all settings, except when missingness models included a nonlinear term. When choosing a method for handling missing data in the context of TMLE, researchers must consider the missingness mechanism and, for MI, compatibility with the analysis method. In many settings, a parametric MI approach that incorporates interactions and nonlinearities is expected to perform well.

摘要

目标最大似然估计(TMLE)越来越多地用于双重稳健因果推断,但在使用数据自适应方法进行 TMLE 时,如何处理缺失数据尚不清楚。基于维多利亚青少年健康队列研究(1992-1998 年)的数据,我们进行了一项模拟研究,以评估 8 种在这种情况下的缺失数据方法:完全案例分析、纳入结果缺失模型的扩展 TMLE、缺失协变量缺失指示符方法以及使用参数或机器学习模型的 5 种多重插补(MI)方法。我们考虑了 6 种情况,这些情况在暴露/结果生成模型(混杂因素-混杂因素相互作用的存在)和缺失机制(结果是否影响其他变量的缺失以及缺失模型中是否存在相互作用/非线性项)方面有所不同。当结果不影响其他变量的缺失时,完全案例分析和扩展 TMLE 的偏差较小。当暴露/结果生成模型包括相互作用时,没有相互作用的参数 MI 存在较大的偏差。参数 MI 包括相互作用,除了缺失模型包括非线性项外,在所有情况下都能在偏差和方差减少方面表现最佳。在选择 TMLE 背景下处理缺失数据的方法时,研究人员必须考虑缺失机制,对于 MI,还必须考虑与分析方法的兼容性。在许多情况下,预计包含相互作用和非线性的参数 MI 方法将表现良好。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2356/11228874/8ce6f788150e/kwae012f5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2356/11228874/3a8a1d0fb277/kwae012f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2356/11228874/f0af6f55050c/kwae012f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2356/11228874/cb8afe297851/kwae012f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2356/11228874/39141d7069ea/kwae012f4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2356/11228874/8ce6f788150e/kwae012f5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2356/11228874/3a8a1d0fb277/kwae012f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2356/11228874/f0af6f55050c/kwae012f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2356/11228874/cb8afe297851/kwae012f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2356/11228874/39141d7069ea/kwae012f4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2356/11228874/8ce6f788150e/kwae012f5.jpg

相似文献

1
Handling missing data when estimating causal effects with targeted maximum likelihood estimation. 采用有向极大似然估计法估计因果效应时处理缺失数据。
Am J Epidemiol. 2024 Jul 8;193(7):1019-1030. doi: 10.1093/aje/kwae012.
2
Recoverability and estimation of causal effects under typical multivariable missingness mechanisms.典型多变量缺失机制下因果效应的可恢复性和估计。
Biom J. 2024 Apr;66(3):e2200326. doi: 10.1002/bimj.202200326.
3
Targeted Maximum Likelihood Estimation for Causal Inference in Observational Studies.观察性研究中因果推断的靶向最大似然估计
Am J Epidemiol. 2017 Jan 1;185(1):65-73. doi: 10.1093/aje/kww165. Epub 2016 Dec 9.
4
Comparison of methods for handling covariate missingness in propensity score estimation with a binary exposure.比较处理二分类暴露因素倾向性评分估计中协变量缺失的方法。
BMC Med Res Methodol. 2020 Jun 26;20(1):168. doi: 10.1186/s12874-020-01053-4.
5
Collaborative double robust targeted maximum likelihood estimation.协作双稳健靶向最大似然估计
Int J Biostat. 2010 May 17;6(1):Article 17. doi: 10.2202/1557-4679.1181.
6
Multiple imputation with missing indicators as proxies for unmeasured variables: simulation study.缺失指标的多重插补作为未测量变量的代理:模拟研究。
BMC Med Res Methodol. 2020 Jul 8;20(1):185. doi: 10.1186/s12874-020-01068-x.
7
Evaluation of multiple imputation approaches for handling missing covariate information in a case-cohort study with a binary outcome.评价在二分类结局病例-对照研究中采用多种插补方法处理协变量缺失信息的效果。
BMC Med Res Methodol. 2022 Apr 3;22(1):87. doi: 10.1186/s12874-021-01495-4.
8
On the use of multiple imputation to address data missing by design as well as unintended missing data in case-cohort studies with a binary endpoint.关于在以二分类结局为研究终点的病例-队列研究中,针对设计缺失和非故意缺失数据,采用多重填补方法进行处理。
BMC Med Res Methodol. 2023 Dec 7;23(1):287. doi: 10.1186/s12874-023-02090-5.
9
Multiple imputation using auxiliary imputation variables that only predict missingness can increase bias due to data missing not at random.仅使用辅助预测缺失变量的多重插补可能会因数据缺失而增加偏差。
BMC Med Res Methodol. 2024 Oct 7;24(1):231. doi: 10.1186/s12874-024-02353-9.
10
Should multiple imputation be stratified by exposure group when estimating causal effects via outcome regression in observational studies?在观察性研究中,通过结局回归估计因果效应时,多重填补是否应该按暴露组分层?
BMC Med Res Methodol. 2023 Feb 16;23(1):42. doi: 10.1186/s12874-023-01843-6.

引用本文的文献

1
Causal estimation of time-varying treatments in observational studies: a scoping review of methods, applications, and missing data practices.观察性研究中时变治疗的因果估计:方法、应用及缺失数据处理的范围综述
BMC Med Res Methodol. 2025 Aug 27;25(1):202. doi: 10.1186/s12874-025-02633-y.
2
Antidopaminergic Medications and Clinical Changes in Measures of Huntington's Disease: A Causal Analysis.抗多巴胺能药物与亨廷顿病测量指标的临床变化:因果分析
Mov Disord. 2025 May;40(5):928-937. doi: 10.1002/mds.30164. Epub 2025 Mar 18.
3
Psychological distress in adolescence and later economic and health outcomes in the United States population: A retrospective and modeling study.

本文引用的文献

1
Assumptions and analysis planning in studies with missing data in multiple variables: moving beyond the MCAR/MAR/MNAR classification.多变量缺失数据研究中的假设和分析计划:超越 MCAR/MAR/MNAR 分类。
Int J Epidemiol. 2023 Aug 2;52(4):1268-1275. doi: 10.1093/ije/dyad008.
2
Challenges in Obtaining Valid Causal Effect Estimates with Machine Learning Algorithms.使用机器学习算法获取有效因果效应估计值面临的挑战。
Am J Epidemiol. 2023 Sep 1;192(9). doi: 10.1093/aje/kwab201. Epub 2021 Jul 15.
3
Demystifying Statistical Inference When Using Machine Learning in Causal Research.
美国人群中青少年时期的心理困扰及其对后期经济和健康状况的影响:一项回顾性和建模研究。
PLoS Med. 2025 Jan 16;22(1):e1004506. doi: 10.1371/journal.pmed.1004506. eCollection 2025 Jan.
4
Using machine-learning methods to identify early-life predictors of 11-year language outcome.使用机器学习方法识别 11 岁语言预后的早期生命预测指标。
J Child Psychol Psychiatry. 2023 Aug;64(8):1242-1252. doi: 10.1111/jcpp.13733. Epub 2022 Dec 7.
5
Emulating a target trial of intensive nurse home visiting in the policy-relevant population using linked administrative data.利用关联的行政数据模拟强化家庭访视的目标试验在政策相关人群中的应用。
Int J Epidemiol. 2023 Feb 8;52(1):119-131. doi: 10.1093/ije/dyac092.
在因果研究中使用机器学习时揭开统计推断的神秘面纱。
Am J Epidemiol. 2021 Jul 15;192(9):1545-9. doi: 10.1093/aje/kwab200.
4
Framework for the treatment and reporting of missing data in observational studies: The Treatment And Reporting of Missing data in Observational Studies framework.观察性研究中缺失数据的处理和报告框架:观察性研究中缺失数据的处理和报告框架。
J Clin Epidemiol. 2021 Jun;134:79-88. doi: 10.1016/j.jclinepi.2021.01.008. Epub 2021 Feb 2.
5
Infection risk in sarcoidosis patients treated with methotrexate compared to azathioprine: A retrospective 'target trial' emulated with Swedish real-world data.与硫唑嘌呤相比,接受甲氨蝶呤治疗的类肉瘤病患者的感染风险:一项使用瑞典真实世界数据进行回顾性“目标试验”模拟研究。
Respirology. 2021 May;26(5):452-460. doi: 10.1111/resp.14001. Epub 2021 Jan 4.
6
Exercise During the First Trimester of Pregnancy and the Risks of Abnormal Screening and Gestational Diabetes Mellitus.孕期第一 trimester 运动与异常筛查和妊娠期糖尿病风险的关系。
Diabetes Care. 2021 Feb;44(2):425-432. doi: 10.2337/dc20-1475. Epub 2020 Dec 21.
7
Bootstrap inference for multiple imputation under uncongeniality and misspecification.在不相容性和错误设定下多重填补的自助法推断
Stat Methods Med Res. 2020 Dec;29(12):3533-3546. doi: 10.1177/0962280220932189. Epub 2020 Jun 30.
8
The Impact of Delayed Switch to Second-Line Antiretroviral Therapy on Mortality, Depending on Definition of Failure Time and CD4 Count at Failure.根据失败时间和失败时的 CD4 计数定义,延迟转换为二线抗逆转录病毒治疗对死亡率的影响。
Am J Epidemiol. 2020 Aug 1;189(8):811-819. doi: 10.1093/aje/kwaa049.
9
Propensity scores using missingness pattern information: a practical guide.基于缺失模式信息的倾向评分:实用指南。
Stat Med. 2020 May 20;39(11):1641-1657. doi: 10.1002/sim.8503. Epub 2020 Feb 27.
10
Estimating treatment effects with partially observed covariates using outcome regression with missing indicators.使用带有缺失指示符的结果回归估计部分观测协变量的治疗效果。
Biom J. 2020 Mar;62(2):428-443. doi: 10.1002/bimj.201900041. Epub 2020 Jan 29.