• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

在观察性研究中,通过结局回归估计因果效应时,多重填补是否应该按暴露组分层?

Should multiple imputation be stratified by exposure group when estimating causal effects via outcome regression in observational studies?

机构信息

Clinical Epidemiology and Biostatistics Unit, Department of Paediatrics, University of Melbourne, Parkville, Australia.

Clinical Epidemiology and Biostatistics Unit, Murdoch Children's Research Institute, 50 Flemington Road, 3052, Parkville, Australia.

出版信息

BMC Med Res Methodol. 2023 Feb 16;23(1):42. doi: 10.1186/s12874-023-01843-6.

DOI:10.1186/s12874-023-01843-6
PMID:36797679
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC9933305/
Abstract

BACKGROUND

Despite recent advances in causal inference methods, outcome regression remains the most widely used approach for estimating causal effects in epidemiological studies with a single-point exposure and outcome. Missing data are common in these studies, and complete-case analysis (CCA) and multiple imputation (MI) are two frequently used methods for handling them. In randomised controlled trials (RCTs), it has been shown that MI should be conducted separately by treatment group. In observational studies, causal inference is now understood as the task of emulating an RCT, which raises the question of whether MI should be conducted by exposure group in such studies.

METHODS

We addressed this question by evaluating the performance of seven methods for handling missing data when estimating causal effects with outcome regression. We conducted an extensive simulation study based on an illustrative case study from the Victorian Adolescent Health Cohort Study, assessing a range of scenarios, including seven outcome generation models with exposure-confounder interactions of differing strength.

RESULTS

The simulation results showed that MI by exposure group led to the least bias when the size of the smallest exposure group was relatively large, followed by MI approaches that included the exposure-confounder interactions.

CONCLUSIONS

The findings from our simulation study, which was designed based on a real case study, suggest that current practice for the conduct of MI in causal inference may need to shift to stratifying by exposure group where feasible, or otherwise including exposure-confounder interactions in the imputation model.

摘要

背景

尽管因果推断方法最近取得了进展,但在单点暴露和结局的流行病学研究中,结局回归仍然是最广泛使用的估计因果效应的方法。这些研究中经常出现缺失数据,完全案例分析(CCA)和多重插补(MI)是两种常用的处理方法。在随机对照试验(RCT)中,已经表明 MI 应该按治疗组分别进行。在观察性研究中,因果推断现在被理解为模拟 RCT 的任务,这就提出了在这类研究中 MI 是否应该按暴露组进行的问题。

方法

我们通过评估在使用结局回归估计因果效应时处理缺失数据的七种方法的性能来解决这个问题。我们基于维多利亚青少年健康队列研究中的一个实例研究进行了广泛的模拟研究,评估了一系列场景,包括七个具有不同强度暴露-混杂因素相互作用的结局生成模型。

结果

模拟结果表明,当最小暴露组的规模相对较大时,按暴露组进行 MI 导致的偏差最小,其次是包含暴露-混杂因素相互作用的 MI 方法。

结论

我们的模拟研究结果基于实际案例研究设计,表明在因果推断中进行 MI 的当前实践可能需要根据实际情况按暴露组分层,或者在插补模型中包含暴露-混杂因素相互作用。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5d0f/9933305/3b2ba64312fe/12874_2023_1843_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5d0f/9933305/b27d87cdc82f/12874_2023_1843_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5d0f/9933305/01973394ed74/12874_2023_1843_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5d0f/9933305/d625da871f3e/12874_2023_1843_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5d0f/9933305/68ae7404d1aa/12874_2023_1843_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5d0f/9933305/3b2ba64312fe/12874_2023_1843_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5d0f/9933305/b27d87cdc82f/12874_2023_1843_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5d0f/9933305/01973394ed74/12874_2023_1843_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5d0f/9933305/d625da871f3e/12874_2023_1843_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5d0f/9933305/68ae7404d1aa/12874_2023_1843_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5d0f/9933305/3b2ba64312fe/12874_2023_1843_Fig5_HTML.jpg

相似文献

1
Should multiple imputation be stratified by exposure group when estimating causal effects via outcome regression in observational studies?在观察性研究中,通过结局回归估计因果效应时,多重填补是否应该按暴露组分层?
BMC Med Res Methodol. 2023 Feb 16;23(1):42. doi: 10.1186/s12874-023-01843-6.
2
Handling missing data when estimating causal effects with targeted maximum likelihood estimation. 采用有向极大似然估计法估计因果效应时处理缺失数据。
Am J Epidemiol. 2024 Jul 8;193(7):1019-1030. doi: 10.1093/aje/kwae012.
3
Evaluation of multiple imputation approaches for handling missing covariate information in a case-cohort study with a binary outcome.评价在二分类结局病例-对照研究中采用多种插补方法处理协变量缺失信息的效果。
BMC Med Res Methodol. 2022 Apr 3;22(1):87. doi: 10.1186/s12874-021-01495-4.
4
Recoverability and estimation of causal effects under typical multivariable missingness mechanisms.典型多变量缺失机制下因果效应的可恢复性和估计。
Biom J. 2024 Apr;66(3):e2200326. doi: 10.1002/bimj.202200326.
5
Folic acid supplementation and malaria susceptibility and severity among people taking antifolate antimalarial drugs in endemic areas.在流行地区,服用抗叶酸抗疟药物的人群中,叶酸补充剂与疟疾易感性和严重程度的关系。
Cochrane Database Syst Rev. 2022 Feb 1;2(2022):CD014217. doi: 10.1002/14651858.CD014217.
6
A comparison of methods to estimate the survivor average causal effect in the presence of missing data: a simulation study.存在缺失数据时估计幸存者平均因果效应的方法比较:一项模拟研究。
BMC Med Res Methodol. 2019 Dec 3;19(1):223. doi: 10.1186/s12874-019-0874-x.
7
Multiple imputation for patient reported outcome measures in randomised controlled trials: advantages and disadvantages of imputing at the item, subscale or composite score level.在随机对照试验中对患者报告结局测量进行多重插补:在项目、分量表或综合评分层面插补的优缺点。
BMC Med Res Methodol. 2018 Aug 28;18(1):87. doi: 10.1186/s12874-018-0542-6.
8
Gaps in the usage and reporting of multiple imputation for incomplete data: findings from a scoping review of observational studies addressing causal questions.缺失数据下多重插补使用和报告的差距:针对因果问题的观察性研究的范围综述结果。
BMC Med Res Methodol. 2024 Sep 4;24(1):193. doi: 10.1186/s12874-024-02302-6.
9
Multiple imputation of missing data under missing at random: compatible imputation models are not sufficient to avoid bias if they are mis-specified.在随机缺失下对缺失数据进行多重插补:如果插补模型指定错误,即使相容的插补模型也不足以避免偏差。
J Clin Epidemiol. 2023 Aug;160:100-109. doi: 10.1016/j.jclinepi.2023.06.011. Epub 2023 Jun 19.
10
A review of the use of controlled multiple imputation in randomised controlled trials with missing outcome data.缺失结局数据的随机对照试验中使用控制多重填补的综述。
BMC Med Res Methodol. 2021 Apr 15;21(1):72. doi: 10.1186/s12874-021-01261-6.

引用本文的文献

1
Total Hip Arthroplasty: The Influence of Different Management Structures in Germany and the Netherlands on Preoperative Expectations, Self-Efficacy, and Postoperative Satisfaction.全髋关节置换术:德国和荷兰不同管理结构对术前期望、自我效能感及术后满意度的影响
Cureus. 2025 May 14;17(5):e84088. doi: 10.7759/cureus.84088. eCollection 2025 May.
2
Periconceptional Dietary Quality and Metabolic Syndrome at 3 Years Postpartum.围孕期饮食质量与产后 3 年代谢综合征的关系。
J Am Heart Assoc. 2024 Aug 20;13(16):e035555. doi: 10.1161/JAHA.124.035555. Epub 2024 Aug 19.
3
Temporal changes in the risk of six-month post-COVID symptoms: a national population-based cohort study.

本文引用的文献

1
Recoverability and estimation of causal effects under typical multivariable missingness mechanisms.典型多变量缺失机制下因果效应的可恢复性和估计。
Biom J. 2024 Apr;66(3):e2200326. doi: 10.1002/bimj.202200326.
2
Noncollapsibility, confounding, and sparse-data bias. Part 2: What should researchers make of persistent controversies about the odds ratio?不可 collapsibility、混杂因素和稀疏数据偏差。第 2 部分:对于比值比的持续争议,研究人员应该如何看待?
J Clin Epidemiol. 2021 Nov;139:264-268. doi: 10.1016/j.jclinepi.2021.06.004. Epub 2021 Jun 11.
3
Noncollapsibility, confounding, and sparse-data bias. Part 1: The oddities of odds.
新冠后症状持续六个月风险的时间变化:一项基于全国人群的队列研究。
Am J Epidemiol. 2025 Jan 8;194(1):162-171. doi: 10.1093/aje/kwae174.
4
Long-term causal effects of far-right terrorism in New Zealand.新西兰极右翼恐怖主义的长期因果效应。
PNAS Nexus. 2023 Aug 22;2(8):pgad242. doi: 10.1093/pnasnexus/pgad242. eCollection 2023 Aug.
不可压缩性、混杂因素和稀疏数据偏差。第1部分:比值的奇特之处。
J Clin Epidemiol. 2021 Oct;138:178-181. doi: 10.1016/j.jclinepi.2021.06.007. Epub 2021 Jun 11.
4
Propensity scores using missingness pattern information: a practical guide.基于缺失模式信息的倾向评分:实用指南。
Stat Med. 2020 May 20;39(11):1641-1657. doi: 10.1002/sim.8503. Epub 2020 Feb 27.
5
A comparison of different methods to handle missing data in the context of propensity score analysis.不同方法在倾向评分分析中处理缺失数据的比较。
Eur J Epidemiol. 2019 Jan;34(1):23-36. doi: 10.1007/s10654-018-0447-z. Epub 2018 Oct 19.
6
Canonical Causal Diagrams to Guide the Treatment of Missing Data in Epidemiologic Studies.规范因果图指导流行病学研究中缺失数据的处理。
Am J Epidemiol. 2018 Dec 1;187(12):2705-2715. doi: 10.1093/aje/kwy173.
7
Propensity score analysis with partially observed covariates: How should multiple imputation be used?倾向评分分析与部分观测协变量:应如何使用多重插补?
Stat Methods Med Res. 2019 Jan;28(1):3-19. doi: 10.1177/0962280217713032. Epub 2017 Jun 2.
8
Should multiple imputation be the method of choice for handling missing data in randomized trials?在随机试验中,处理缺失数据时是否应选择多重插补法?
Stat Methods Med Res. 2018 Sep;27(9):2610-2626. doi: 10.1177/0962280216683570. Epub 2016 Dec 19.
9
Appropriate inclusion of interactions was needed to avoid bias in multiple imputation.为避免多重填补中的偏差,需要适当纳入交互作用。
J Clin Epidemiol. 2016 Dec;80:107-115. doi: 10.1016/j.jclinepi.2016.07.004. Epub 2016 Jul 19.
10
Using Big Data to Emulate a Target Trial When a Randomized Trial Is Not Available.在没有随机试验时使用大数据模拟目标试验。
Am J Epidemiol. 2016 Apr 15;183(8):758-64. doi: 10.1093/aje/kwv254. Epub 2016 Mar 18.