• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

利用不完全队列数据进行因果发现的多重插补和逐个检验删除。

Multiple imputation and test-wise deletion for causal discovery with incomplete cohort data.

机构信息

Leibniz Institute for Prevention Research and Epidemiology - BIPS, Bremen, Germany.

Faculty of Mathematics and Computer Science, University of Bremen, Bremen, Germany.

出版信息

Stat Med. 2022 Oct 15;41(23):4716-4743. doi: 10.1002/sim.9535. Epub 2022 Jul 31.

DOI:10.1002/sim.9535
PMID:35908775
Abstract

Causal discovery algorithms estimate causal graphs from observational data. This can provide a valuable complement to analyses focusing on the causal relation between individual treatment-outcome pairs. Constraint-based causal discovery algorithms rely on conditional independence testing when building the graph. Until recently, these algorithms have been unable to handle missing values. In this article, we investigate two alternative solutions: test-wise deletion and multiple imputation. We establish necessary and sufficient conditions for the recoverability of causal structures under test-wise deletion, and argue that multiple imputation is more challenging in the context of causal discovery than for estimation. We conduct an extensive comparison by simulating from benchmark causal graphs: as one might expect, we find that test-wise deletion and multiple imputation both clearly outperform list-wise deletion and single imputation. Crucially, our results further suggest that multiple imputation is especially useful in settings with a small number of either Gaussian or discrete variables, but when the dataset contains a mix of both neither method is uniformly best. The methods we compare include random forest imputation and a hybrid procedure combining test-wise deletion and multiple imputation. An application to data from the IDEFICS cohort study on diet- and lifestyle-related diseases in European children serves as an illustrating example.

摘要

因果发现算法可以根据观测数据估计因果图。这可以为专注于个体治疗效果对之间因果关系的分析提供有价值的补充。基于约束的因果发现算法在构建图时依赖于条件独立性检验。直到最近,这些算法还无法处理缺失值。在本文中,我们研究了两种替代解决方案:逐个测试删除和多重插补。我们为逐个测试删除下因果结构的可恢复性建立了必要和充分条件,并认为在因果发现的背景下,多重插补比估计更为复杂。我们通过从基准因果图进行模拟进行了广泛的比较:正如人们所料,我们发现逐个测试删除和多重插补都明显优于逐个删除和单重插补。至关重要的是,我们的结果进一步表明,在存在少量正态或离散变量的情况下,多重插补特别有用,但当数据集同时包含两者时,两种方法都不是统一的最佳选择。我们比较的方法包括随机森林插补和一种结合逐个测试删除和多重插补的混合程序。对来自欧洲儿童饮食和生活方式相关疾病的 IDEFICS 队列研究数据的应用作为一个说明性示例。

相似文献

1
Multiple imputation and test-wise deletion for causal discovery with incomplete cohort data.利用不完全队列数据进行因果发现的多重插补和逐个检验删除。
Stat Med. 2022 Oct 15;41(23):4716-4743. doi: 10.1002/sim.9535. Epub 2022 Jul 31.
2
Fast Causal Inference with Non-Random Missingness by Test-Wise Deletion.通过逐测试删除法对非随机缺失值进行快速因果推断
Int J Data Sci Anal. 2018 Aug;6(1):47-62. doi: 10.1007/s41060-017-0094-6. Epub 2018 Jan 19.
3
Generalizing treatment effects with incomplete covariates: Identifying assumptions and multiple imputation algorithms.用不完全协变量推广治疗效果:确定假设和多重插补算法。
Biom J. 2023 Jun;65(5):e2100294. doi: 10.1002/bimj.202100294. Epub 2023 Mar 12.
4
Gaps in the usage and reporting of multiple imputation for incomplete data: findings from a scoping review of observational studies addressing causal questions.缺失数据下多重插补使用和报告的差距:针对因果问题的观察性研究的范围综述结果。
BMC Med Res Methodol. 2024 Sep 4;24(1):193. doi: 10.1186/s12874-024-02302-6.
5
An algorithm for direct causal learning of influences on patient outcomes.一种用于直接因果学习对患者预后影响的算法。
Artif Intell Med. 2017 Jan;75:1-15. doi: 10.1016/j.artmed.2016.10.003. Epub 2016 Nov 5.
6
Developing a novel causal inference algorithm for personalized biomedical causal graph learning using meta machine learning.利用元机器学习开发个性化生物医学因果图学习的新因果推理算法。
BMC Med Inform Decis Mak. 2024 May 27;24(1):137. doi: 10.1186/s12911-024-02510-6.
7
A longitudinal causal graph analysis investigating modifiable risk factors and obesity in a European cohort of children and adolescents.一项纵向因果图分析研究了欧洲儿童和青少年队列中可改变的风险因素与肥胖之间的关系。
Sci Rep. 2024 Mar 21;14(1):6822. doi: 10.1038/s41598-024-56721-y.
8
Recoverability and estimation of causal effects under typical multivariable missingness mechanisms.典型多变量缺失机制下因果效应的可恢复性和估计。
Biom J. 2024 Apr;66(3):e2200326. doi: 10.1002/bimj.202200326.
9
A comparison of different methods to handle missing data in the context of propensity score analysis.不同方法在倾向评分分析中处理缺失数据的比较。
Eur J Epidemiol. 2019 Jan;34(1):23-36. doi: 10.1007/s10654-018-0447-z. Epub 2018 Oct 19.
10
Imputation approaches for potential outcomes in causal inference.因果推断中潜在结果的插补方法。
Int J Epidemiol. 2015 Oct;44(5):1731-7. doi: 10.1093/ije/dyv135. Epub 2015 Jul 25.

引用本文的文献

1
Software application profile: tpc and micd-R packages for causal discovery with incomplete cohort data.软件应用程序简介:用于不完全队列数据因果发现的 tpc 和 micd-R 包。
Int J Epidemiol. 2024 Aug 14;53(5). doi: 10.1093/ije/dyae113.
2
A longitudinal causal graph analysis investigating modifiable risk factors and obesity in a European cohort of children and adolescents.一项纵向因果图分析研究了欧洲儿童和青少年队列中可改变的风险因素与肥胖之间的关系。
Sci Rep. 2024 Mar 21;14(1):6822. doi: 10.1038/s41598-024-56721-y.
3
Constructing Causal Life-Course Models: Comparative Study of Data-Driven and Theory-Driven Approaches.
构建因果生命历程模型:数据驱动与理论驱动方法的比较研究。
Am J Epidemiol. 2023 Nov 3;192(11):1917-1927. doi: 10.1093/aje/kwad144.