• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

G 计算和机器学习在估计二项暴露状态对二项结果的因果效应中的应用。

G-computation and machine learning for estimating the causal effects of binary exposure statuses on binary outcomes.

机构信息

INSERM UMR 1246 - SPHERE, Nantes University, Tours University, 22 Boulevard Bénoni Goullin, 44200, Nantes, France.

IDBC-A2COM, Pacé, France.

出版信息

Sci Rep. 2021 Jan 14;11(1):1435. doi: 10.1038/s41598-021-81110-0.

DOI:10.1038/s41598-021-81110-0
PMID:33446866
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC7809122/
Abstract

In clinical research, there is a growing interest in the use of propensity score-based methods to estimate causal effects. G-computation is an alternative because of its high statistical power. Machine learning is also increasingly used because of its possible robustness to model misspecification. In this paper, we aimed to propose an approach that combines machine learning and G-computation when both the outcome and the exposure status are binary and is able to deal with small samples. We evaluated the performances of several methods, including penalized logistic regressions, a neural network, a support vector machine, boosted classification and regression trees, and a super learner through simulations. We proposed six different scenarios characterised by various sample sizes, numbers of covariates and relationships between covariates, exposure statuses, and outcomes. We have also illustrated the application of these methods, in which they were used to estimate the efficacy of barbiturates prescribed during the first 24 h of an episode of intracranial hypertension. In the context of GC, for estimating the individual outcome probabilities in two counterfactual worlds, we reported that the super learner tended to outperform the other approaches in terms of both bias and variance, especially for small sample sizes. The support vector machine performed well, but its mean bias was slightly higher than that of the super learner. In the investigated scenarios, G-computation associated with the super learner was a performant method for drawing causal inferences, even from small sample sizes.

摘要

在临床研究中,人们越来越感兴趣地使用倾向评分匹配方法来估计因果效应。由于其具有较高的统计功效,G 计算是一种替代方法。由于其可能对模型误设具有稳健性,机器学习也越来越多地被使用。在本文中,我们旨在提出一种方法,当结局和暴露状态都是二分类时,将机器学习和 G 计算结合起来,并且能够处理小样本。我们通过模拟评估了几种方法的性能,包括惩罚逻辑回归、神经网络、支持向量机、提升分类和回归树以及超级学习者。我们提出了六个不同的场景,这些场景具有不同的样本量、协变量的数量和协变量之间、暴露状态和结局之间的关系。我们还说明了这些方法的应用,这些方法用于估计颅内高压发作的 24 小时内开处方的巴比妥类药物的疗效。在 GC 的背景下,为了估计两个反事实世界中的个体结局概率,我们报告超级学习者在偏差和方差方面往往优于其他方法,特别是对于小样本量。支持向量机表现良好,但平均偏差略高于超级学习者。在所研究的场景中,即使在小样本量的情况下,与超级学习者结合的 G 计算也是一种用于进行因果推断的有效方法。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9a89/7809122/29baf176633e/41598_2021_81110_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9a89/7809122/620eb67f0011/41598_2021_81110_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9a89/7809122/3eaeff64dafd/41598_2021_81110_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9a89/7809122/5089472be92f/41598_2021_81110_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9a89/7809122/62acd7af847e/41598_2021_81110_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9a89/7809122/29baf176633e/41598_2021_81110_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9a89/7809122/620eb67f0011/41598_2021_81110_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9a89/7809122/3eaeff64dafd/41598_2021_81110_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9a89/7809122/5089472be92f/41598_2021_81110_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9a89/7809122/62acd7af847e/41598_2021_81110_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9a89/7809122/29baf176633e/41598_2021_81110_Fig5_HTML.jpg

相似文献

1
G-computation and machine learning for estimating the causal effects of binary exposure statuses on binary outcomes.G 计算和机器学习在估计二项暴露状态对二项结果的因果效应中的应用。
Sci Rep. 2021 Jan 14;11(1):1435. doi: 10.1038/s41598-021-81110-0.
2
External control arm analysis: an evaluation of propensity score approaches, G-computation, and doubly debiased machine learning.外部控制臂分析:倾向评分方法、G-计算和双重无偏机器学习的评估。
BMC Med Res Methodol. 2022 Dec 28;22(1):335. doi: 10.1186/s12874-022-01799-z.
3
The Balance Super Learner: A robust adaptation of the Super Learner to improve estimation of the average treatment effect in the treated based on propensity score matching.平衡超级学习者:超级学习者的稳健自适应方法,可提高基于倾向评分匹配的处理组平均处理效应估计的稳健性。
Stat Methods Med Res. 2018 Aug;27(8):2504-2518. doi: 10.1177/0962280216682055. Epub 2016 Dec 15.
4
Targeted Maximum Likelihood Estimation for Causal Inference in Observational Studies.观察性研究中因果推断的靶向最大似然估计
Am J Epidemiol. 2017 Jan 1;185(1):65-73. doi: 10.1093/aje/kww165. Epub 2016 Dec 9.
5
Should a propensity score model be super? The utility of ensemble procedures for causal adjustment.应该使用倾向性评分模型吗?集成方法在因果调整中的效用。
Stat Med. 2019 Apr 30;38(9):1690-1702. doi: 10.1002/sim.8075. Epub 2018 Dec 26.
6
Parametric and nonparametric propensity score estimation in multilevel observational studies.多层次观察性研究中的参数和非参数倾向评分估计。
Stat Med. 2023 Oct 15;42(23):4147-4176. doi: 10.1002/sim.9852. Epub 2023 Aug 2.
7
Reflection on modern methods: when worlds collide-prediction, machine learning and causal inference.反思现代方法:当世界碰撞——预测、机器学习和因果推断。
Int J Epidemiol. 2021 Jan 23;49(6):2058-2064. doi: 10.1093/ije/dyz132.
8
Covariate adjustment and estimation of difference in proportions in randomized clinical trials.随机临床试验中协变量调整及比例差异估计
Pharm Stat. 2024 Nov-Dec;23(6):884-905. doi: 10.1002/pst.2397. Epub 2024 May 19.
9
Understanding and diagnosing the potential for bias when using machine learning methods with doubly robust causal estimators.理解和诊断在使用具有双重稳健因果估计器的机器学习方法时潜在的偏差。
Stat Methods Med Res. 2019 Jun;28(6):1637-1650. doi: 10.1177/0962280218772065. Epub 2018 May 2.
10
Machine learning versus traditional risk stratification methods in acute coronary syndrome: a pooled randomized clinical trial analysis.机器学习与传统风险分层方法在急性冠状动脉综合征中的比较:一项汇总随机临床试验分析。
J Thromb Thrombolysis. 2020 Jan;49(1):1-9. doi: 10.1007/s11239-019-01940-8.

引用本文的文献

1
Machine learning prediction models for mode of delivery in prolonged pregnancies in Sweden.瑞典过期妊娠分娩方式的机器学习预测模型
Sci Rep. 2025 Sep 12;15(1):32487. doi: 10.1038/s41598-025-19198-x.
2
Causal estimation of time-varying treatments in observational studies: a scoping review of methods, applications, and missing data practices.观察性研究中时变治疗的因果估计:方法、应用及缺失数据处理的范围综述
BMC Med Res Methodol. 2025 Aug 27;25(1):202. doi: 10.1186/s12874-025-02633-y.
3
How to select predictive models for decision-making or causal inference.

本文引用的文献

1
G-computation, propensity score-based methods, and targeted maximum likelihood estimator for causal inference with different covariates sets: a comparative simulation study.基于 G-计算、倾向评分匹配方法和有不同协变量集的因果推断的目标极大似然估计器:一项比较模拟研究。
Sci Rep. 2020 Jun 8;10(1):9219. doi: 10.1038/s41598-020-65917-x.
2
Machine learning in the estimation of causal effects: targeted minimum loss-based estimation and double/debiased machine learning.机器学习在因果效应估计中的应用:基于有向最小损失的估计和双重/无偏机器学习。
Biostatistics. 2020 Apr 1;21(2):353-358. doi: 10.1093/biostatistics/kxz042.
3
On the relationship of machine learning with causal inference.
如何选择用于决策或因果推断的预测模型。
Gigascience. 2025 Jan 6;14. doi: 10.1093/gigascience/giaf016.
4
Serum metabolomic signatures of fatty acid oxidation defects differentiate host-response subphenotypes of acute respiratory distress syndrome.血清代谢组学特征可区分急性呼吸窘迫综合征宿主反应亚表型的脂肪酸氧化缺陷。
Respir Res. 2023 May 20;24(1):136. doi: 10.1186/s12931-023-02447-w.
5
Machine Learning for Causal Inference in Biological Networks: Perspectives of This Challenge.用于生物网络因果推断的机器学习:这一挑战的视角
Front Bioinform. 2021 Sep 22;1:746712. doi: 10.3389/fbinf.2021.746712. eCollection 2021.
论机器学习与因果推断的关系。
Eur J Epidemiol. 2020 Feb;35(2):183-185. doi: 10.1007/s10654-019-00564-9. Epub 2019 Sep 27.
4
What is Machine Learning? A Primer for the Epidemiologist.什么是机器学习?流行病学人员入门指南。
Am J Epidemiol. 2019 Dec 31;188(12):2222-2239. doi: 10.1093/aje/kwz189.
5
Reflection on modern methods: when worlds collide-prediction, machine learning and causal inference.反思现代方法:当世界碰撞——预测、机器学习和因果推断。
Int J Epidemiol. 2021 Jan 23;49(6):2058-2064. doi: 10.1093/ije/dyz132.
6
A comparison of machine learning algorithms and covariate balance measures for propensity score matching and weighting.用于倾向得分匹配和加权的机器学习算法与协变量平衡度量的比较
Biom J. 2019 Jul;61(4):1049-1072. doi: 10.1002/bimj.201800132. Epub 2019 May 14.
7
Principles of confounder selection.混杂因素选择原则。
Eur J Epidemiol. 2019 Mar;34(3):211-219. doi: 10.1007/s10654-019-00494-6. Epub 2019 Mar 6.
8
You are smarter than you think: (super) machine learning in context.你比你想象的更聪明:语境中的(超级)机器学习。
Eur J Epidemiol. 2018 May;33(5):437-440. doi: 10.1007/s10654-018-0405-9. Epub 2018 May 9.
9
Stacked generalization: an introduction to super learning.堆叠泛化:超级学习导论。
Eur J Epidemiol. 2018 May;33(5):459-464. doi: 10.1007/s10654-018-0390-z. Epub 2018 Apr 10.
10
RESOLVING AN APPARENT PARADOX IN DOUBLY ROBUST ESTIMATORS.解决双重稳健估计量中的一个明显悖论。
Am J Epidemiol. 2018 Apr 1;187(4):891-892. doi: 10.1093/aje/kwx385.