• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

样本观察到的效应:枚举、随机化和泛化。

Sample observed effects: enumeration, randomization and generalization.

作者信息

Ribeiro Andre F

机构信息

Division of Biosciences, University College London, Gower Street, WC1E, London, United Kingdom.

Department of Applied Mathematics and Statistics, University of Sao Paulo, 13560-970, São Carlos, Brazil.

出版信息

Sci Rep. 2025 Mar 11;15(1):8423. doi: 10.1038/s41598-024-80839-8.

DOI:10.1038/s41598-024-80839-8
PMID:40069178
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11897334/
Abstract

We study generalization of intervention effects across several simulated and real-world samples. We start by formulating the concept of the 'background' of a sample effect observation. We then formulate conditions for effect generalization based on a sample's set of (observed and unobserved) backgrounds. This reveals two limits for effect generalization: (1) when effects of a variable are observed under all their enumerable backgrounds, or, (2) when backgrounds have become sufficiently randomized. We use the resulting combinatorial framework to re-examine open issues in current causal effect estimators: out-of-sample validity, concurrent estimation of multiple effects, bias-variance tradeoffs, statistical power, and connections to current predictive and explaining techniques. Methodologically, these definitions also allow us to replace the parametric estimation problems that followed the 'counterfactual' definition of causal effects by combinatorial enumeration and randomization problems in non-experimental samples. We use the resulting non-parametric framework to demonstrate (External Validity, Unconfoundness and Precision) tradeoffs in the performance of popular supervised, explaining, and causal-effect estimators.

摘要

我们研究了干预效应在多个模拟样本和真实世界样本中的泛化情况。我们首先阐述了样本效应观察“背景”的概念。然后,我们基于样本的(观察到的和未观察到的)背景集制定了效应泛化的条件。这揭示了效应泛化的两个限制:(1)当一个变量的效应在其所有可枚举的背景下都被观察到时;或者,(2)当背景已经充分随机化时。我们使用由此产生的组合框架来重新审视当前因果效应估计器中的开放性问题:样本外有效性、多种效应的并发估计、偏差 - 方差权衡、统计功效以及与当前预测和解释技术的联系。从方法上讲,这些定义还使我们能够通过组合枚举和非实验样本中的随机化问题,取代遵循因果效应“反事实”定义的参数估计问题。我们使用由此产生的非参数框架来展示流行的监督、解释和因果效应估计器在性能方面的(外部有效性、无混杂性和精度)权衡。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/af0f/11897334/a5fedb6f542b/41598_2024_80839_Fig8_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/af0f/11897334/4faa7748bb95/41598_2024_80839_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/af0f/11897334/9096347487b1/41598_2024_80839_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/af0f/11897334/6af054ecec57/41598_2024_80839_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/af0f/11897334/c39b74c73d90/41598_2024_80839_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/af0f/11897334/a5fedb6f542b/41598_2024_80839_Fig8_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/af0f/11897334/4faa7748bb95/41598_2024_80839_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/af0f/11897334/9096347487b1/41598_2024_80839_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/af0f/11897334/6af054ecec57/41598_2024_80839_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/af0f/11897334/c39b74c73d90/41598_2024_80839_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/af0f/11897334/a5fedb6f542b/41598_2024_80839_Fig8_HTML.jpg

相似文献

1
Sample observed effects: enumeration, randomization and generalization.样本观察到的效应:枚举、随机化和泛化。
Sci Rep. 2025 Mar 11;15(1):8423. doi: 10.1038/s41598-024-80839-8.
2
Folic acid supplementation and malaria susceptibility and severity among people taking antifolate antimalarial drugs in endemic areas.在流行地区,服用抗叶酸抗疟药物的人群中,叶酸补充剂与疟疾易感性和严重程度的关系。
Cochrane Database Syst Rev. 2022 Feb 1;2(2022):CD014217. doi: 10.1002/14651858.CD014217.
3
Selection criteria and generalizability within the counterfactual framework: explaining the paradox of antidepressant-induced suicidality?反事实框架内的选择标准与可推广性:解释抗抑郁药诱发自杀行为的悖论?
Clin Trials. 2009 Apr;6(2):109-18. doi: 10.1177/1740774509102563.
4
Direct effect models.直接效应模型。
Int J Biostat. 2008;4(1):Article 23. doi: 10.2202/1557-4679.1064.
5
Machine Learning for Causal Inference: On the Use of Cross-fit Estimators.机器学习在因果推断中的应用:基于交叉拟合估计量的研究。
Epidemiology. 2021 May 1;32(3):393-401. doi: 10.1097/EDE.0000000000001332.
6
A generalization of moderated statistics to data adaptive semiparametric estimation in high-dimensional biology.高维生物学中数据自适应半参数估计的适度统计的推广。
Stat Methods Med Res. 2023 Mar;32(3):539-554. doi: 10.1177/09622802221146313. Epub 2022 Dec 26.
7
Statistical inference and power analysis for direct and spillover effects in two-stage randomized experiments.两阶段随机试验中直接和溢出效应的统计推断和功效分析。
Biometrics. 2023 Sep;79(3):2370-2381. doi: 10.1111/biom.13782. Epub 2022 Nov 10.
8
A review of instrumental variable estimators for Mendelian randomization.孟德尔随机化中工具变量估计量的综述。
Stat Methods Med Res. 2017 Oct;26(5):2333-2355. doi: 10.1177/0962280215597579. Epub 2015 Aug 17.
9
A note on semiparametric efficient generalization of causal effects from randomized trials to target populations.关于因果效应从随机试验到目标人群的半参数有效推广的一则注释。
Commun Stat Theory Methods. 2023;52(16):5767-5798. doi: 10.1080/03610926.2021.2020291. Epub 2021 Dec 29.
10
Blinded sample size re-estimation in superiority and noninferiority trials: bias versus variance in variance estimation.优效性和非劣效性试验中的盲法样本量重新估计:方差估计中的偏差与方差
Pharm Stat. 2013 May-Jun;12(3):141-6. doi: 10.1002/pst.1564. Epub 2013 Mar 19.

本文引用的文献

1
Explaining neural scaling laws.解释神经缩放定律。
Proc Natl Acad Sci U S A. 2024 Jul 2;121(27):e2311878121. doi: 10.1073/pnas.2311878121. Epub 2024 Jun 24.
2
Individual and community-level risk for COVID-19 mortality in the United States.美国 COVID-19 死亡率的个体和社区水平风险。
Nat Med. 2021 Feb;27(2):264-269. doi: 10.1038/s41591-020-01191-8. Epub 2020 Dec 11.
3
Individual-Level Fatality Prediction of COVID-19 Patients Using AI Methods.利用人工智能方法预测 COVID-19 患者的个体病死率。
Front Public Health. 2020 Sep 30;8:587937. doi: 10.3389/fpubh.2020.587937. eCollection 2020.
4
Crowding and the shape of COVID-19 epidemics.拥挤程度与 COVID-19 疫情的形状。
Nat Med. 2020 Dec;26(12):1829-1834. doi: 10.1038/s41591-020-1104-0. Epub 2020 Oct 5.
5
Associations Between Built Environment, Neighborhood Socioeconomic Status, and SARS-CoV-2 Infection Among Pregnant Women in New York City.纽约市孕妇的建筑环境、社区社会经济地位与 SARS-CoV-2 感染之间的关联。
JAMA. 2020 Jul 28;324(4):390-392. doi: 10.1001/jama.2020.11370.
6
G-computation, propensity score-based methods, and targeted maximum likelihood estimator for causal inference with different covariates sets: a comparative simulation study.基于 G-计算、倾向评分匹配方法和有不同协变量集的因果推断的目标极大似然估计器:一项比较模拟研究。
Sci Rep. 2020 Jun 8;10(1):9219. doi: 10.1038/s41598-020-65917-x.
7
Estimates of the severity of coronavirus disease 2019: a model-based analysis.新型冠状病毒疾病 2019 严重程度的估计:基于模型的分析。
Lancet Infect Dis. 2020 Jun;20(6):669-677. doi: 10.1016/S1473-3099(20)30243-7. Epub 2020 Mar 30.
8
The UK Biobank resource with deep phenotyping and genomic data.英国生物银行资源库,具有深度表型和基因组数据。
Nature. 2018 Oct;562(7726):203-209. doi: 10.1038/s41586-018-0579-z. Epub 2018 Oct 10.
9
Causal Inference on Discrete Data via Estimating Distance Correlations.通过估计距离相关性对离散数据进行因果推断。
Neural Comput. 2016 May;28(5):801-14. doi: 10.1162/NECO_a_00820. Epub 2016 Feb 18.
10
The parametric g-formula for time-to-event data: intuition and a worked example.用于事件发生时间数据的参数化g公式:直观理解与实例分析。
Epidemiology. 2014 Nov;25(6):889-97. doi: 10.1097/EDE.0000000000000160.