通过将前者嵌入后者，弥合观察性研究和随机实验之间的差距。

Bridging observational studies and randomized experiments by embedding the former in the latter.

机构信息

Faculty of Arts and Sciences, Department of Statistics, Harvard University, Cambridge, MA, USA.

出版信息

Stat Methods Med Res. 2019 Jul;28(7):1958-1978. doi: 10.1177/0962280217740609. Epub 2017 Nov 29.

DOI:10.1177/0962280217740609

PMID:29187059

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC5902671/

Abstract

Consider a statistical analysis that draws causal inferences from an observational dataset, inferences that are presented as being valid in the standard frequentist senses; i.e. the analysis produces: (1) consistent point estimates, (2) valid -values, valid in the sense of rejecting true null hypotheses at the nominal level or less often, and/or (3) confidence intervals, which are presented as having at least their nominal coverage for their estimands. For the hypothetical validity of these statements, the analysis must embed the observational study in a hypothetical randomized experiment that created the observed data, or a subset of that hypothetical randomized data set. This multistage effort with thought-provoking tasks involves: (1) a purely that precisely formulate the causal question in terms of a hypothetical randomized experiment where the exposure is assigned to units; (2) a that approximates a randomized experiment before any outcome data are observed, (3) a comparing the outcomes of interest in the exposed and non-exposed units of the hypothetical randomized experiment, and (4) a providing conclusions about statistical evidence for the sizes of possible causal effects. Stages 2 and 3 may rely on modern computing to implement the effort, whereas Stage 1 demands careful scientific argumentation to make the embedding plausible to scientific readers of the proffered statistical analysis. Otherwise, the resulting analysis is vulnerable to criticism for being simply a presentation of scientifically meaningless arithmetic calculations. The conceptually most demanding tasks are often the most scientifically interesting to the dedicated researcher and readers of the resulting statistical analyses. This perspective is rarely implemented with any rigor, for example, completely eschewing the first stage. We illustrate our approach using an example examining the effect of parental smoking on children's lung function collected in families living in East Boston in the 1970s.

摘要

考虑一项统计分析，该分析从观察性数据集得出因果推论，这些推论在标准的频率派意义上被认为是有效的；即该分析产生：（1）一致的点估计值，（2）有效的 - 值，即在名义水平或更频繁地拒绝真实零假设的意义上有效，和/或（3）置信区间，其被表示为具有至少其名义覆盖范围的估计量。对于这些陈述的假设有效性，分析必须将观察性研究嵌入创建观察数据的假设随机实验中，或者是该假设随机数据集的一个子集。这项涉及深思熟虑任务的多阶段努力包括：（1）纯粹的思考，即根据暴露于单位的假设随机实验精确地表述因果问题；（2）在观察到任何结果数据之前近似随机实验的努力，（3）在假设随机实验的暴露和非暴露单位中比较感兴趣的结果的努力，以及（4）提供关于统计证据的结论，说明可能的因果效应的大小。第 2 阶段和第 3 阶段可能依赖于现代计算来实现该努力，而第 1 阶段则需要仔细的科学论证来使嵌入式分析对所提供的统计分析的科学读者具有合理性。否则，该分析很容易受到批评，因为它只是呈现了无意义的科学计算。对于专注的研究人员和分析结果的读者来说，概念上最具挑战性的任务通常是最具科学趣味性的。这种观点很少被严格执行，例如，完全回避第一阶段。我们使用一个示例来说明我们的方法，该示例检查了 20 世纪 70 年代在东波士顿居住的家庭中父母吸烟对儿童肺功能的影响。

相似文献

Bridging observational studies and randomized experiments by embedding the former in the latter.

Stat Methods Med Res. 2019 Jul;28(7):1958-1978. doi: 10.1177/0962280217740609. Epub 2017 Nov 29.

Using Bounds to Compare the Strength of Exchangeability Assumptions for Internal and External Validity.

Am J Epidemiol. 2019 Jul 1;188(7):1355-1360. doi: 10.1093/aje/kwz060.

Translating questions to estimands in randomized clinical trials with intercurrent events.

Stat Med. 2022 Jul 20;41(16):3211-3228. doi: 10.1002/sim.9398. Epub 2022 May 16.

The design versus the analysis of observational studies for causal effects: parallels with the design of randomized trials.

Stat Med. 2007 Jan 15;26(1):20-36. doi: 10.1002/sim.2739.

Estimation of causal effects of binary treatments in unconfounded studies.

Stat Med. 2015 Nov 20;34(26):3381-98. doi: 10.1002/sim.6532. Epub 2015 May 26.

Re: The design versus the analysis of observational studies for causal effects: parallels with the design of randomized trials.

Stat Med. 2008 Jun 30;27(14):2740-1; author reply 2741-2. doi: 10.1002/sim.3172.

Causal inference methods to assess safety upper bounds in randomized trials with noncompliance.

Clin Trials. 2015 Jun;12(3):265-75. doi: 10.1177/1740774515572352. Epub 2015 Mar 1.

A fast bootstrap algorithm for causal inference with large data.

Stat Med. 2024 Jul 10;43(15):2894-2927. doi: 10.1002/sim.10075. Epub 2024 May 13.

A guide to improve your causal inferences from observational data.

Eur J Cardiovasc Nurs. 2020 Dec;19(8):757-762. doi: 10.1177/1474515120957241. Epub 2020 Oct 10.

A Bayesian approach to estimating causal vaccine effects on binary post-infection outcomes.

Stat Med. 2016 Jan 15;35(1):53-64. doi: 10.1002/sim.6573. Epub 2015 Jul 20.

引用本文的文献

Counternull sets in randomized experiments.

Am Stat. 2025;79(2):275-285. doi: 10.1080/00031305.2024.2432884. Epub 2025 Jan 17.

Fear of Missing Out's (FoMO) relationship with moral judgment and behavior.

PLoS One. 2024 Nov 7;19(11):e0312724. doi: 10.1371/journal.pone.0312724. eCollection 2024.

Researching COVID to enhance recovery (RECOVER) tissue pathology study protocol: Rationale, objectives, and design.

PLoS One. 2024 Jan 10;19(1):e0285645. doi: 10.1371/journal.pone.0285645. eCollection 2024.

Quasi-rerandomization for observational studies.

BMC Med Res Methodol. 2023 Jun 30;23(1):155. doi: 10.1186/s12874-023-01977-7.

Testing Biased Randomization Assumptions and Quantifying Imperfect Matching and Residual Confounding in Matched Observational Studies.

J Comput Graph Stat. 2023;32(2):528-538. doi: 10.1080/10618600.2022.2116447. Epub 2022 Oct 19.

Improving the design stage of air pollution studies based on wind patterns.

Sci Rep. 2022 May 13;12(1):7917. doi: 10.1038/s41598-022-11939-6.

A randomization-based causal inference framework for uncovering environmental exposure effects on human gut microbiota.

PLoS Comput Biol. 2022 May 9;18(5):e1010044. doi: 10.1371/journal.pcbi.1010044. eCollection 2022 May.

The importance of having a conceptual stage when reporting non-randomized studies.

Biostat Epidemiol. 2021;5(1):9-18. doi: 10.1080/24709360.2021.1913707. Epub 2021 Apr 30.

Causal Isotonic Regression.

J R Stat Soc Series B Stat Methodol. 2020 Jul;82(3):719-747. doi: 10.1111/rssb.12372. Epub 2020 May 13.

The impact of outdoor air pollution on COVID-19: a review of evidence from , animal, and human studies.

Eur Respir Rev. 2021 Feb 9;30(159). doi: 10.1183/16000617.0242-2020. Print 2021 Mar 31.

本文引用的文献

Joint Bayesian weight and height postnatal growth model to study the effects of maternal smoking during pregnancy.

Stat Med. 2017 Nov 10;36(25):3990-4006. doi: 10.1002/sim.7407. Epub 2017 Aug 10.

Robust estimation of causal effects of binary treatments in unconfounded studies with dichotomous outcomes.

Stat Med. 2013 May 20;32(11):1795-814. doi: 10.1002/sim.5627. Epub 2012 Sep 28.

A Review of Hot Deck Imputation for Survey Non-response.

Int Stat Rev. 2010 Apr;78(1):40-64. doi: 10.1111/j.1751-5823.2010.00103.x.

Re: The design versus the analysis of observational studies for causal effects: parallels with the design of randomized trials.

Stat Med. 2008 Jun 30;27(14):2740-1; author reply 2741-2. doi: 10.1002/sim.3172.

The design versus the analysis of observational studies for causal effects: parallels with the design of randomized trials.

Stat Med. 2007 Jan 15;26(1):20-36. doi: 10.1002/sim.2739.

Parental smoking and lung function: misclassification due to background exposure to passive smoking.

Respir Med. 2007 Apr;101(4):768-73. doi: 10.1016/j.rmed.2006.08.004. Epub 2006 Sep 26.

Heart failure, chronic diuretic use, and increase in mortality and hospitalization: an observational study using propensity score methods.

Eur Heart J. 2006 Jun;27(12):1431-9. doi: 10.1093/eurheartj/ehi890. Epub 2006 May 18.

The exposure-response curve for ozone and risk of mortality and the adequacy of current ozone regulations.

Environ Health Perspect. 2006 Apr;114(4):532-6. doi: 10.1289/ehp.8816.

Air pollution and blood markers of cardiovascular risk.

Environ Health Perspect. 2001 Jun;109 Suppl 3(Suppl 3):405-9. doi: 10.1289/ehp.01109s3405.

More powerful randomization-based p-values in double-blind trials with non-compliance.

Stat Med. 1998 Feb 15;17(3):371-85; discussion 387-9. doi: 10.1002/(sici)1097-0258(19980215)17:3<371::aid-sim768>3.0.co;2-o.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

Suppr
超能文献

通过将前者嵌入后者，弥合观察性研究和随机实验之间的差距。

Bridging observational studies and randomized experiments by embedding the former in the latter.

机构信息

出版信息

相似文献

引用本文的文献

本文引用的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

Suppr超能文献

通过将前者嵌入后者，弥合观察性研究和随机实验之间的差距。

Bridging observational studies and randomized experiments by embedding the former in the latter.

机构信息

出版信息

相似文献

引用本文的文献

本文引用的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

Suppr
超能文献