Aikens Rachael C, Baiocchi Michael
Department of Biomedical Data Science, Stanford University.
Department of Epidemiology and Population Health, Stanford University.
Am Stat. 2023;77(1):72-84. doi: 10.1080/00031305.2022.2051605. Epub 2022 Apr 11.
An important step for any causal inference study design is understanding the distribution of the subjects in terms of measured baseline covariates. However, not all baseline variation is equally important. We propose a set of visualizations that reduce the space of measured covariates into two components of baseline variation important to the design of an observational causal inference study: a propensity score summarizing baseline variation associated with treatment assignment, and prognostic score summarizing baseline variation associated with the untreated potential outcome. These and variations thereof visualize study design trade-offs and illustrate core methodological concepts in causal inference. As a practical demonstration, we apply assignment-control plots to a hypothetical study of cardiothoracic surgery. To demonstrate how these plots can be used to illustrate nuanced concepts, we use them to visualize unmeasured confounding and to consider the relationship between propensity scores and instrumental variables. While the family of visualization tools for studies of causality is relatively sparse, simple visual tools can be an asset to education, application, and methods development.
对于任何因果推断研究设计而言,重要的一步是根据测量的基线协变量了解研究对象的分布情况。然而,并非所有基线变异都同等重要。我们提出了一组可视化方法,将测量的协变量空间缩减为对观察性因果推断研究设计重要的两个基线变异成分:一个倾向得分,总结与治疗分配相关的基线变异;一个预后得分,总结与未治疗潜在结果相关的基线变异。这些及其变体可视化了研究设计的权衡,并阐明了因果推断中的核心方法概念。作为一个实际演示,我们将分配-对照图应用于一个心胸外科的假设研究。为了展示这些图如何用于阐明细微的概念,我们用它们来可视化未测量的混杂因素,并考虑倾向得分与工具变量之间的关系。虽然用于因果关系研究的可视化工具种类相对较少,但简单的可视化工具对教育、应用和方法开发可能是一项资产。