Department of Statistics, University of Padova, Padova 35121, Italy.
Department of Biostatistics, Harvard T.H. Chan School of Public Health, Boston, 02115 MA, United States.
Biometrics. 2024 Mar 27;80(2). doi: 10.1093/biomtc/ujae025.
Several epidemiological studies have provided evidence that long-term exposure to fine particulate matter (pm2.5) increases mortality rate. Furthermore, some population characteristics (e.g., age, race, and socioeconomic status) might play a crucial role in understanding vulnerability to air pollution. To inform policy, it is necessary to identify groups of the population that are more or less vulnerable to air pollution. In causal inference literature, the group average treatment effect (GATE) is a distinctive facet of the conditional average treatment effect. This widely employed metric serves to characterize the heterogeneity of a treatment effect based on some population characteristics. In this paper, we introduce a novel Confounder-Dependent Bayesian Mixture Model (CDBMM) to characterize causal effect heterogeneity. More specifically, our method leverages the flexibility of the dependent Dirichlet process to model the distribution of the potential outcomes conditionally to the covariates and the treatment levels, thus enabling us to: (i) identify heterogeneous and mutually exclusive population groups defined by similar GATEs in a data-driven way, and (ii) estimate and characterize the causal effects within each of the identified groups. Through simulations, we demonstrate the effectiveness of our method in uncovering key insights about treatment effects heterogeneity. We apply our method to claims data from Medicare enrollees in Texas. We found six mutually exclusive groups where the causal effects of pm2.5 on mortality rate are heterogeneous.
已有多项流行病学研究表明,长期暴露于细颗粒物(PM2.5)会增加死亡率。此外,一些人口特征(如年龄、种族和社会经济地位)可能在理解对空气污染的脆弱性方面发挥关键作用。为了制定政策,有必要确定对空气污染更易受影响或不易受影响的人群群体。在因果推理文献中,群体平均处理效应(GATE)是条件平均处理效应的一个独特方面。这个广泛使用的指标用于根据一些人口特征来描述处理效果的异质性。在本文中,我们引入了一种新的依赖混杂因素的贝叶斯混合模型(CDBMM)来描述因果效应的异质性。更具体地说,我们的方法利用依赖 Dirichlet 过程的灵活性来对潜在结果的分布进行建模,条件是协变量和处理水平,从而使我们能够:(i)以数据驱动的方式识别由相似 GATE 定义的异构和互斥的人群群体,以及(ii)在每个识别出的群体中估计和描述因果效应。通过模拟,我们证明了我们的方法在揭示处理效果异质性方面的有效性。我们将我们的方法应用于德克萨斯州医疗保险参保者的索赔数据。我们发现了六个互斥的群体,其中 PM2.5 对死亡率的因果效应是异构的。