Lee Changwoo J, Symanski Elaine, Rammah Amal, Kang Dong Hun, Hopke Philip K, Park Eun Sug
Department of Statistics, Texas A&M University, 3143 TAMU, 155 Ireland St, College Station, TX 77843, United States.
Center for Precision Environmental Health, Baylor College of Medicine, One Baylor Plaza, Houston, TX 77030, United States.
Biostatistics. 2024 Dec 31;26(1). doi: 10.1093/biostatistics/kxae038.
Accounting for exposure measurement errors has been recognized as a crucial problem in environmental epidemiology for over two decades. Bayesian hierarchical models offer a coherent probabilistic framework for evaluating associations between environmental exposures and health effects, which take into account exposure measurement errors introduced by uncertainty in the estimated exposure as well as spatial misalignment between the exposure and health outcome data. While two-stage Bayesian analyses are often regarded as a good alternative to fully Bayesian analyses when joint estimation is not feasible, there has been minimal research on how to properly propagate uncertainty from the first-stage exposure model to the second-stage health model, especially in the case of a large number of participant locations along with spatially correlated exposures. We propose a scalable two-stage Bayesian approach, called a sparse multivariate normal (sparse MVN) prior approach, based on the Vecchia approximation for assessing associations between exposure and health outcomes in environmental epidemiology. We compare its performance with existing approaches through simulation. Our sparse MVN prior approach shows comparable performance with the fully Bayesian approach, which is a gold standard but is impossible to implement in some cases. We investigate the association between source-specific exposures and pollutant (nitrogen dioxide [NO2])-specific exposures and birth weight of full-term infants born in 2012 in Harris County, Texas, using several approaches, including the newly developed method.
二十多年来,考虑暴露测量误差一直被认为是环境流行病学中的一个关键问题。贝叶斯分层模型提供了一个连贯的概率框架,用于评估环境暴露与健康效应之间的关联,该框架考虑了估计暴露中的不确定性以及暴露与健康结果数据之间的空间错位所引入的暴露测量误差。虽然当联合估计不可行时,两阶段贝叶斯分析通常被视为完全贝叶斯分析的一个很好的替代方法,但对于如何将不确定性从第一阶段暴露模型正确传播到第二阶段健康模型的研究很少,特别是在有大量参与者位置以及空间相关暴露的情况下。我们基于Vecchia近似提出了一种可扩展的两阶段贝叶斯方法,称为稀疏多元正态(sparse MVN)先验方法,用于评估环境流行病学中暴露与健康结果之间的关联。我们通过模拟将其性能与现有方法进行比较。我们的稀疏MVN先验方法显示出与完全贝叶斯方法相当的性能,完全贝叶斯方法是一个黄金标准,但在某些情况下无法实施。我们使用包括新开发的方法在内的几种方法,研究了德克萨斯州哈里斯县2012年出生的足月婴儿的特定源暴露与特定污染物(二氧化氮[NO2])暴露与出生体重之间的关联。