Department of Environmental Health, Harvard T.H. Chan School of Public Health, Landmark Center 4th West, 401 Park Drive, Boston, MA, 02215, USA.
Department of Biostatistics, Harvard T.H. Chan School of Public Health, Boston, MA, USA.
Environ Health. 2021 Feb 23;20(1):19. doi: 10.1186/s12940-021-00704-3.
We previously found additive effects of long- and short-term exposures to fine particulate matter (PM), ozone (O), and nitrogen dioxide (NO) on all-cause mortality rate using a generalized propensity score (GPS) adjustment approach. The study addressed an important question of how many early deaths were caused by each exposure. However, the study was computationally expensive, did not capture possible interactions and high-order nonlinearities, and omitted potential confounders.
We proposed two new methods and reconducted the analysis using the same cohort of Medicare beneficiaries in Massachusetts during 2000-2012, which consisted of 1.5 million individuals with 3.8 billion person-days of follow-up. The first method, weighted least squares (WLS), leveraged large volume of data by aggregating person-days, which gave equivalent results to the linear probability model (LPM) method in the previous analysis but significantly reduced computational burden. The second method, m-out-of-n random forests (moonRF), implemented scaling random forests that captured all possible interactions and nonlinearities in the GPS model. To minimize confounding bias, we additionally controlled relative humidity and health care utilizations that were not included previously. Further, we performed low-level analysis by restricting to person-days with exposure levels below increasingly stringent thresholds.
We found consistent results between LPM/WLS and moonRF: all exposures were positively associated with mortality rate, even at low levels. For long-term PM and O, the effect estimates became larger at lower levels. Long-term exposure to PM posed the highest risk: 1 μg/m increase in long-term PM was associated with 1053 (95% confidence interval [CI]: 984, 1122; based on LPM/WLS methods) or 1058 (95% CI: 988, 1127; based on moonRF method) early deaths each year among the Medicare population in Massachusetts.
This study provides more rigorous causal evidence between PM, O, and NO exposures and mortality, even at low levels. The largest effect estimate for long-term PM suggests that reducing PM could gain the most substantial benefits. The consistency between LPM/WLS and moonRF suggests that there were not many interactions and high-order nonlinearities. In the big data context, the proposed methods will be useful for future scientific work in estimating causality on an additive scale.
我们之前使用广义倾向评分(GPS)调整方法发现,长期和短期暴露于细颗粒物(PM)、臭氧(O)和二氧化氮(NO)对全因死亡率有累加效应。该研究解决了一个重要问题,即每种暴露导致了多少早逝。然而,该研究计算成本高,没有捕捉到可能的相互作用和高阶非线性,并且忽略了潜在的混杂因素。
我们提出了两种新方法,并使用马萨诸塞州 2000-2012 年期间的同一组医疗保险受益人重新进行了分析,该组由 150 万人组成,随访时间为 38 亿人天。第一种方法,加权最小二乘法(WLS),通过聚合人天来利用大量数据,这与之前分析中的线性概率模型(LPM)方法结果等效,但大大降低了计算负担。第二种方法,m-out-of-n 随机森林(moonRF),实施了缩放随机森林,该森林捕获了 GPS 模型中的所有可能相互作用和非线性。为了最小化混杂偏差,我们还控制了之前未包含的相对湿度和医疗保健利用情况。此外,我们通过将分析限制在暴露水平低于越来越严格的阈值的人天来进行低级分析。
我们在 LPM/WLS 和 moonRF 之间发现了一致的结果:所有暴露均与死亡率呈正相关,即使在低水平也是如此。对于长期 PM 和 O,在较低水平时,效应估计值会更大。长期暴露于 PM 带来的风险最高:长期 PM 每增加 1μg/m,马萨诸塞州医疗保险人群中每年就会有 1053 人(95%置信区间[CI]:984,1122;基于 LPM/WLS 方法)或 1058 人(95%CI:988,1127;基于 moonRF 方法)过早死亡。
这项研究提供了在低水平下 PM、O 和 NO 暴露与死亡率之间更严格的因果证据。长期 PM 的最大效应估计表明,减少 PM 可以获得最大的收益。LPM/WLS 和 moonRF 之间的一致性表明,相互作用和高阶非线性并不多。在大数据背景下,所提出的方法将有助于未来在加性尺度上估计因果关系的科学工作。