Am J Epidemiol. 2021 Feb 1;190(3):468-476. doi: 10.1093/aje/kwaa172.
The initial aim of environmental epidemiology is to estimate the causal effects of environmental exposures on health outcomes. However, due to lack of enough covariates in most environmental data sets, current methods without enough adjustments for confounders inevitably lead to residual confounding. We propose a negative-control exposure based on a time-series studies (NCE-TS) model to effectively eliminate unobserved confounders using an after-outcome exposure as a negative-control exposure. We show that the causal effect is identifiable and can be estimated by the NCE-TS for continuous and categorical outcomes. Simulation studies indicate unbiased estimation by the NCE-TS model. The potential of NCE-TS is illustrated by 2 challenging applications: We found that living in areas with higher levels of surrounding greenness over 6 months was associated with less risk of stroke-specific mortality, based on the Shandong Ecological Health Cohort during January 1, 2010, to December 31, 2018. In addition, we found that the widely established negative association between temperature and cancer risks was actually caused by numbers of unobserved confounders, according to the Global Open Database from 2003-2012. The proposed NCE-TS model is implemented in an R package (R Foundation for Statistical Computing, Vienna, Austria) called NCETS, freely available on GitHub.
环境流行病学的最初目标是估计环境暴露对健康结果的因果效应。然而,由于大多数环境数据集缺乏足够的协变量,因此当前没有充分调整混杂因素的方法不可避免地会导致残余混杂。我们提出了一种基于时间序列研究(NCE-TS)模型的负对照暴露,使用事后暴露作为负对照暴露,有效地消除未观察到的混杂因素。我们表明,对于连续和分类结果,可以通过 NCE-TS 来识别和估计因果效应。模拟研究表明,NCE-TS 模型可以进行无偏估计。通过 2 个具有挑战性的应用来说明 NCE-TS 的潜力:我们发现,基于 2010 年 1 月 1 日至 2018 年 12 月 31 日期间的山东生态健康队列,在 6 个月内生活在周围绿化水平较高的地区与中风特异性死亡率降低相关。此外,我们发现,根据 2003-2012 年全球开放数据库,温度与癌症风险之间广泛存在的负相关实际上是由未观察到的混杂因素数量引起的。所提出的 NCE-TS 模型在一个名为 NCETS 的 R 包(奥地利维也纳统计计算基金会,R 基金会)中实现,可在 GitHub 上免费获得。