Department of Industrial Engineering, Ben Gurion University of the Negev, Be'er Sheva, Israel.
Department of Geography and Environmental Development, Ben Gurion University of the Negev, Be'er Sheva, Israel.
J Expo Sci Environ Epidemiol. 2023 Nov;33(6):963-970. doi: 10.1038/s41370-022-00438-5. Epub 2022 Apr 22.
In the first stage of a two-stage study, the researcher uses a statistical model to impute the unobserved exposures. In the second stage, imputed exposures serve as covariates in epidemiological models. Imputation error in the first stage operate as measurement errors in the second stage, and thus bias exposure effect estimates.
This study aims to improve the estimation of exposure effects by sharing information between the first and second stages.
At the heart of our estimator is the observation that not all second-stage observations are equally important to impute. We thus borrow ideas from the optimal-experimental-design theory, to identify individuals of higher importance. We then improve the imputation of these individuals using ideas from the machine-learning literature of domain adaptation.
Our simulations confirm that the exposure effect estimates are more accurate than the current best practice. An empirical demonstration yields smaller estimates of PM effect on hyperglycemia risk, with tighter confidence bands.
Sharing information between environmental scientist and epidemiologist improves health effect estimates. Our estimator is a principled approach for harnessing this information exchange, and may be applied to any two stage study.
在两阶段研究的第一阶段,研究人员使用统计模型来推断未观测到的暴露情况。在第二阶段,推断出的暴露情况作为流行病学模型中的协变量。第一阶段的推断误差在第二阶段充当测量误差,从而会导致暴露效应估计值产生偏差。
本研究旨在通过在第一阶段和第二阶段之间共享信息来提高暴露效应的估计值。
我们的估计器的核心思想是,并非所有第二阶段的观测值对于推断都是同等重要的。因此,我们借鉴了最优实验设计理论的思想,以确定更重要的个体。然后,我们使用领域自适应机器学习文献中的思想,改进这些个体的推断。
我们的模拟证实,暴露效应的估计值比当前的最佳实践更为准确。实证演示得出了 PM 对高血糖风险影响的估计值较小,置信带更紧。
环境科学家和流行病学家之间的信息共享可以改善健康效应的估计值。我们的估计器是利用这种信息交换的一种有原则的方法,并且可以应用于任何两阶段研究。