Coull Brent A, Lee Seokho, McGee Glen, Manjourides Justin, Mittleman Murray A, Wellenius Gregory A
Department of Biostatistics, Harvard T. H. Chan School of Public Health, Boston, 02115, Massachusetts.
Department of Environmental Health, Harvard T. H. Chan School of Public Health, Boston, 02115, Massachusetts.
Biometrics. 2020 Sep;76(3):963-972. doi: 10.1111/biom.13173. Epub 2019 Nov 28.
Epidemiologic studies of the short-term effects of ambient particulate matter (PM) on the risk of acute cardiovascular or cerebrovascular events often use data from administrative databases in which only the date of hospitalization is known. A common study design for analyzing such data is the case-crossover design, in which exposure at a time when a patient experiences an event is compared to exposure at times when the patient did not experience an event within a case-control paradigm. However, the time of true event onset may precede hospitalization by hours or days, which can yield attenuated effect estimates. In this article, we consider a marginal likelihood estimator, a regression calibration estimator, and a conditional score estimator, as well as parametric bootstrap versions of each, to correct for this bias. All considered approaches require validation data on the distribution of the delay times. We compare the performance of the approaches in realistic scenarios via simulation, and apply the methods to analyze data from a Boston-area study of the association between ambient air pollution and acute stroke onset. Based on both simulation and the case study, we conclude that a two-stage regression calibration estimator with a parametric bootstrap bias correction is an effective method for correcting bias in health effect estimates arising from delayed onset in a case-crossover study.
关于环境颗粒物(PM)对急性心血管或脑血管事件风险的短期影响的流行病学研究,通常使用行政数据库中的数据,而这些数据库中仅知道住院日期。分析此类数据的一种常见研究设计是病例交叉设计,即在病例对照范式下,将患者发生事件时的暴露情况与患者未发生事件时的暴露情况进行比较。然而,真正的事件发作时间可能比住院时间提前数小时或数天,这可能会导致效应估计值减弱。在本文中,我们考虑了一种边际似然估计器、一种回归校准估计器和一种条件得分估计器,以及它们各自的参数自助法版本,以校正这种偏差。所有考虑的方法都需要关于延迟时间分布的验证数据。我们通过模拟比较了这些方法在实际场景中的性能,并应用这些方法分析了波士顿地区一项关于环境空气污染与急性中风发作之间关联的研究数据。基于模拟和案例研究,我们得出结论,具有参数自助法偏差校正的两阶段回归校准估计器是校正病例交叉研究中因发作延迟而导致的健康效应估计偏差的有效方法。