Perrakis Konstantinos, Gryparis Alexandros, Schwartz Joel, Le Tertre Alain, Katsouyanni Klea, Forastiere Francesco, Stafoggia Massimo, Samoli Evangelia
Department of Hygiene, Epidemiology and Medical Statistics, Medical School, University of Athens, Athens, Greece.
Stat Med. 2014 Dec 10;33(28):4904-18. doi: 10.1002/sim.6271. Epub 2014 Jul 23.
An important topic when estimating the effect of air pollutants on human health is choosing the best method to control for seasonal patterns and time varying confounders, such as temperature and humidity. Semi-parametric Poisson time-series models include smooth functions of calendar time and weather effects to control for potential confounders. Case-crossover (CC) approaches are considered efficient alternatives that control seasonal confounding by design and allow inclusion of smooth functions of weather confounders through their equivalent Poisson representations. We evaluate both methodological designs with respect to seasonal control and compare spline-based approaches, using natural splines and penalized splines, and two time-stratified CC approaches. For the spline-based methods, we consider fixed degrees of freedom, minimization of the partial autocorrelation function, and general cross-validation as smoothing criteria. Issues of model misspecification with respect to weather confounding are investigated under simulation scenarios, which allow quantifying omitted, misspecified, and irrelevant-variable bias. The simulations are based on fully parametric mechanisms designed to replicate two datasets with different mortality and atmospheric patterns. Overall, minimum partial autocorrelation function approaches provide more stable results for high mortality counts and strong seasonal trends, whereas natural splines with fixed degrees of freedom perform better for low mortality counts and weak seasonal trends followed by the time-season-stratified CC model, which performs equally well in terms of bias but yields higher standard errors.
在评估空气污染物对人类健康的影响时,一个重要的课题是选择最佳方法来控制季节模式和随时间变化的混杂因素,如温度和湿度。半参数泊松时间序列模型包括日历时间和平滑的天气效应函数,以控制潜在的混杂因素。病例交叉(CC)方法被认为是有效的替代方法,它通过设计控制季节混杂,并通过其等效的泊松表示法纳入天气混杂因素的平滑函数。我们评估了两种关于季节控制的方法设计,并比较了基于样条的方法,包括自然样条和惩罚样条,以及两种时间分层的CC方法。对于基于样条的方法,我们考虑固定自由度、最小化偏自相关函数以及将广义交叉验证作为平滑标准。在模拟场景下研究了关于天气混杂的模型误设问题,这使得能够量化遗漏、误设和无关变量偏差。模拟基于完全参数化机制,旨在复制两个具有不同死亡率和大气模式的数据集。总体而言,最小偏自相关函数方法对于高死亡率计数和强烈季节趋势提供更稳定的结果,而固定自由度的自然样条对于低死亡率计数和较弱季节趋势表现更好,其次是时间季节分层的CC模型,其在偏差方面表现相当,但标准误差更高。