Zanobetti A, Wand M P, Schwartz J, Ryan L M
Department of Environmental Health, Harvard School of Public Health, 665 Huntington Avenue, Boston, MA 02115, USA.
Biostatistics. 2000 Sep;1(3):279-92. doi: 10.1093/biostatistics/1.3.279.
There are a number of applied settings where a response is measured repeatedly over time, and the impact of a stimulus at one time is distributed over several subsequent response measures. In the motivating application the stimulus is an air pollutant such as airborne particulate matter and the response is mortality. However, several other variables (e.g. daily temperature) impact the response in a possibly non-linear fashion. To quantify the effect of the stimulus in the presence of covariate data we combine two established regression techniques: generalized additive models and distributed lag models. Generalized additive models extend multiple linear regression by allowing for continuous covariates to be modeled as smooth, but otherwise unspecified, functions. Distributed lag models aim to relate the outcome variable to lagged values of a time-dependent predictor in a parsimonious fashion. The resultant, which we call generalized additive distributed lag models, are seen to effectively quantify the so-called 'mortality displacement effect' in environmental epidemiology, as illustrated through air pollution/mortality data from Milan, Italy.
在许多实际应用场景中,响应会随时间被反复测量,并且某一时刻刺激的影响会分布在随后的几个响应测量值中。在具有启发性的应用中,刺激因素是一种空气污染物,如空气中的颗粒物,而响应则是死亡率。然而,其他几个变量(如每日气温)可能以非线性方式影响响应。为了在存在协变量数据的情况下量化刺激因素的影响,我们结合了两种既定的回归技术:广义相加模型和分布滞后模型。广义相加模型通过允许将连续协变量建模为平滑但未明确指定的函数,对多元线性回归进行了扩展。分布滞后模型旨在以简洁的方式将结果变量与时变预测变量的滞后值联系起来。由此产生的我们称之为广义相加分布滞后模型的方法,被证明能够有效地量化环境流行病学中所谓的“死亡率替代效应”,这一点通过意大利米兰的空气污染/死亡率数据得到了说明。