Kent State University; Department of Geography, 413 McGilvrey Hall, 325 S. Lincoln St., Kent, OH 44242 USA.
Kent State University; Department of Geography, 413 McGilvrey Hall, 325 S. Lincoln St., Kent, OH 44242 USA.
Environ Res. 2018 Jul;164:53-64. doi: 10.1016/j.envres.2018.02.020. Epub 2018 Feb 23.
Temperature-mortality relationships are nonlinear, time-lagged, and can vary depending on the time of year and geographic location, all of which limits the applicability of simple regression models in describing these associations. This research demonstrates the utility of an alternative method for modeling such complex relationships that has gained recent traction in other environmental fields: nonlinear autoregressive models with exogenous input (NARX models). All-cause mortality data and multiple temperature-based data sets were gathered from 41 different US cities, for the period 1975-2010, and subjected to ensemble NARX modeling. Models generally performed better in larger cities and during the winter season. Across the US, median absolute percentage errors were 10% (ranging from 4% to 15% in various cities), the average improvement in the r-squared over that of a simple persistence model was 17% (6-24%), and the hit rate for modeling spike days in mortality (>80th percentile) was 54% (34-71%). Mortality responded acutely to hot summer days, peaking at 0-2 days of lag before dropping precipitously, and there was an extended mortality response to cold winter days, peaking at 2-4 days of lag and dropping slowly and continuing for multiple weeks. Spring and autumn showed both of the aforementioned temperature-mortality relationships, but generally to a lesser magnitude than what was seen in summer or winter. When compared to distributed lag nonlinear models, NARX model output was nearly identical. These results highlight the applicability of NARX models for use in modeling complex and time-dependent relationships for various applications in epidemiology and environmental sciences.
温度-死亡率关系是非线性的、具有时滞性的,并且可能因季节和地理位置的不同而有所变化,所有这些都限制了简单回归模型在描述这些关联中的适用性。本研究展示了一种用于建模此类复杂关系的替代方法的实用性,这种方法在其他环境领域中最近得到了广泛应用:具有外生输入的非线性自回归模型(NARX 模型)。从 1975 年至 2010 年,从 41 个不同的美国城市收集了全因死亡率数据和多个基于温度的数据组,并对其进行了集合 NARX 建模。模型在较大的城市和冬季的表现通常更好。在美国,中位数绝对百分比误差为 10%(在不同城市的范围为 4%至 15%),与简单持续模型相比,r 平方的平均提高了 17%(6-24%),并且死亡率高峰日(>第 80 百分位)建模的命中率为 54%(34-71%)。死亡率对炎热的夏日反应敏锐,在滞后 0-2 天达到峰值,然后急剧下降,对寒冷的冬日有较长的滞后反应,在滞后 2-4 天达到峰值,然后缓慢下降并持续数周。春季和秋季都表现出了上述温度-死亡率关系,但通常比夏季或冬季的关系要小。与分布式滞后非线性模型相比,NARX 模型的输出几乎相同。这些结果突出了 NARX 模型在流行病学和环境科学的各种应用中用于建模复杂和时变关系的适用性。