Dupont Emiko, Wood Simon N, Augustin Nicole H
Department of Mathematical Sciences, University of Bath, Bath, UK.
School of Mathematics, University of Edinburgh, Edinburgh, UK.
Biometrics. 2022 Dec;78(4):1279-1290. doi: 10.1111/biom.13656. Epub 2022 Mar 30.
In spatial regression models, collinearity between covariates and spatial effects can lead to significant bias in effect estimates. This problem, known as spatial confounding, is encountered modeling forestry data to assess the effect of temperature on tree health. Reliable inference is difficult as results depend on whether or not spatial effects are included in the model. We propose a novel approach, spatial+, for dealing with spatial confounding when the covariate of interest is spatially dependent but not fully determined by spatial location. Using a thin plate spline model formulation we see that, in this case, the bias in covariate effect estimates is a direct result of spatial smoothing. Spatial+ reduces the sensitivity of the estimates to smoothing by replacing the covariates by their residuals after spatial dependence has been regressed away. Through asymptotic analysis we show that spatial+ avoids the bias problems of the spatial model. This is also demonstrated in a simulation study. Spatial+ is straightforward to implement using existing software and, as the response variable is the same as that of the spatial model, standard model selection criteria can be used for comparisons. A major advantage of the method is also that it extends to models with non-Gaussian response distributions. Finally, while our results are derived in a thin plate spline setting, the spatial+ methodology transfers easily to other spatial model formulations.
在空间回归模型中,协变量与空间效应之间的共线性会导致效应估计出现显著偏差。这个问题被称为空间混杂,在对林业数据进行建模以评估温度对树木健康的影响时会遇到。由于结果取决于模型中是否包含空间效应,因此进行可靠的推断很困难。当感兴趣的协变量在空间上相关但不完全由空间位置决定时,我们提出了一种新的方法——Spatial+,用于处理空间混杂问题。通过薄板样条模型公式我们发现,在这种情况下,协变量效应估计中的偏差是空间平滑的直接结果。Spatial+通过在去除空间相关性后用协变量的残差替换协变量,降低了估计对平滑的敏感性。通过渐近分析我们表明,Spatial+避免了空间模型的偏差问题。这在模拟研究中也得到了证明。使用现有软件很容易实现Spatial+,并且由于响应变量与空间模型的相同,因此可以使用标准的模型选择标准进行比较。该方法的一个主要优点还在于它可以扩展到具有非高斯响应分布的模型。最后,虽然我们的结果是在薄板样条设置中得出的,但Spatial+方法很容易转移到其他空间模型公式中。