Leong Yin-Yee, Yue Jack C
Department of Statistics, National Chengchi University, Taipei, 11605, Taiwan, ROC.
Int J Health Geogr. 2017 Mar 31;16(1):11. doi: 10.1186/s12942-017-0085-9.
Geographically weighted regression (GWR) is a modelling technique designed to deal with spatial non-stationarity, e.g., the mean values vary by locations. It has been widely used as a visualization tool to explore the patterns of spatial data. However, the GWR tends to produce unsmooth surfaces when the mean parameters have considerable variations, partly due to that all parameter estimates are derived from a fixed- range (bandwidth) of observations. In order to deal with the varying bandwidth problem, this paper proposes an alternative approach, namely Conditional geographically weighted regression (CGWR).
The estimation of CGWR is based on an iterative procedure, analogy to the numerical optimization problem. Computer simulation, under realistic settings, is used to compare the performance between the traditional GWR, CGWR, and a local linear modification of GWR. Furthermore, this study also applies the CGWR to two empirical datasets for evaluating the model performance. The first dataset consists of disability status of Taiwan's elderly, along with some social-economic variables and the other is Ohio's crime dataset.
Under the positively correlated scenario, we found that the CGWR produces a better fit for the response surface. Both the computer simulation and empirical analysis support the proposed approach since it significantly reduces the bias and variance of data fitting. In addition, the response surface from the CGWR reviews local spatial characteristics according to the corresponded variables.
As an explanatory tool for spatial data, producing accurate surface is essential in order to provide a first look at the data. Any distorted outcomes would likely mislead the following analysis. Since the CGWR can generate more accurate surface, it is more appropriate to use it exploring data that contain suspicious variables with varying characteristics.
地理加权回归(GWR)是一种旨在处理空间非平稳性的建模技术,例如均值因位置而异。它已被广泛用作探索空间数据模式的可视化工具。然而,当均值参数存在相当大的变化时,GWR往往会产生不光滑的表面,部分原因是所有参数估计都来自固定范围(带宽)的观测值。为了解决带宽变化问题,本文提出了一种替代方法,即条件地理加权回归(CGWR)。
CGWR的估计基于一个迭代过程,类似于数值优化问题。在实际设置下进行计算机模拟,以比较传统GWR、CGWR和GWR的局部线性修正之间的性能。此外,本研究还将CGWR应用于两个实证数据集以评估模型性能。第一个数据集包括台湾老年人的残疾状况以及一些社会经济变量,另一个是俄亥俄州的犯罪数据集。
在正相关情况下,我们发现CGWR对响应面的拟合效果更好。计算机模拟和实证分析都支持所提出的方法,因为它显著降低了数据拟合的偏差和方差。此外,CGWR的响应面根据相应变量反映了局部空间特征。
作为空间数据的一种解释工具,生成准确的表面对于初步查看数据至关重要。任何扭曲的结果都可能误导后续分析。由于CGWR可以生成更准确的表面,因此更适合用于探索包含具有不同特征的可疑变量的数据。