George Lemaitre Centre for Earth and Climate Research, Earth and Life Institute, UCLouvain, Louvain-la-Neuve, Belgium.
Spatial Epidemiology Lab (SpELL), Université Libre de Bruxelles, Brussels, Belgium.
PLoS One. 2020 Jan 27;15(1):e0221070. doi: 10.1371/journal.pone.0221070. eCollection 2020.
The analysis of census data aggregated by administrative units introduces a statistical bias known as the modifiable areal unit problem (MAUP). Previous researches have mostly assessed the effect of MAUP on upscaling models. The present study contributes to clarify the effects of MAUP on the downscaling methodologies, highlighting how a priori choices of scales and shapes could influence the results. We aggregated chicken and duck fine-resolution census in Thailand, using three administrative census levels in regular and irregular shapes. We then disaggregated the data within the Gridded Livestock of the World analytical framework, sampling predictors in two different ways. A sensitivity analysis on Pearson's r correlation statistics and RMSE was carried out to understand how size and shapes of the response variables affect the goodness-of-fit and downscaling performances. We showed that scale, rather than shapes and sampling methods, affected downscaling precision, suggesting that training the model using the finest administrative level available is preferable. Moreover, datasets showing non-homogeneous distribution but instead spatial clustering seemed less affected by MAUP, yielding higher Pearson's r values and lower RMSE compared to a more spatially homogenous dataset. Implementing aggregation sensitivity analysis in spatial studies could help to interpret complex results and disseminate robust products.
对按行政单位汇总的人口普查数据进行分析会引入一种称为可修改区域单元问题(MAUP)的统计偏差。先前的研究主要评估了 MAUP 对上推模型的影响。本研究有助于阐明 MAUP 对下推方法的影响,强调了预先选择的比例和形状如何影响结果。我们使用常规和不规则形状的三种行政普查级别,汇总了泰国鸡和鸭的精细分辨率普查数据。然后,我们在全球分析框架内对数据进行了细分,以两种不同的方式对预测因子进行了采样。我们对 Pearson r 相关统计和 RMSE 进行了敏感性分析,以了解响应变量的大小和形状如何影响拟合优度和下推性能。结果表明,比例而不是形状和采样方法影响了下推精度,这表明使用可用的最精细行政级别来训练模型更为可取。此外,与更具空间均一性的数据集相比,显示非均匀分布但具有空间聚类的数据集受 MAUP 的影响较小,与 RMSE 相比,Pearson r 值更高。在空间研究中实施聚合敏感性分析有助于解释复杂的结果并传播稳健的产品。