Department of Geography, University of Illinois at Urbana-Champaign, Room 220 Davenport Hall, 607 S. Mathews Ave, Urbana, IL 61801-3671, USA.
Int J Health Geogr. 2010 Oct 19;9:51. doi: 10.1186/1476-072X-9-51.
This paper examines the effect of spatial aggregation error on statistical estimates of the association between spatial access to health care and late-stage cancer.
Monte Carlo simulation was used to disaggregate cancer cases for two Illinois counties from zip code to census block in proportion to the age-race composition of the block population. After the disaggregation, a hierarchical logistic model was estimated examining the relationship between late-stage breast cancer and risk factors including travel distance to mammography, at both the zip code and census block levels. Model coefficients were compared between the two levels to assess the impact of spatial aggregation error.
We found that spatial aggregation error influences the coefficients of regression-type models at the zip code level, and this impact is highly dependent on the study area. In one study area (Kane County), block-level coefficients were very similar to those estimated on the basis of zip code data; whereas in the other study area (Peoria County), the two sets of coefficients differed substantially raising the possibility of drawing inaccurate inferences about the association between distance to mammography and late-stage cancer risk.
Spatial aggregation error can significantly affect the coefficient values and inferences drawn from statistical models of the association between cancer outcomes and spatial and non-spatial variables. Relying on data at the zip code level may lead to inaccurate findings on health risk factors.
本文研究了空间聚集误差对医疗保健空间可达性与晚期癌症之间关联的统计估计的影响。
使用蒙特卡罗模拟将伊利诺伊州两个县的癌症病例按比例从邮政编码细分到人口普查块,以块人口的年龄种族构成为依据。细分后,在邮政编码和人口普查块两个层面上,使用分层逻辑回归模型来评估晚期乳腺癌与包括乳房 X 光检查旅行距离在内的风险因素之间的关系。在这两个层面上比较模型系数,以评估空间聚集误差的影响。
我们发现,空间聚集误差会影响邮政编码层面回归型模型的系数,而这种影响高度依赖于研究区域。在一个研究区域(凯恩县),块级系数与基于邮政编码数据估计的系数非常相似;而在另一个研究区域(皮奥里亚县),这两组系数则有很大差异,这增加了对乳房 X 光检查距离与晚期癌症风险之间关联的推断不准确的可能性。
空间聚集误差会显著影响癌症结果与空间和非空间变量之间关联的统计模型的系数值和推断。依赖邮政编码层面的数据可能会导致对健康风险因素的不准确发现。