Luo Lan
Geospat Health. 2013 Nov;8(1):22-35. doi: 10.4081/gh.2013.51.
The paper aims to estimate the level and impact of spatial aggregation error for spatial scan statistics where disaggregated data below the zip code level are not available. Data on colorectal cancer cases in Cook county, Illinois, USA with a 5-year interval were used. An innovative procedure using SAS and Java was designed to make SaTScan auto-run. Characteristics of clusters at each reference level were compared to those at zip code level to observe differences related to spatial aggregation. The comparison reveals that spatial scan statistic at the zip code level can generate reliable clusters in areas with a large number of cases, but fail to detect clusters in areas where there are a sparse number of cases, since the spatial aggregation error is minimised in areas with sizeable numbers of cases. Without localised cancer data, zip code level data can be used effectively to identify dominant clusters. However, smaller clusters located in low-density areas may be missed.
本文旨在估计在无法获得邮政编码级别以下的分解数据时,空间扫描统计的空间聚集误差水平及其影响。使用了美国伊利诺伊州库克县5年间隔期的结直肠癌病例数据。设计了一种使用SAS和Java的创新程序,以使SaTScan自动运行。将每个参考级别的聚类特征与邮政编码级别的聚类特征进行比较,以观察与空间聚集相关的差异。比较结果表明,邮政编码级别的空间扫描统计可以在病例数较多的区域生成可靠的聚类,但在病例数较少的区域无法检测到聚类,因为在病例数较多的区域空间聚集误差最小化。如果没有局部癌症数据,邮政编码级别的数据可以有效地用于识别主要聚类。然而,位于低密度区域的较小聚类可能会被遗漏。