Department of Environmental and Occupational Health, College of Medicine, National Cheng Kung University, 1 University Road, Tainan, 701, Taiwan.
Department of Statistics, College of Management, National Cheng Kung University, Tainan, Taiwan.
Int J Health Geogr. 2021 Nov 11;20(1):45. doi: 10.1186/s12942-021-00298-6.
The presence of considerable spatial variability in incidence intensity suggests that risk factors are unevenly distributed in space and influence the geographical disease incidence distribution and pattern. As most human common diseases that challenge investigators are complex traits and as more factors associated with increased risk are discovered, statistical spatial models are needed that investigate geographical variability in the association between disease incidence and confounding variables and evaluate spatially varying effects on disease risk related to known or suspected risk factors. Information on geography that we focus on is geographical disease clusters of peak incidence and paucity of incidence.
We proposed and illustrated a statistical spatial model that incorporates information on known or hypothesized risk factors, previously detected geographical disease clusters of peak incidence and paucity of incidence, and their interactions as covariates into the framework of interaction regression models. The spatial scan statistic and the generalized map-based pattern recognition procedure that we recently developed were both considered for geographical disease cluster detection. The Freeman-Tukey transformation was applied to improve normality of distribution and approximately stabilize the variance in the model. We exemplified the proposed method by analyzing data on the spatial occurrence of sudden infant death syndrome (SIDS) with confounding variables of race and gender in North Carolina.
The analysis revealed the presence of spatial variability in the association between SIDS incidence and race. We differentiated spatial effects of race on SIDS incidence among previously detected geographical disease clusters of peak incidence and incidence paucity and areas outside the geographical disease clusters, determined by the spatial scan statistic and the generalized map-based pattern recognition procedure. Our analysis showed the absence of spatial association between SIDS incidence and gender.
The application to the SIDS incidence data demonstrates the ability of our proposed model to estimate spatially varying associations between disease incidence and confounding variables and distinguish spatially related risk factors from spatially constant ones, providing valuable inference for targeted environmental and epidemiological surveillance and management, risk stratification, and thorough etiologic studies of disease.
发病率强度存在相当大的空间变异性,这表明危险因素在空间上分布不均,并影响地理疾病发病率的分布和模式。由于大多数挑战研究人员的人类常见疾病都是复杂特征,并且随着与风险增加相关的更多因素被发现,因此需要统计空间模型来研究疾病发病率与混杂变量之间的空间变异性,并评估与已知或可疑危险因素相关的疾病风险的空间变化效应。我们关注的地理信息是发病率高峰和发病率低的地理疾病集群。
我们提出并说明了一个统计空间模型,该模型将已知或假设的危险因素、以前检测到的发病率高峰和发病率低的地理疾病集群及其相互作用作为协变量纳入交互回归模型框架。考虑了空间扫描统计量和我们最近开发的广义基于地图的模式识别程序来进行地理疾病集群检测。应用 Freeman-Tukey 变换来改善分布的正态性并近似稳定模型中的方差。我们通过分析北卡罗来纳州种族和性别混杂变量的婴儿猝死综合征(SIDS)的空间发生数据来举例说明所提出的方法。
分析表明,SIDS 发病率与种族之间的关联存在空间变异性。我们通过空间扫描统计量和广义基于地图的模式识别程序区分了种族对 SIDS 发病率的空间效应,这些程序确定了以前检测到的发病率高峰和发病率低的地理疾病集群以及地理疾病集群之外的区域。我们的分析表明,SIDS 发病率与性别之间不存在空间关联。
将该模型应用于 SIDS 发病率数据表明,我们提出的模型能够估计疾病发病率与混杂变量之间的空间变化关联,并区分空间相关的危险因素与空间不变的危险因素,为有针对性的环境和流行病学监测和管理、风险分层以及疾病的彻底病因研究提供有价值的推论。