School of Mathematics and Statistics, 3526University of Glasgow, UK.
Stat Methods Med Res. 2022 Jun;31(6):1184-1203. doi: 10.1177/09622802221084131. Epub 2022 Mar 14.
Conditional autoregressive models are typically used to capture the spatial autocorrelation present in areal unit disease count data when estimating the spatial pattern in disease risk. This correlation is represented by a binary neighbourhood matrix based on a border sharing specification, which enforces spatial correlation between geographically neighbouring areas. However, enforcing such correlation will mask any discontinuities in the disease risk surface, thus impeding the detection of clusters of areas that exhibit higher or lower risks compared to their neighbours. Here we propose novel methodology to account for these clusters and discontinuities in disease risk via a two-stage modelling approach, which either forces the clusters/discontinuities to be the same for all time periods or allows them to evolve dynamically over time. Stage one constructs a set of candidate neighbourhood matrices to represent a range of possible cluster/discontinuity structures in the data, and stage two estimates an appropriate structure(s) by treating the neighbourhood matrix as an additional parameter to estimate within a Bayesian spatio-temporal disease mapping model. The effectiveness of our novel methodology is evidenced by simulation, before being applied to a new study of respiratory disease risk in Greater Glasgow, Scotland from 2011 to 2017.
条件自回归模型通常用于在估计疾病风险的空间模式时,捕获面单元疾病计数数据中存在的空间自相关。这种相关性由基于边界共享规范的二进制邻域矩阵表示,该规范强制地理上相邻区域之间的空间相关性。然而,强制执行这种相关性会掩盖疾病风险表面的任何不连续性,从而阻碍检测与邻居相比表现出更高或更低风险的区域集群。在这里,我们通过两阶段建模方法提出了一种新的方法来考虑这些疾病风险中的聚类和不连续性,该方法要么强制聚类/不连续性在所有时间段内保持相同,要么允许它们随时间动态演变。第一阶段构建了一组候选邻域矩阵,以表示数据中一系列可能的聚类/不连续性结构,第二阶段通过将邻域矩阵视为贝叶斯时空疾病映射模型中要估计的附加参数来估计适当的结构。我们的新方法的有效性通过模拟得到了证明,然后将其应用于 2011 年至 2017 年苏格兰格拉斯哥大地区呼吸系统疾病风险的一项新研究中。