Wang Wei, Xiao Xiong, Qian Jian, Chen Shiqi, Liao Fang, Yin Fei, Zhang Tao, Li Xiaosong, Ma Yue
West China School of Public Health and West China Fourth Hospital, Sichuan University, Chengdu, China.
Women and Children's Health Management Department, Sichuan Provincial Hospital for Women and Children, Chengdu, China.
Stat Med. 2022 Jul 10;41(15):2939-2956. doi: 10.1002/sim.9395. Epub 2022 Mar 28.
Most spatial models include a spatial weights matrix (W) derived from the first law of geography to adjust the spatial dependence to fulfill the independence assumption. In various fields such as epidemiological and environmental studies, the spatial dependence often shows clustering (or geographic discontinuity) due to natural or social factors. In such cases, adjustment using the first-law-of-geography-based W might be inappropriate and leads to inaccuracy estimations and loss of statistical power. In this work, we propose a series of data-driven Ws (DDWs) built following the spatial pattern identified by the scan statistic, which can be easily carried out using existing tools such as SaTScan software. The DDWs take both the clustering (or discontinuous) and the intuitive first-law-of-geographic-based spatial dependence into consideration. Aiming at two common purposes in epidemiology studies (ie, estimating the effect value of explanatory variable X and estimating the risk of each spatial unit in disease mapping), the common spatial autoregressive models and the Leroux-prior-based conditional autoregressive (CAR) models were selected to evaluate performance of DDWs, respectively. Both simulation and case studies show that our DDWs achieve considerably better performance than the classic W in datasets with clustering (or discontinuous) spatial dependence. Furthermore, the latest published density-based spatial clustering models, aiming at dealing with such clustering (or discontinuity) spatial dependence in disease mapping, were also compared as references. The DDWs, incorporated into the CAR models, still show considerable advantage, especially in the datasets for common diseases.
大多数空间模型都包含一个基于地理学第一定律导出的空间权重矩阵(W),以调整空间依赖性,从而满足独立性假设。在流行病学和环境研究等各个领域,由于自然或社会因素,空间依赖性往往表现出聚类(或地理不连续性)。在这种情况下,使用基于地理学第一定律的W进行调整可能不合适,会导致估计不准确和统计功效的损失。在这项工作中,我们提出了一系列数据驱动的W(DDW),它们是根据扫描统计量识别出的空间模式构建的,可以使用诸如SaTScan软件等现有工具轻松实现。DDW既考虑了聚类(或不连续),也考虑了基于地理学第一定律的直观空间依赖性。针对流行病学研究中的两个常见目的(即估计解释变量X的效应值和估计疾病地图中每个空间单元的风险),分别选择了常见的空间自回归模型和基于勒鲁先验的条件自回归(CAR)模型来评估DDW的性能。模拟和案例研究均表明,在具有聚类(或不连续)空间依赖性的数据集中,我们的DDW比经典的W具有显著更好的性能。此外,还比较了最新发表的基于密度的空间聚类模型作为参考,该模型旨在处理疾病地图中的此类聚类(或不连续性)空间依赖性。纳入CAR模型的DDW仍然显示出相当大的优势,尤其是在常见疾病的数据集中。