Foundation for the Promotion of Health and Biomedical Research of Valencia Region, Valencia, Spain.
Departamento de Análisis Económico y Finanzas; Instituto de Desarrollo Regional, Universidad de Castilla la Mancha, Albacete, Spain.
Stat Med. 2018 Oct 15;37(23):3325-3337. doi: 10.1002/sim.7819. Epub 2018 May 27.
Zero excess in the study of geographically referenced mortality data sets has been the focus of considerable attention in the literature, with zero-inflation being the most common procedure to handle this lack of fit. Although hurdle models have also been used in disease mapping studies, their use is more rare. We show in this paper that models using particular treatments of zero excesses are often required for achieving appropriate fits in regular mortality studies since, otherwise, geographical units with low expected counts are oversmoothed. However, as also shown, an indiscriminate treatment of zero excess may be unnecessary and has a problematic implementation. In this regard, we find that naive zero-inflation and hurdle models, without an explicit modeling of the probabilities of zeroes, do not fix zero excesses problems well enough and are clearly unsatisfactory. Results sharply suggest the need for an explicit modeling of the probabilities that should vary across areal units. Unfortunately, these more flexible modeling strategies can easily lead to improper posterior distributions as we prove in several theoretical results. Those procedures have been repeatedly used in the disease mapping literature, and one should bear these issues in mind in order to propose valid models. We finally propose several valid modeling alternatives according to the results mentioned that are suitable for fitting zero excesses. We show that those proposals fix zero excesses problems and correct the mentioned oversmoothing of risks in low populated units depicting geographic patterns more suited to the data.
在地理参考死亡率数据集的研究中,零超额一直是文献中相当关注的焦点,零膨胀是处理这种不拟合的最常见方法。尽管障碍模型也已在疾病映射研究中使用,但它们的使用更为罕见。本文表明,在常规死亡率研究中,为了达到适当的拟合,通常需要使用特定的零超额处理模型,因为否则,预期计数较低的地理单元会过度平滑。然而,正如也表明的那样,对零超额的不加区分的处理可能是不必要的,并且实施起来存在问题。在这方面,我们发现,没有明确建模零的概率的简单零膨胀和障碍模型并不能很好地解决零超额问题,显然是不够令人满意的。结果强烈表明需要明确建模概率,这些概率应因区域单位而异。不幸的是,正如我们在几个理论结果中证明的那样,这些更灵活的建模策略很容易导致不当的后验分布。这些程序在疾病映射文献中被反复使用,为了提出有效的模型,人们应该牢记这些问题。根据上述结果,我们最终提出了几种适合拟合零超额的有效建模替代方案。我们表明,这些建议解决了零超额问题,并纠正了在人口较少的单元中描绘更适合数据的地理模式时存在的风险过度平滑问题。