Zimmerman Dale L, Fang Xiangming, Mazumdar Soumya
Statistics and Actuarial Science, University of Iowa, 241 Schaeffer Hall, Iowa City, IA 52242, USA.
Stat Med. 2008 Sep 20;27(21):4254-66. doi: 10.1002/sim.3288.
Geocoding a study population as completely as possible is an important data assimilation component of many spatial epidemiologic studies. Unfortunately, complete geocoding is rare in practice. The failure of a substantial proportion of study subjects' addresses to geocode has consequences for spatial analyses, some of which are not yet fully understood. This article explicitly demonstrates that the failure to geocode can be spatially clustered, and it investigates the implications of this for the detection of disease clustering. A data set of more than 9000 ground-truthed addresses from Carroll County, Iowa, which was geocoded via a standard address matching and street interpolation algorithm, is used for this purpose. Through simulation of disease processes at these addresses, the authors show that spatial clustering of geocoding failure has no effect on the marginal power to detect spatial disease clustering if the likelihood of disease is independent of the failure to geocode, but that power is substantially reduced if disease likelihood and geocoding failure are positively associated.
尽可能全面地对研究人群进行地理编码是许多空间流行病学研究中一个重要的数据同化组成部分。不幸的是,在实际操作中完全地理编码很少见。相当一部分研究对象的地址未能进行地理编码,这对空间分析产生了影响,其中一些影响尚未完全明了。本文明确表明,地理编码失败可能在空间上聚集,并探讨了这对疾病聚集检测的影响。为此使用了爱荷华州卡罗尔县9000多个经实地验证地址的数据集,该数据集通过标准地址匹配和街道插值算法进行地理编码。通过对这些地址处疾病过程的模拟,作者表明,如果疾病发生的可能性与地理编码失败无关,地理编码失败的空间聚集对检测空间疾病聚集的边际功效没有影响,但如果疾病发生可能性与地理编码失败呈正相关,则功效会大幅降低。