Department of Statistics and Actuarial Science and Department of Biostatistics, and Center for Health Policy and Research, University of Iowa, Iowa City, IA 52242, U.S.A.
Stat Med. 2010 Apr 30;29(9):1025-36. doi: 10.1002/sim.3836. Epub 2010 Jan 19.
Automated geocoding of patient addresses is an important data assimilation component of many spatial epidemiologic studies. Inevitably, the geocoding process results in positional errors. Positional errors incurred by automated geocoding tend to reduce the power of tests for disease clustering and otherwise affect spatial analytic methods. However, there are reasons to believe that the errors may often be positively spatially correlated and that this may mitigate their deleterious effects on spatial analyses. In this article, we demonstrate explicitly that the positional errors associated with automated geocoding of a data set of more than 6000 addresses in Carroll County, Iowa are spatially autocorrelated. Furthermore, through two simulation studies of disease processes, including one in which the disease process is overlain upon the Carroll County addresses, we show that spatial autocorrelation among geocoding errors maintains the power of two tests for disease clustering at a level higher than that which would occur if the errors were independent. Implications of these results for cluster detection, privacy protection, and measurement error modeling of geographic health data are discussed.
自动地址编码是许多空间流行病学研究中重要的数据同化组成部分。不可避免的是,地理编码过程会产生位置误差。自动地理编码产生的位置误差往往会降低疾病聚类检验的功效,从而影响空间分析方法。然而,有理由相信这些错误可能经常具有正空间相关性,这可能会减轻它们对空间分析的有害影响。在本文中,我们明确地证明了爱荷华州卡罗尔县 6000 多个地址数据集的自动地理编码相关的位置误差具有空间自相关性。此外,通过对两种疾病过程的模拟研究,包括一种将疾病过程叠加在卡罗尔县地址上的情况,我们表明,地理编码误差之间的空间自相关保持了两种疾病聚类检验的功效,其水平高于如果误差是独立的情况下的功效。讨论了这些结果对聚类检测、隐私保护和地理健康数据测量误差建模的影响。