Schootman Mario, Sterling David A, Struthers James, Yan Yan, Laboube Ted, Emo Brett, Higgs Gary
Division of Health Behavior Research, Department of Medicine, Washington University School of Medicine, St. Louis, MO 63108, USA.
Ann Epidemiol. 2007 Jun;17(6):464-70. doi: 10.1016/j.annepidem.2006.10.015. Epub 2007 Apr 19.
We examined the geographic bias of four methods of geocoding addresses using ArcGIS, commercial firm, SAS/GIS, and aerial photography. We compared "point-in-polygon" (ArcGIS, commercial firm, and aerial photography) and the "look-up table" method (SAS/GIS) to allocate addresses to census geography, particularly as it relates to census-based poverty rates.
We randomly selected 299 addresses of children treated for asthma at an urban emergency department (1999-2001). The coordinates of the building address side door were obtained by constant offset based on ArcGIS and a commercial firm and true ground location based on aerial photography.
Coordinates were available for 261 addresses across all methods. For 24% to 30% of geocoded road/door coordinates the positional error was 51 meters or greater, which was similar across geocoding methods. The mean bearing was -26.8 degrees for the vector of coordinates based on aerial photography and ArcGIS and 8.5 degrees for the vector based on aerial photography and the commercial firm (p < 0.0001). ArcGIS and the commercial firm performed very well relative to SAS/GIS in terms of allocation to census geography. For 20%, the door location based on aerial photography was assigned to a different block group compared to SAS/GIS. The block group poverty rate varied at least two standard deviations for 6% to 7% of addresses.
We found important differences in distance and bearing between geocoding relative to aerial photography. Allocation of locations based on aerial photography to census-based geographic areas could lead to substantial errors.
我们使用ArcGIS、商业公司、SAS/GIS和航空摄影这四种地理编码地址的方法,研究了地理偏差情况。我们比较了“点在多边形内”(ArcGIS、商业公司和航空摄影)和“查找表”方法(SAS/GIS),以便将地址分配到人口普查地理区域,特别是与基于人口普查的贫困率相关的区域。
我们在一个城市急诊科随机选择了299个接受哮喘治疗儿童的地址(1999 - 2001年)。通过基于ArcGIS和商业公司的固定偏移量以及基于航空摄影的真实地面位置,获取建筑物地址侧门的坐标。
所有方法均获得了261个地址的坐标。对于24%至30%的地理编码道路/门坐标,位置误差为51米或更大,这在各种地理编码方法中相似。基于航空摄影和ArcGIS的坐标向量平均方位为 -26.8度,基于航空摄影和商业公司的坐标向量平均方位为8.5度(p < 0.0001)。在分配到人口普查地理区域方面,ArcGIS和商业公司相对于SAS/GIS表现非常出色。对于20%的情况,基于航空摄影的门位置与SAS/GIS相比被分配到不同的街区组。对于6%至7%的地址,街区组贫困率至少相差两个标准差。
我们发现相对于航空摄影,地理编码在距离和方位上存在重要差异。基于航空摄影将位置分配到基于人口普查的地理区域可能会导致重大误差。