Lin Ge, Gray Judy, Qu Ming
College of Public Health, University of Nebraska Medical Center; Public Health Support, Nebraska Department of Health and Human Services, USA.
J Registry Manag. 2010 Summer;37(2):49-56.
This report summarizes geocoding improvement experiments in the Nebraska Cancer Registry. An initial assessment of previous geocoding suggested that some proven geocoding procedures had not been followed and that overall results were unacceptable. The study concluded that, when updating address files from different time periods, it is sufficient to use the most recent street centerline database. A match score of 80 combined with a spelling sensitivity of 80 in ESRI's ArcGIS geocoder is sufficient for most geocoding purposes. Given the sizable number of unmatched addresses, the Google Maps geocoding service was also used. A comparison of 1,500 high-quality addresses matched by both the Google Maps and ArcGIS geocoders shows that, in most cases, the location discrepancies between the two were acceptable. Across the 1,500 pairs of coded locations, the median distance was 36.6 meters and the mean was 92.8 meters. Distance discrepancies were larger in urban fringe areas and smaller toward urban centers. It was concluded that a match rate of 90% or even 95% is achievable by strictly following proven procedures, including an address coding specification, Internet-based White Pages for reverse address finding, and Internet-based geocoding.
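The paired-location comparison described above can be illustrated with a minimal sketch. It assumes each address has one latitude/longitude pair from each geocoder and measures their separation with the haversine great-circle formula. The abstract does not say how the authors computed distances, so the formula, the function names (`haversine_m`, `discrepancy_summary`), and the sample coordinates are all hypothetical choices for illustration only.

```python
import math
import statistics

EARTH_RADIUS_M = 6_371_000  # mean Earth radius in meters (spherical approximation)


def haversine_m(lat1, lon1, lat2, lon2):
    """Great-circle distance in meters between two points given in decimal degrees."""
    phi1, phi2 = math.radians(lat1), math.radians(lat2)
    dphi = phi2 - phi1
    dlmb = math.radians(lon2 - lon1)
    a = math.sin(dphi / 2) ** 2 + math.cos(phi1) * math.cos(phi2) * math.sin(dlmb / 2) ** 2
    return 2 * EARTH_RADIUS_M * math.asin(math.sqrt(a))


def discrepancy_summary(pairs):
    """Summarize distances for an iterable of ((lat_a, lon_a), (lat_b, lon_b)) pairs.

    Each pair holds the two geocoders' results for the same address.
    """
    distances = [haversine_m(a[0], a[1], b[0], b[1]) for a, b in pairs]
    return {
        "n": len(distances),
        "median_m": statistics.median(distances),
        "mean_m": statistics.mean(distances),
    }


# Hypothetical coordinates, NOT data from the study: three addresses, each
# geocoded once by geocoder A and once by geocoder B.
sample_pairs = [
    ((41.2565, -95.9345), (41.2567, -95.9345)),
    ((40.8136, -96.7026), (40.8136, -96.7020)),
    ((40.9250, -98.3420), (40.9270, -98.3420)),
]

summary = discrepancy_summary(sample_pairs)
print(summary)
```

In this toy sample the mean exceeds the median because one pair is much farther apart than the others. The reported figures show the same pattern (mean 92.8 meters against median 36.6 meters), which is consistent with a minority of large discrepancies, such as those the abstract notes in urban fringe areas.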