Health Analysis Division, Statistics Canada, Ottawa, Ontario.
Department of Epidemiology and Community Medicine, University of Ottawa.
Health Rep. 2018 Feb 21;29(2):3-9.
Postal codes are often the only geographic identifier available for assigning contextual or environmental information to a study population. This analysis assesses the influence of three factors on the accuracy of postal code spatial assignment: delivery mode type (mode of postal delivery), representative point type (source of latitude-longitude coordinates), and community size.
PCCF+ (Postal Code Conversion File Plus) was used to assign delivery mode type, representative point type and community size to each individual in the 2011 Census of Canada. A sample (n = 1,004) was randomly selected, with a minimum of 90 observations for each category of those three factors. Using the address information of individuals in the sample, positions geocoded from residential postal codes (PCCF+) were compared with reference locations determined from full street addresses (Google Maps), and measures of positional accuracy were calculated in a geographic information system. Accuracy was measured as the distance between the postal code geocoded position and the full street address location.
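The distance measure described above can be illustrated with a minimal sketch. The study computed distances in a geographic information system and does not state the formula used; the haversine great-circle formula below is a swapped-in approximation, and the function name `haversine_km`, the mean Earth radius constant, and the example coordinates are assumptions made for illustration only.

```python
import math

# Assumption: mean Earth radius in kilometres (spherical approximation).
EARTH_RADIUS_KM = 6371.0088


def haversine_km(lat1, lon1, lat2, lon2):
    """Great-circle distance in km between two points given in decimal degrees.

    Hypothetical helper: (lat1, lon1) stands for the position geocoded from a
    postal code, (lat2, lon2) for the full street address reference location.
    """
    phi1 = math.radians(lat1)
    phi2 = math.radians(lat2)
    dphi = phi2 - phi1
    dlambda = math.radians(lon2 - lon1)
    a = (math.sin(dphi / 2) ** 2
         + math.cos(phi1) * math.cos(phi2) * math.sin(dlambda / 2) ** 2)
    # Clamp to 1.0 to guard against floating-point overshoot near antipodes.
    return 2 * EARTH_RADIUS_KM * math.asin(math.sqrt(min(1.0, a)))


# Illustrative coordinates only (not data from the study).
postal_code_point = (45.4215, -75.6972)
street_address_point = (45.4300, -75.6900)
error_km = haversine_km(*postal_code_point, *street_address_point)
```

A spherical approximation of this kind is usually adequate for errors of a few hundred metres to tens of kilometres; a projected or ellipsoidal distance, as a GIS would typically compute, may differ slightly.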
Positional accuracy was related primarily to mode of postal delivery. Rural and mixed (partly urban, partly rural) modes had much higher geocoding error than did urban modes. Postal codes in rural and small-town Canada, and those with latitude and longitude based on dissemination area centroids, also had low accuracy, largely because both are closely associated with rural and mixed modes of delivery.
The accuracy of geocoding from postal codes can vary. Geocoding imprecision may result in misclassification, depending on the spatial resolution of the environmental or contextual measures. Knowing the spatial resolution required for a study helps to identify subpopulations that should be excluded because of inadequate positional accuracy.