From the Department of Population and Public Health Sciences, Keck School of Medicine, University of Southern California, Los Angeles, CA.
Spatial Sciences Institute, University of Southern California, Los Angeles, CA.
Epidemiology. 2025 Jan 1;36(1):115-118. doi: 10.1097/EDE.0000000000001800. Epub 2024 Oct 1.
To our knowledge, no agreed-upon best practices exist for joining U.S. Census ZIP Code Tabulation Areas (ZCTAs) and U.S. Postal Service ZIP Codes (ZIPs). One-to-one linkage using 5-digit ZCTA identifiers excludes ZIPs without direct matches. "Crosswalk" linkage may match a ZCTA to multiple ZIPs, avoiding losses.
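The two linkage strategies can be illustrated with a minimal Python sketch. It is not the authors' implementation: the crosswalk rows, the counts, and the function names `link_one_to_one` and `link_crosswalk` are hypothetical, and the sketch assumes a crosswalk given as simple (ZIP, ZCTA) pairs.

```python
from collections import defaultdict

# Hypothetical crosswalk rows (ZIP, ZCTA); values are made up for illustration.
CROSSWALK = [
    ("90001", "90001"),  # ZIP whose code equals its ZCTA identifier
    ("90002", "90002"),
    ("90009", "90045"),  # ZIP with no same-numbered ZCTA
    ("90080", "90045"),
    ("90045", "90045"),
]

# Hypothetical ZIP-level counts (e.g., deaths) to be linked to ZCTAs.
ZIP_COUNTS = {"90001": 12, "90002": 7, "90009": 1, "90080": 2, "90045": 20}


def link_one_to_one(zip_counts, zcta_ids):
    """Keep only ZIPs whose 5-digit code is itself a ZCTA identifier."""
    return {z: n for z, n in zip_counts.items() if z in zcta_ids}


def link_crosswalk(zip_counts, crosswalk):
    """Aggregate ZIP counts to ZCTAs; several ZIPs may map to one ZCTA."""
    zip_to_zcta = dict(crosswalk)
    totals = defaultdict(int)
    for z, n in zip_counts.items():
        zcta = zip_to_zcta.get(z)
        if zcta is not None:  # ZIPs absent from the crosswalk are skipped
            totals[zcta] += n
    return dict(totals)


zcta_ids = {zcta for _, zcta in CROSSWALK}
one_to_one = link_one_to_one(ZIP_COUNTS, zcta_ids)
crosswalked = link_crosswalk(ZIP_COUNTS, CROSSWALK)

# ZIPs lost under one-to-one linkage but retained by the crosswalk.
dropped = sorted(set(ZIP_COUNTS) - set(one_to_one))
```

In this toy data, the one-to-one linkage drops ZIPs 90009 and 90080, whereas the crosswalk folds their counts into ZCTA 90045.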
We compared noncrosswalk and crosswalk linkages nationally and for mortality and health insurance data in California. To assess the implications for selection, we used generalized additive models to relate sociodemographic characteristics to whether ZCTAs contained nonmatching ZIPs.
Nationwide, 15% of ZCTAs had nonmatching ZIPs, i.e., ZIPs dropped under noncrosswalk linkage. Having nonmatching ZIPs was positively associated with metropolitan core location, lower socioeconomic status, and non-White population. In California, 34% of ZIPs in the mortality data and 25% in the health insurance data had ZCTAs with nonmatching ZIPs; however, these ZIPs accounted for only 0.03% of total mortality and 0.44% of total insurance enrollees.
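The two kinds of percentages reported above (share of ZCTAs affected versus share of records affected) can be computed along the following lines. This is a hedged sketch rather than the authors' code: it assumes a nonmatching ZIP is one whose code differs from the ZCTA it is assigned to, and the function name `nonmatching_summary` and all data values are hypothetical.

```python
def nonmatching_summary(crosswalk, zip_counts):
    """Return two shares as fractions between 0 and 1:
    (ZCTAs containing at least one nonmatching ZIP,
     records located in nonmatching ZIPs).
    """
    zctas = {zcta for _, zcta in crosswalk}
    # Assumption: a ZIP is "nonmatching" when its code differs from its ZCTA.
    nonmatching_zips = {z for z, zcta in crosswalk if z != zcta}
    affected_zctas = {zcta for z, zcta in crosswalk if z != zcta}

    total = sum(zip_counts.values())
    lost = sum(n for z, n in zip_counts.items() if z in nonmatching_zips)

    zcta_share = len(affected_zctas) / len(zctas) if zctas else 0.0
    record_share = lost / total if total else 0.0
    return zcta_share, record_share


# Hypothetical inputs, made up for illustration only.
crosswalk = [
    ("90001", "90001"),
    ("90002", "90002"),
    ("90009", "90045"),
    ("90080", "90045"),
    ("90045", "90045"),
]
zip_counts = {"90001": 12, "90002": 7, "90009": 1, "90080": 2, "90045": 20}

zcta_share, record_share = nonmatching_summary(crosswalk, zip_counts)
```

The toy data shows how the two shares can diverge: one of three ZCTAs is affected, yet only 3 of 42 records sit in nonmatching ZIPs, which mirrors the pattern of a sizable share of geographic units but a small share of the population.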
Our study findings support the use of crosswalk linkages and ZCTAs as a unit of analysis. One-to-one linkage may cause bias by differentially excluding ZIPs with more disadvantaged populations, although affected population sizes seem small.