National Exposure Research Laboratory, U.S. Environmental Protection Agency, RTP, NC, United States.
National Center for Environmental Assessment, U.S. Environmental Protection Agency, RTP, NC, United States.
Sci Total Environ. 2014 Feb 1;470-471:631-8. doi: 10.1016/j.scitotenv.2013.10.019. Epub 2013 Oct 27.
Epidemiological studies have observed between city heterogeneity in PM2.5-mortality risk estimates. These differences could potentially be due to the use of central-site monitors as a surrogate for exposure which do not account for an individual's activities or ambient pollutant infiltration to the indoor environment. Therefore, relying solely on central-site monitoring data introduces exposure error in the epidemiological analysis. The amount of exposure error produced by using the central-site monitoring data may differ by city. The objective of this analysis was to cluster cities with similar exposure distributions based on residential infiltration and in-vehicle commuting characteristics. Factors related to residential infiltration and commuting were developed from the American Housing Survey (AHS) from 2001 to 2005 for 94 Core-Based Statistical Areas (CBSAs). We conducted two separate cluster analyses using a k-means clustering algorithm to cluster CBSAs based on these factors. The first only included residential infiltration factors (i.e. percent of homes with central air conditioning (AC) mean year home was built, and mean home size) while the second incorporated both infiltration and commuting (i.e. mean in-vehicle commuting time and mean in-vehicle commuting distance) factors. Clustering on residential infiltration factors resulted in 5 clusters, with two having distinct exposure distributions. Cluster 1 consisted of cities with older, smaller homes with less central AC while homes in Cluster 2 cities were newer, larger, and more likely to have central AC. Including commuting factors resulted in 10 clusters. Clusters with shorter in-vehicle commuting times had shorter in-vehicle commuting distances. Cities with newer homes also tended to have longer commuting times and distances. This is the first study to employ cluster analysis to group cities based on exposure factors. Identifying cities with similar exposure distributions may help explain city-to-city heterogeneity in PM2.5 mortality risk estimates.
流行病学研究观察到 PM2.5 死亡率风险估计值在城市之间存在异质性。这些差异可能是由于使用中心站点监测器作为暴露的替代物,而这些监测器不能考虑到个体的活动或环境污染物渗透到室内环境中。因此,仅依靠中心站点监测数据会在流行病学分析中引入暴露误差。使用中心站点监测数据产生的暴露误差量可能因城市而异。本分析的目的是根据住宅渗透和车内通勤特征对具有相似暴露分布的城市进行聚类。与住宅渗透和通勤有关的因素是根据 2001 年至 2005 年美国住房调查(AHS)为 94 个核心基础统计区(CBSA)开发的。我们使用 k-均值聚类算法进行了两次单独的聚类分析,根据这些因素对 CBSA 进行聚类。第一次聚类仅包括住宅渗透因素(即家中装有中央空调的百分比、房屋平均建造年份和平均房屋面积),而第二次聚类则包含渗透和通勤因素(即平均车内通勤时间和平均车内通勤距离)。基于住宅渗透因素的聚类产生了 5 个聚类,其中两个聚类具有明显不同的暴露分布。聚类 1 由房屋较旧、较小且中央空调较少的城市组成,而聚类 2 城市的房屋更新、更大且更有可能装有中央空调。包括通勤因素后,结果产生了 10 个聚类。车内通勤时间较短的聚类车内通勤距离也较短。新房较多的城市通勤时间和距离也往往更长。这是第一项利用聚类分析根据暴露因素对城市进行分组的研究。确定具有相似暴露分布的城市可能有助于解释 PM2.5 死亡率风险估计值在城市之间的异质性。