Department of Environmental Health, Harvard School of Public Health, Boston, MA 02115, USA.
Environ Int. 2013 Sep;59:244-54. doi: 10.1016/j.envint.2013.06.003. Epub 2013 Jul 9.
Heterogeneity in the response to PM2.5 is hypothesized to be related to differences in particle composition across monitoring sites which reflect differences in source types as well as climatic and topographic conditions impacting different geographic locations. Identifying spatial patterns in particle composition is a multivariate problem that requires novel methodologies.
Use cluster analysis methods to identify spatial patterns in PM2.5 composition. Verify that the resulting clusters are distinct and informative.
109 monitoring sites with 75% reported speciation data during the period 2003-2008 were selected. These sites were categorized based on their average PM2.5 composition over the study period using k-means cluster analysis. The obtained clusters were validated and characterized based on their physico-chemical characteristics, geographic locations, emissions profiles, population density and proximity to major emission sources.
Overall 31 clusters were identified. These include 21 clusters with 2 or more sites which were further grouped into 4 main types using hierarchical clustering. The resulting groupings are chemically meaningful and represent broad differences in emissions. The remaining clusters, encompassing single sites, were characterized based on their particle composition and geographic location.
The framework presented here provides a novel tool which can be used to identify and further classify sites based on their PM2.5 composition. The solution presented is fairly robust and yielded groupings that were meaningful in the context of air-pollution research.
人们假设,对 PM2.5 的反应存在异质性,这与监测点的颗粒物成分差异有关,而这些差异反映了源类型的差异以及影响不同地理位置的气候和地形条件的差异。识别颗粒物成分的空间模式是一个多变量问题,需要新的方法。
使用聚类分析方法识别 PM2.5 成分的空间模式。验证得到的聚类是明显且有信息的。
选择了 109 个监测点,这些监测点在 2003 年至 2008 年期间有 75%的报告了特定物质的数据。这些站点根据其在研究期间的平均 PM2.5 成分使用 K-均值聚类分析进行分类。根据其理化特性、地理位置、排放特征、人口密度和与主要排放源的接近程度,对获得的聚类进行验证和描述。
总共确定了 31 个聚类。其中包括 21 个有 2 个或更多站点的聚类,这些聚类进一步使用层次聚类法分为 4 种主要类型。由此产生的分组具有化学意义,代表了排放的广泛差异。其余的聚类,包括单个站点,根据其颗粒物成分和地理位置进行描述。
这里提出的框架提供了一种新的工具,可以用于根据 PM2.5 成分识别和进一步对站点进行分类。所提出的解决方案相当稳健,产生的分组在空气污染研究的背景下具有意义。