Suppr超能文献

利用土地利用回归和约束因子分析评估室内和室外空气污染的异质性。

Evaluating heterogeneity in indoor and outdoor air pollution using land-use regression and constrained factor analysis.

作者信息

Levy Jonathan I, Clougherty Jane E, Baxter Lisa K, Houseman E Andres, Paciorek Christopher J

机构信息

Harvard School of Public Health, Department of Environmental Health, USA.

出版信息

Res Rep Health Eff Inst. 2010 Dec(152):5-80; discussion 81-91.

Abstract

Previous studies have identified associations between traffic exposures and a variety of adverse health effects, but many of these studies relied on proximity measures rather than measured or modeled concentrations of specific air pollutants, complicating interpretability of the findings. An increasing number of studies have used land-use regression (LUR) or other techniques to model small-scale variability in concentrations of specific air pollutants. However, these studies have generally considered a limited number of pollutants, focused on outdoor concentrations (or indoor concentrations of ambient origin) when indoor concentrations are better proxies for personal exposures, and have not taken full advantage of statistical methods for source apportionment that may have provided insight about the structure of the LUR models and the interpretability of model results. Given these issues, the primary objective of our study was to determine predictors of indoor and outdoor residential concentrations of multiple traffic-related air pollutants within an urban area, based on a combination of central site monitoring data; geographic information system (GIS) covariates reflecting traffic and other outdoor sources; questionnaire data reflecting indoor sources and activities that affect ventilation rates; and factor-analytic methods to better infer source contributions. As part of a prospective birth cohort study assessing asthma etiology in urban Boston, we collected indoor and/or outdoor 3-to-4 day samples of nitrogen dioxide (NO2) and fine particulate matter with an aerodynamic diameter or = 2.5 pm (PM2.5) at 44 residences during multiple seasons of the year from 2003 through 2005. We performed reflectance analysis, x-ray fluorescence spectroscopy (XRF), and high-resolution inductively coupled plasma-mass spectrometry (ICP-MS) on particle filters to estimate the concentrations of elemental carbon (EC), trace elements, and water-soluble metals, respectively. We derived multiple indicators of traffic using Massachusetts Highway Department (MHD) data and traffic counts collected outside the residences where the air monitoring was conducted. We used a standardized questionnaire to collect data on home characteristics and occupant behaviors. Additional housing information was collected through property tax records. Ambient concentrations of pollutants as well as meteorological data were collected from centrally located ambient monitors. We used GIS-based LUR models to explain spatial and temporal variability in residential outdoor concentrations of PM2.5, EC, and NO2. We subsequently derived latent-source factors for residential outdoor concentrations using confirmatory factor analysis constrained to nonnegative loadings. We developed LUR models to determine whether GIS covariates and other predictors explain factor variability and thereby support initial factor interpretations. To evaluate indoor concentrations, we developed physically interpretable regression models that explored the relationship between measured indoor and outdoor concentrations, relying on questionnaire data to characterize indoor sources and activities. Because outdoor pollutant concentrations measured directly outside of homes are unlikely to be available for most large epidemiologic studies, we developed regression models to explain indoor concentrations of PM2.5, EC, and NO2 as a function of other, more readily available data: GIS covariates, questionnaire data reflecting both sources and ventilation, and central site monitoring data. As we did for outdoor concentrations, we then derived latent-source factors for residential indoor concentrations and developed regression models explaining variability in these indoor latent-source factors. Finally, to provide insight about the effects of improved characterization of exposures for the results of subsequent epidemiologic investigations, we developed a simulation framework to quantitatively compare the implications of using exposure models derived from validation studies with the use of other surrogate models with varying amounts of measurement error. The concentrations of outdoor PM2.5 were strongly associated with the central site monitor data, whereas EC concentrations showed greater spatial variability, especially during colder months, and were predicted by the length of roadway within 200 m of the home. Outdoor NO2 also showed significant spatial variability, predicted in part by population density and roadway length within 50 m of the home. Our constrained factor analysis of outdoor concentrations produced loadings indicating long-range transport, brake wear and traffic exhaust, diesel exhaust, fuel oil combustion, and resuspended road dust as sources; corresponding LUR models largely corroborated these factor interpretations through covariate significance. For example, long-range transport was predicted by central site PM2.5, and season, brake wear and traffic exhaust and resuspended road dust by traffic and residential density, diesel exhaust by the percentage of diesel traffic on the nearest major road, and fuel oil combustion by population density. Our modeling of the concentrations of indoor pollutants demonstrated substantial variability in indoor-outdoor relationships across constituents, helping to separate constituents dominated by outdoor sources (e.g., S, Se, and V) from those dominated by indoor sources (e.g., Ca and Si). Regression models indicated that indoor PM2.5 was not influenced substantially by local traffic but had significant indoor sources (cooking activity and occupant density), while EC was associated with distance to the nearest designated truck route, and NO2 was associated with both traffic density within 50 m of the home and gas stove usage. Our constrained factor analysis of indoor concentrations helped to separate outdoor-dominated factors from indoor-dominated factors, though some factors appeared to be influenced by both indoor and outdoor sources. Subsequent factor analyses of the indoor-attributable fractions from indoor-outdoor regression models provided generally consistent interpretations of indoor-dominated factors. The use of regression models on indoor factors demonstrated the limited predictive power of questionnaire data related to indoor sources, but reinforced the viability of modeling indoor concentrations of pollutants of ambient origin. In spite of the relatively weak predictive power of some of the indoor-concentration regression models, our epidemiologic simulations illustrated that exposure models with fairly modest R2 values (in the range of 0.3 through 0.4, corresponding with the regression models for PM2.5 and NO2) yielded substantial improvements in epidemiologic study performance relative to the use of exposure proxies that could be applied in the absence of validation studies. In spite of limitations related to sample size and available covariate data, our study demonstrated significant outdoor spatial variability within an urban area in NO2 and in several constituents of airborne particles. LUR techniques combined with constrained factor analysis helped to disentangle the contributions to temporal variability of local, long-range transport, and other sources, ultimately allowing exposures from defined source categories to be investigated in epidemiologic studies. For the indoor residential environment, we demonstrated substantial variability in indoor-outdoor relationships among particle constituents; then, using information from public databases and focused questionnaire data, we were able to predict indoor concentrations for a subset of key pollutants. Constrained factor analysis methods applied to the indoor environment helped to separate indoor sources from outdoor sources. The corresponding indoor regression models had limited predictive power, reinforcing the complexity of characterizing the indoor environment when only limited information about key predictors is available. This finding also underscores the likelihood that these regression models might characterize indoor concentrations of pollutants with ambient origins better than they can the indoor concentrations from all sources. Our findings provide direction for future studies characterizing indoor exposure sources and patterns, and our epidemiologic simulation reinforced the importance of reducing measurement error in a context where many traffic-related air pollutants are influenced by both indoor and outdoor sources. The combination of analytical techniques used in our study could ultimately allow for more refined exposure characterization and evaluation of the relative contributions of various sources to health outcomes in epidemiologic studies.

摘要

以往的研究已经确定了交通暴露与多种不良健康影响之间的关联,但其中许多研究依赖于接近度测量,而非特定空气污染物的实测浓度或模拟浓度,这使得研究结果的可解释性变得复杂。越来越多的研究使用土地利用回归(LUR)或其他技术来模拟特定空气污染物浓度的小尺度变异性。然而,这些研究通常只考虑了有限数量的污染物,在室内浓度更能代表个人暴露时关注的却是室外浓度(或源自室外的室内浓度),并且没有充分利用源解析统计方法,而这些方法可能有助于深入了解LUR模型的结构和模型结果的可解释性。鉴于这些问题,我们研究的主要目的是基于以下数据的组合,确定城市区域内与交通相关的多种空气污染物在室内和室外的住宅浓度预测因子:中心站点监测数据;反映交通和其他室外源的地理信息系统(GIS)协变量;反映室内源和影响通风率活动的问卷数据;以及因子分析方法,以更好地推断源贡献。作为一项评估波士顿市区哮喘病因的前瞻性出生队列研究的一部分,我们在2003年至2005年的多个季节里,在44处住宅中收集了二氧化氮(NO₂)和空气动力学直径≤2.5微米的细颗粒物(PM₂.₅)的室内和/或室外3至4天样本。我们对颗粒物过滤器进行了反射率分析、X射线荧光光谱(XRF)和高分辨率电感耦合等离子体质谱(ICP-MS),以分别估算元素碳(EC)、微量元素和水溶性金属的浓度。我们利用马萨诸塞州公路部(MHD)的数据和在进行空气监测的住宅外收集的交通流量计数得出了多个交通指标。我们使用标准化问卷收集了房屋特征和居住者行为的数据。通过财产税记录收集了额外的住房信息。从位于中心位置的环境监测站收集了污染物的环境浓度以及气象数据。我们使用基于GIS的LUR模型来解释住宅室外PM₂.₅、EC和NO₂浓度的时空变异性。随后,我们使用受非负负荷约束的验证性因子分析得出了住宅室外浓度的潜在源因子。我们开发了LUR模型,以确定GIS协变量和其他预测因子是否能解释因子变异性,从而支持初始因子解释。为了评估室内浓度,我们开发了物理上可解释的回归模型,该模型利用问卷数据来描述室内源和活动,探讨实测室内和室外浓度之间的关系。由于大多数大型流行病学研究不太可能获得直接在房屋外测量的室外污染物浓度,我们开发了回归模型,将PM₂.₅、EC和NO₂的室内浓度解释为其他更容易获得的数据的函数:GIS协变量、反映源和通风的问卷数据以及中心站点监测数据。与室外浓度的做法一样,我们随后得出了住宅室内浓度的潜在源因子,并开发了回归模型来解释这些室内潜在源因子的变异性。最后,为了深入了解暴露特征改进对后续流行病学调查结果的影响,我们开发了一个模拟框架,以定量比较使用来自验证研究的暴露模型与使用具有不同测量误差量的其他替代模型的影响。室外PM₂.₅的浓度与中心站点监测数据密切相关,而EC浓度表现出更大的空间变异性,尤其是在较冷的月份,并且由房屋200米范围内的道路长度预测。室外NO₂也表现出显著的空间变异性,部分由房屋50米范围内的人口密度和道路长度预测。我们对室外浓度进行的约束因子分析得出的负荷表明,源包括长距离传输、制动磨损和交通尾气、柴油尾气、燃油燃烧以及道路扬尘再悬浮;相应的LUR模型通过协变量显著性在很大程度上证实了这些因子解释。例如,长距离传输由中心站点的PM₂.₅预测,制动磨损和交通尾气以及道路扬尘再悬浮由交通和居住密度预测,柴油尾气由最近主要道路上柴油交通的百分比预测,燃油燃烧由人口密度预测。我们对室内污染物浓度的建模表明,不同成分的室内-室外关系存在很大变异性,有助于将以室外源为主的成分(如硫、硒和钒)与以室内源为主的成分(如钙和硅)区分开来。回归模型表明,室内PM₂.₅受当地交通的影响不大,但有显著的室内源(烹饪活动和居住者密度),而EC与到最近指定卡车路线的距离相关,NO₂与房屋50米范围内的交通密度和燃气炉灶使用情况相关。我们对室内浓度进行的约束因子分析有助于将以室外为主的因子与以室内为主的因子区分开来,尽管有些因子似乎受到室内和室外源的共同影响。随后对室内-室外回归模型中室内归因分数的因子分析对以室内为主的因子提供了大致一致的解释。对室内因子使用回归模型表明,与室内源相关的问卷数据的预测能力有限,但加强了对源自室外的污染物室内浓度建模的可行性。尽管一些室内浓度回归模型的预测能力相对较弱,但我们的流行病学模拟表明,R²值相当适中(在0.3至0.4范围内,与PM₂.₅和NO₂的回归模型相对应)的暴露模型相对于使用在没有验证研究时可应用的暴露代理,在流行病学研究性能方面有显著改进。尽管存在样本量和可用协变量数据方面的限制,但我们的研究表明,城市区域内NO₂和空气中颗粒物的几种成分存在显著的室外空间变异性。LUR技术与约束因子分析相结合,有助于厘清本地、长距离传输和其他源对时间变异性的贡献,最终使流行病学研究能够调查来自特定源类别的暴露情况。对于室内居住环境,我们证明了颗粒物成分之间室内-室外关系存在很大变异性;然后,利用公共数据库的信息和重点问卷数据,我们能够预测一部分关键污染物的室内浓度。应用于室内环境的约束因子分析方法有助于将室内源与室外源区分开来。相应的室内回归模型预测能力有限,这进一步凸显了在仅有有限关键预测因子信息时描述室内环境的复杂性。这一发现还强调了这些回归模型可能更能表征源自室外的污染物的室内浓度,而不是所有源产生的室内浓度。我们的研究结果为未来表征室内暴露源和模式的研究提供了方向,并且我们的流行病学模拟强化了在许多与交通相关的空气污染物受室内和室外源共同影响的情况下减少测量误差的重要性。我们研究中使用的分析技术组合最终可能使流行病学研究中对暴露的表征更加精细,并评估各种源对健康结果的相对贡献。

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验