State Key Laboratory of Environmental Criteria and Risk Assessment, Chinese Research Academy of Environmental Sciences, Beijing, China; Department of Environmental and Occupational Health Sciences, University of Washington, Seattle, WA, United States.
State Key Laboratory of Environmental Criteria and Risk Assessment, Chinese Research Academy of Environmental Sciences, Beijing, China.
Environ Res. 2022 Jul;210:112858. doi: 10.1016/j.envres.2022.112858. Epub 2022 Feb 8.
Geo-statistical models have been applied to assess fine-scale air pollution exposures in epidemiological studies. Many of the models were developed for criteria air pollutants rather than others that have not been regulated (e.g., ultrafine particles, black carbon, and benzene) which may also be harmful to human health. We aim to develop spatial models for regulated and non-regulated air pollutants using 6 algorithms and compare their prediction performances. A mobile platform with fast-response monitors was used to measure gaseous air pollutants (nitrogen dioxides, carbon monoxide, sulfur dioxides, ozone, benzene, toluene, methanol) and particulate matters (black carbon, surface area, count- and volume-concentrations of ultrafine particles) in Beijing, China for 30 days from July to October 2008. Mobile monitoring data for model building were spatially aggregated into 130 road segments of approximately 600-m interval on the sampling routes after temporal adjustment of background concentrations. The best models for the air pollutants were dominated by traffic variables, which explained more than 60% of the spatial variations (range: 0.61 for methanol to 0.88 for ozone) based on the highest cross-validation R and the lowest root mean square error among different algorithms. Amongst the 6 algorithms, the spatial models using partial least squares regression (PLS, a dimension reduction algorithm) and random forest (RF, a machine learning algorithm) algorithms outperformed the models with other algorithms. Exposure predictions from the best models varied substantially with distinct spatial patterns between the air pollutants. Predictions with multiple modeling algorithms were moderately correlated with each other for the same pollutant at the fine-scale grids across the city. Exposure models, especially based on PLS and RF algorithms, captured the spatial variation of short-term average concentrations, had adequate predictive validity, and could be applied to assess toxic air pollutant exposures in human health studies.
地统计学模型已被应用于评估流行病学研究中的细颗粒空气污染暴露。许多模型是为标准空气污染物而开发的,而不是为其他未受监管的污染物(例如超细颗粒、黑碳和苯)开发的,这些污染物也可能对人类健康有害。我们的目标是使用 6 种算法为受监管和不受监管的空气污染物开发空间模型,并比较它们的预测性能。一个带有快速响应监测器的移动平台用于测量气态空气污染物(二氧化氮、一氧化碳、二氧化硫、臭氧、苯、甲苯、甲醇)和颗粒物(黑碳、表面积、超细颗粒的计数和体积浓度)在北京,中国从 2008 年 7 月到 10 月进行了 30 天的测量。移动监测数据在时间调整背景浓度后,空间聚集到采样路线上的 130 个约 600 米间隔的道路段,用于模型构建。最佳模型的空气污染物主要由交通变量主导,根据不同算法之间最高交叉验证 R 和最低均方根误差,解释了超过 60%的空间变化(范围:甲醇为 0.61,臭氧为 0.88)。在 6 种算法中,基于偏最小二乘回归(PLS,一种降维算法)和随机森林(RF,一种机器学习算法)的空间模型优于其他算法的模型。最佳模型的暴露预测在不同空气污染物之间存在明显的空间差异。来自最佳模型的预测在整个城市的细网格上,对于相同的污染物,具有不同的空间模式。暴露模型,特别是基于 PLS 和 RF 算法的模型,捕捉了短期平均浓度的空间变化,具有足够的预测有效性,可用于评估人类健康研究中的有毒空气污染物暴露。