Rehan Mohammad, Munir Said
Center of Excellence in Environmental Studies (CEES), King Abdulaziz University, Jeddah 21589, Saudi Arabia.
Institute for Transport Studies, Faculty of Environment, University of Leeds, Leeds LS2 9JT, UK.
Toxics. 2022 Jul 5;10(7):376. doi: 10.3390/toxics10070376.
Air pollution has serious environmental and human health-related consequences; however, little work seems to be undertaken to address the harms in Middle Eastern countries, including Saudi Arabia. We installed a continuous air quality monitoring station in Jeddah, Saudi Arabia and monitored several air pollutants and meteorological parameters over a 2-year period (2018-2019). Here, we developed two supervised machine learning models, known as quantile regression models, to analyze the whole distribution of the modeled pollutants, not only the mean values. Two pollutants, namely NO and O, were modeled by dividing their concentrations into several quantiles (0.05, 0.25, 0.50, 0.75, and 0.95) and the effect of several pollutants and meteorological variables was analyzed on each quantile. The effect of the explanatory variables changed at different segments of the distribution of NO and O concentrations. For instance, for the modeling of O, the coefficients of wind speed at quantiles 0.05, 0.25, 0.5, 0.75, and 0.95 were 1.40, 2.15, 2.34, 2.31, and 1.56, respectively. Correlation coefficients of 0.91 and 0.92 and RMSE values of 14.41 and 8.96, which are calculated for the cross-validated models of NO and O, showed an acceptable model performance. Quantile analysis aids in better understanding the behavior of air pollution and how it interacts with the influencing factors.
空气污染会造成严重的环境问题并对人类健康产生影响;然而,在包括沙特阿拉伯在内的中东国家,似乎很少有人致力于解决这些危害。我们在沙特阿拉伯吉达安装了一个空气质量连续监测站,并在两年时间(2018 - 2019年)内监测了多种空气污染物和气象参数。在此,我们开发了两种监督式机器学习模型,即分位数回归模型,以分析模拟污染物的整体分布情况,而不仅仅是平均值。通过将两种污染物(即一氧化氮和臭氧)的浓度划分为几个分位数(0.05、0.25、0.50、0.75和0.95)来对其进行建模,并分析了多种污染物和气象变量对每个分位数的影响。解释变量的影响在一氧化氮和臭氧浓度分布的不同区间有所变化。例如,对于臭氧建模,在分位数0.05、0.25、0.5、0.75和0.95处风速的系数分别为1.40、2.15、2.34、2.31和1.56。为一氧化氮和臭氧的交叉验证模型计算得出的相关系数分别为0.91和0.92,均方根误差值分别为14.41和8.96,表明模型性能可接受。分位数分析有助于更好地理解空气污染的行为及其与影响因素的相互作用。