Ireland's National Centre for Artificial Intelligence (CeADAR), University College Dublin, NexusUCD, Belfield Office Park, Dublin, Ireland.
Ireland's National Centre for Artificial Intelligence (CeADAR), University College Dublin, NexusUCD, Belfield Office Park, Dublin, Ireland.
Chemosphere. 2024 Sep;364:143097. doi: 10.1016/j.chemosphere.2024.143097. Epub 2024 Aug 16.
Over the past decades, air pollution has caused severe environmental and public health problems. According to the World Health Organization (WHO), fine particulate matter (PM), a key component reflecting air quality, is the fourth leading cause of death worldwide after cardiovascular disease, smoking, and diet. Various research efforts have aimed to develop PM forecasting models that can be integrated into a solution to mitigate the adverse effects of air pollution. However, PM forecasting is challenging because air pollution data are non-stationary and influenced by multiple random effects. This paper proposes an effective multivariate multi-step ensemble machine learning model for predicting continuous 24-h PM concentrations, considering meteorological conditions, the rolling mean of PM time series, and temporal features. PM is strongly correlated with space and time. Therefore, forecasting results from one location are insufficient to represent the level of air pollution for an entire city. In this study, we established six real-time air quality monitoring sites in different regions, including traffic, residential, and industrial areas in Ho Chi Minh City (HCMC), and generated forecasting results for each station. Various statistical methods are incorporated to evaluate the performance of the model. The experimental results confirm that the model performs well, substantially improving its forecasting accuracy compared to existing PM forecasting models developed for HCMC. In addition, we analyze to determine the contribution of different feature groups to model performance. The model can serve as a reference for citizens scheduling local travel and for healthcare providers to provide early warnings.
在过去的几十年里,空气污染已经造成了严重的环境和公共卫生问题。根据世界卫生组织(WHO)的说法,细颗粒物(PM)是反映空气质量的一个关键成分,是全球第四大死亡原因,仅次于心血管疾病、吸烟和饮食。各种研究都致力于开发 PM 预测模型,可以将其整合到减轻空气污染的解决方案中。然而,PM 预测具有挑战性,因为空气污染数据是非平稳的,受到多个随机效应的影响。本文提出了一种有效的多变量多步集成机器学习模型,用于预测连续 24 小时的 PM 浓度,同时考虑了气象条件、PM 时间序列的滚动平均值和时间特征。PM 与空间和时间高度相关。因此,一个地点的预测结果不足以代表整个城市的空气污染水平。在这项研究中,我们在胡志明市(HCMC)的不同区域建立了六个实时空气质量监测站,包括交通、住宅和工业区,并为每个站点生成了预测结果。采用了各种统计方法来评估模型的性能。实验结果证实,该模型表现良好,与为 HCMC 开发的现有 PM 预测模型相比,大大提高了预测精度。此外,我们还分析了不同特征组对模型性能的贡献。该模型可以为市民规划当地出行提供参考,也可以为医疗保健提供者提供早期预警。