Liu Ying, Wen Lixia, Lin Zhengjiang, Xu Cong, Chen Yu, Li Yong
School of Enviromental Science and Engineering, Southwest Jiaotong University, Chengdu, 611756, China.
BYD Company Limited, Shenzhen, 518119, China.
Sci Rep. 2024 Oct 1;14(1):22791. doi: 10.1038/s41598-024-74246-2.
Air quality is closely linked to human health and social development, making accurate air quality prediction highly significant. The Air Quality Index (AQI) is inherently a time series. However, most previous studies have overlooked its temporal features and have not thoroughly explored the relationship between pollutant emissions and air quality. To address this issue, this study establishes a historical correlation model for air quality based on a time series model-the Gaussian Hidden Markov Model (GHMM)-using industrial exhaust emissions and historical air quality data. Firstly, a traversal method is used to select the optimal number of hidden states for the GHMM. To optimize the traditional GHMM and reduce error accumulation in the prediction process, the Multi-day Weighted Matching method and the Fixed Training Set Length method are utilized. Both direct and indirect prediction modes are then used to predict the AQI in the Zhangdian District. Experimental results indicate that the improved GHMM with the indirect mode provides higher accuracy and more stable state estimation results (MAE = 13.59, RMSE = 17.59, mean forecasted value = 117.94). Finally, the air quality historical correlation model is integrated with the air quality meteorological correlation model from a previous study, further improving prediction accuracy (MAE = 11.59, RMSE = 14.87, mean forecasted value = 120.88). This study demonstrates that the GHMM's strong ability to analyze temporal features significantly enhances the accuracy and stability of air quality predictions. The integration of the air quality historical correlation model with the air quality meteorological correlation model from a previous study leverages the strengths of each sub-model in handling different feature groups, leading to even more accurate predictions.
空气质量与人类健康和社会发展密切相关,因此准确的空气质量预测具有重要意义。空气质量指数(AQI)本质上是一个时间序列。然而,以往的大多数研究都忽略了其时间特征,并且没有深入探讨污染物排放与空气质量之间的关系。为了解决这个问题,本研究基于时间序列模型——高斯隐马尔可夫模型(GHMM),利用工业废气排放和历史空气质量数据,建立了空气质量历史相关模型。首先,采用遍历方法为GHMM选择最优的隐藏状态数量。为了优化传统的GHMM并减少预测过程中的误差积累,使用了多日加权匹配方法和固定训练集长度方法。然后采用直接和间接预测模式对张店区的AQI进行预测。实验结果表明,采用间接模式的改进型GHMM具有更高的精度和更稳定的状态估计结果(平均绝对误差=13.59,均方根误差=17.59,平均预测值=117.94)。最后,将空气质量历史相关模型与先前研究中的空气质量气象相关模型相结合,进一步提高了预测精度(平均绝对误差=11.59,均方根误差=14.87,平均预测值=120.88)。本研究表明,GHMM强大的时间特征分析能力显著提高了空气质量预测的准确性和稳定性。将空气质量历史相关模型与先前研究中的空气质量气象相关模型相结合,利用了每个子模型在处理不同特征组方面的优势,从而实现了更准确的预测。