基于迁移学习两阶段注意力机制的空气污染物预测模型

Air pollutant prediction model based on transfer learning two-stage attention mechanism.

作者信息

Ma Zhanfei, Wang Bisheng, Luo Wenli, Jiang Jing, Liu Dongxiang, Wei Hui, Luo HaoYe

机构信息

School of Information Science and Technology, Baotou Teachers' College, Baotou, 014010, Inner Mongolia, China.

School of Information Engineering, Inner Mongolia University of Science and Technology, Baotou, 014010, Inner Mongolia, China.

出版信息

Sci Rep. 2024 Mar 28;14(1):7385. doi: 10.1038/s41598-024-57784-7.

DOI:10.1038/s41598-024-57784-7

PMID:38548823

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC10978953/

Abstract

Atmospheric pollution significantly impacts the regional economy and human health, and its prediction has been increasingly emphasized. The performance of traditional prediction methods is limited due to the lack of historical data support in new atmospheric monitoring sites. Therefore, this paper proposes a two-stage attention mechanism model based on transfer learning (TL-AdaBiGRU). First, the first stage of the model utilizes a temporal distribution characterization algorithm to segment the air pollutant sequences into periods. It introduces a temporal attention mechanism to assign self-learning weights to the period segments in order to filter out essential period features. Then, in the second stage of the model, a multi-head external attention mechanism is introduced to mine the network's hidden layer key features. Finally, the adequate knowledge learned by the model at the source domain site is migrated to the new site to improve the prediction capability of the new site. The results show that (1) the model is modeled from the data distribution perspective, and the critical information within the sequence of periodic segments is mined in depth. (2) The model employs a unique two-stage attention mechanism to capture complex nonlinear relationships in air pollutant data. (3) Compared with the existing models, the mean absolute error (MAE), root mean square error (RMSE), and mean absolute percentage error (MAPE) of the model decreased by 14%, 13%, and 4%, respectively, and the prediction accuracy was greatly improved.

摘要

大气污染对区域经济和人类健康有重大影响，其预测也越来越受到重视。由于新的大气监测站点缺乏历史数据支持，传统预测方法的性能受到限制。因此，本文提出了一种基于迁移学习的两阶段注意力机制模型（TL-AdaBiGRU）。首先，模型的第一阶段利用时间分布表征算法将空气污染物序列划分为不同时期。它引入了时间注意力机制，为时期片段分配自学习权重，以过滤出关键的时期特征。然后，在模型的第二阶段，引入多头外部注意力机制来挖掘网络隐藏层的关键特征。最后，将模型在源域站点学到的充分知识迁移到新站点，以提高新站点的预测能力。结果表明：（1）该模型从数据分布角度进行建模，深入挖掘了周期片段序列中的关键信息。（2）该模型采用独特的两阶段注意力机制来捕捉空气污染物数据中的复杂非线性关系。（3）与现有模型相比，该模型的平均绝对误差（MAE）、均方根误差（RMSE）和平均绝对百分比误差（MAPE）分别降低了14%、13%和4%，预测精度得到了大幅提高。