School of Environment and Energy, South China University of Technology, Guangzhou 510006, China.
School of Environment and Energy, South China University of Technology, Guangzhou 510006, China.
Sci Total Environ. 2022 Jun 25;827:154279. doi: 10.1016/j.scitotenv.2022.154279. Epub 2022 Mar 4.
Tropospheric ozone (O) pollution is worsening in China, and an accurate forecast is a prerequisite to lower the O peak level. In recent years, machine learning techniques have attracted increasing attention in O prediction owing to their high efficiency and simple operation. However, the accuracy of predicting the daily O level is low. This study proposed a novel model by coupling long short-term memory neural network with transfer learning (TL-LSTM), with meteorology and pollutant concentration information as the model input. L2 regularization was applied to reduce the risk of overfitting and to improve the accuracy and generalization ability of the model prediction. Our results indicated that by transferring the knowledge in the model configuration from the hourly LSTM module, TL-LSTM greatly improves the predictability of the daily maximum 8 h average (MDA8) of O in Hong Kong. The coefficient of determination (R) increased from 0.684 to 0.783 and the mean square error (MSE) reduced from 1.36 × 10 to 1.05 × 10. Furthermore, R and MSE were the highest in summer, indicating an under-prediction of peak O levels. This was a result of the limited number of high O days, which did not provide sufficient knowledge for the model to make an accurate prediction. Sobol analysis indicated that wind speed was the most sensitive factor in O prediction, largely due to the development of land-sea breeze circulation which effectively traps pollutants and expedites O formation. The results clearly demonstrate the effectiveness of the TL-LSTM in predicting the daily O concentration in Hong Kong. Thus, TL-LSTM can be promulgated into other photochemically active regions to assist in O pollution forecasting and management.
在中国,对流层臭氧(O)污染日益严重,准确的预测是降低 O 峰值水平的前提。近年来,机器学习技术由于其高效、操作简单等特点,在 O 预测中受到越来越多的关注。然而,预测日 O 浓度的精度较低。本研究提出了一种新的模型,通过将长短期记忆神经网络与迁移学习(TL-LSTM)相结合,以气象和污染物浓度信息作为模型输入。应用 L2 正则化来降低过拟合的风险,提高模型预测的准确性和泛化能力。研究结果表明,通过将模型配置中的知识从每小时 LSTM 模块转移,TL-LSTM 大大提高了香港日最大 8 小时平均(MDA8)O 的可预测性。决定系数(R)从 0.684 增加到 0.783,均方误差(MSE)从 1.36×10减少到 1.05×10。此外,R 和 MSE 在夏季最高,表明 O 峰值水平的预测不足。这是由于高 O 日数量有限,模型没有足够的知识进行准确预测。Sobol 分析表明,风速是 O 预测中最敏感的因素,这主要是由于海陆风环流的发展,有效地捕获污染物并加速 O 的形成。结果清楚地表明了 TL-LSTM 在预测香港日 O 浓度方面的有效性。因此,TL-LSTM 可以推广到其他光化学反应活性区域,以协助 O 污染预测和管理。