School of Environmental Science and Engineering, Tianjin University, Tianjin, 300350, PR China.
Environ Res. 2023 Jan 1;216(Pt 3):114723. doi: 10.1016/j.envres.2022.114723. Epub 2022 Nov 3.
Ammonia nitrogen (NH-N) is closely related to the occurrence of cyanobacterial blooms and destruction of surface water ecosystems, and thus it is of great significance to develop predictive models for NH-N. However, traditional models cannot fully consider the complex nonlinear relationship between NH-N and various relative environmental parameters. The long short-term memory (LSTM) neural network can overcome this limitation. A new hybrid model BC-MODWT-DA-LSTM was proposed based on LSTM combining with the dual-stage attention (DA) mechanism and boundary corrected maximal overlap discrete wavelet transform (BC-MODWT) data decomposition method. By introducing attention mechanism, LSTM could selectively focus on the input data. BC-MODWT could decompose the input data into sublayers to determine the main swings and trends of the input feature series. The BC-MODWT-DA-LSTM hybrid model was superior to other studied models with lower average prediction errors. It could maintain NASH Sutcliffe efficiency coefficient (NSE) values above 0.900 under the lead time up to 7 days, and the area under the receiver operating characteristic (ROC) curve could reach 0.992. The hybrid model also had higher prediction accuracies at the peak spots, indicating that it was capable of early warning when sudden high NH-N pollution occurred. The high forecasting accuracy of the suggested hybrid method proved that further improving LSTM model without introducing more complex topologies was a promising water quality prediction method.
氨氮(NH-N)与蓝藻水华的发生和地表水生态系统的破坏密切相关,因此开发 NH-N 的预测模型具有重要意义。然而,传统模型不能充分考虑 NH-N 与各种相对环境参数之间的复杂非线性关系。长短期记忆(LSTM)神经网络可以克服这一局限性。本文提出了一种基于 LSTM 的新型混合模型 BC-MODWT-DA-LSTM,该模型结合了双阶段注意力(DA)机制和边界修正最大重叠离散小波变换(BC-MODWT)数据分解方法。通过引入注意力机制,LSTM 可以选择性地关注输入数据。BC-MODWT 可以将输入数据分解为子层,以确定输入特征序列的主要波动和趋势。与其他研究模型相比,BC-MODWT-DA-LSTM 混合模型具有更低的平均预测误差。它可以在提前期长达 7 天的情况下保持纳什效率系数(NSE)值高于 0.900,接收器操作特性(ROC)曲线下的面积可达 0.992。该混合模型在峰值点也具有更高的预测精度,表明当突然发生高 NH-N 污染时,它能够进行预警。该混合方法的高预测精度证明,在不引入更复杂拓扑结构的情况下进一步改进 LSTM 模型是一种很有前途的水质预测方法。