State Key Laboratory of Geohazard Prevention and Geoenvironment Protection (Chengdu University of Technology), Chengdu, 610059, China.
Sichuan Academy of Environmental Policy and Planning, Department of Ecology and Environment of Sichuan Province, Chengdu, 610059, China.
J Environ Manage. 2024 Jun;360:121097. doi: 10.1016/j.jenvman.2024.121097. Epub 2024 May 10.
With high-frequency data of nitrate (NO-N) concentrations in waters becoming increasingly important for understanding of watershed system behaviors and ecosystem managements, the accurate and economic acquisition of high-frequency NO-N concentration data has become a key point. This study attempted to use coupled deep learning neural networks and routine monitored data to predict hourly NO-N concentrations in a river. The hourly NO-N concentration at the outlet of the Oyster River watershed in New Hampshire, USA, was predicted through neural networks with a hybrid model architecture coupling the Convolutional Neural Networks and the Long Short-Term Memory model (CNN-LSTM). The routine monitored data (the river depth, water temperature, air temperature, precipitation, specific conductivity, pH and dissolved oxygen concentrations) for model training were collected from a nested high-frequency monitoring network, while the high-frequency NO-N concentration data obtained at the outlet were not included as inputs. The whole dataset was separated into training, validation, and testing processes according to the ratio of 5:3:2, respectively. The hybrid CNN-LSTM model with different input lengths (1d, 3d, 7d, 15d, 30d) displayed comparable even better performance than other studies with lower frequencies, showing mean values of the Nash-Sutcliffe Efficiency 0.60-0.83. Models with shorter input lengths demonstrated both the higher modeling accuracy and stability. The water level, water temperature and pH values at monitoring sites were main controlling factors for forecasting performances. This study provided a new insight of using deep learning networks with a coupled architecture and routine monitored data for high-frequency riverine NO-N concentration forecasting and suggestions about strategies about variable and input length selection during preprocessing of input data.
随着水体硝酸盐(NO-N)浓度高频数据在理解流域系统行为和生态系统管理方面变得越来越重要,准确、经济地获取高频 NO-N 浓度数据已成为关键。本研究试图使用耦合深度学习神经网络和常规监测数据来预测河流中的小时 NO-N 浓度。通过神经网络,使用耦合卷积神经网络和长短期记忆模型(CNN-LSTM)的混合模型架构,预测美国新罕布什尔州牡蛎河流域出口处的小时 NO-N 浓度。用于模型训练的常规监测数据(河深、水温、气温、降水、比电导率、pH 值和溶解氧浓度)来自嵌套高频监测网络,而出口处获得的高频 NO-N 浓度数据不包括作为输入。整个数据集根据 5:3:2 的比例分别分为训练、验证和测试过程。具有不同输入长度(1d、3d、7d、15d、30d)的混合 CNN-LSTM 模型显示出与其他使用较低频率的研究相当甚至更好的性能,表现出纳什-苏特克里夫效率的平均值为 0.60-0.83。输入长度较短的模型表现出更高的建模精度和稳定性。监测站点的水位、水温、pH 值是预测性能的主要控制因素。本研究为使用具有耦合架构的深度学习网络和常规监测数据进行高频河流水体 NO-N 浓度预测提供了新的思路,并就输入数据预处理过程中的变量和输入长度选择策略提出了建议。