Nguyen Anh Duy, Le Nguyen Phi, Vu Viet Hung, Pham Quoc Viet, Nguyen Viet Huy, Nguyen Minh Hieu, Nguyen Thanh Hung, Nguyen Kien
School of Information and Communication Technology, Hanoi University of Science and Technology, Hanoi, Vietnam.
Institute for Advanced Academic Research, Chiba University, Chiba, Japan.
Sci Rep. 2022 Nov 18;12(1):19870. doi: 10.1038/s41598-022-22057-8.
Forecasting discharge (Q) and water level (H) are essential factors in hydrological research and flood prediction. In recent years, deep learning has emerged as a viable technique for capturing the non-linear relationship of historical data to generate highly accurate prediction results. Despite the success in various domains, applying deep learning in Q and H prediction is hampered by three critical issues: a shortage of training data, the occurrence of noise in the collected data, and the difficulty in adjusting the model's hyper-parameters. This work proposes a novel deep learning-based Q-H prediction model that overcomes all the shortcomings encountered by existing approaches. Specifically, to address data scarcity and increase prediction accuracy, we design an ensemble learning architecture that takes advantage of multiple deep learning techniques. Furthermore, we leverage the Singular-Spectrum Analysis (SSA) to remove noise and outliers from the original data. Besides, we exploit the Genetic Algorithm (GA) to propose a novel mechanism that can automatically determine the prediction model's optimal hyper-parameters. We conducted extensive experiments on two datasets collected from Vietnam's Red and Dakbla rivers. The results show that our proposed solution outperforms current techniques across a wide range of metrics, including NSE, MSE, MAE, and MAPE. Specifically, by exploiting the ensemble learning technique, we can improve the NSE by at least [Formula: see text]. Moreover, with the aid of the SSA-based data preprocessing technique, the NSE is further enhanced by more than [Formula: see text]. Finally, thanks to GA-based optimization, our proposed model increases the NSE by at least [Formula: see text] and up to [Formula: see text] in the best case.
预测流量(Q)和水位(H)是水文研究和洪水预测中的关键因素。近年来,深度学习已成为一种可行的技术,用于捕捉历史数据的非线性关系,以生成高度准确的预测结果。尽管在各个领域都取得了成功,但在流量和水位预测中应用深度学习仍受到三个关键问题的阻碍:训练数据短缺、收集的数据中存在噪声以及调整模型超参数困难。这项工作提出了一种基于深度学习的新型流量-水位预测模型,该模型克服了现有方法遇到的所有缺点。具体而言,为了解决数据稀缺问题并提高预测准确性,我们设计了一种集成学习架构,利用多种深度学习技术。此外,我们利用奇异谱分析(SSA)从原始数据中去除噪声和异常值。此外,我们利用遗传算法(GA)提出了一种新颖的机制,可以自动确定预测模型的最佳超参数。我们对从越南红河和达克拉河收集的两个数据集进行了广泛的实验。结果表明,我们提出的解决方案在包括NSE、MSE、MAE和MAPE在内的广泛指标上优于当前技术。具体而言,通过利用集成学习技术,我们可以将NSE至少提高[公式:见原文]。此外,借助基于SSA的数据预处理技术,NSE进一步提高了超过[公式:见原文]。最后,由于基于GA的优化,我们提出的模型在最佳情况下将NSE至少提高[公式:见原文],最高可达[公式:见原文]。