Department of Watershed Management, Sari Agricultural Sciences and Natural Resources University, P.O. Box 737, Sari, Iran.
Soil Conservation and Watershed Management Research Department, West Azarbaijan Agricultural and Natural Resources Research and Education Center, AREEO, Urmia, Iran.
Sci Total Environ. 2020 Feb 25;705:135983. doi: 10.1016/j.scitotenv.2019.135983. Epub 2019 Dec 6.
Flood susceptibility projections relying on standalone models, with one-time train-test data splitting for model calibration, yields biased results. This study proposed novel integrative flood susceptibility prediction models based on multi-time resampling approaches, random subsampling (RS) and bootstrapping (BT) algorithms, integrated with machine learning models: generalized additive model (GAM), boosted regression tree (BTR) and multivariate adaptive regression splines (MARS). RS and BT algorithms provided 10 runs of data resampling for learning and validation of the models. Then the mean of 10 runs of predictions is used to produce the flood susceptibility maps (FSM). This methodology was applied to Ardabil Province on coastal margins of the Caspian Sea which faced destructive floods. The area under curve (AUC) of receiver operating characteristic (ROC) and true skill statistic (TSS) and correlation coefficient (COR) were utilized to evaluate the predictive accuracy of the proposed models. Results demonstrated that resampling algorithms improved the performance of Standalone GAM, MARS and BRT models. Results also revealed that Standalone models had better performance with the BT algorithm compared to the RS algorithm. BT-GAM model attained superior performance in terms of statistical measures (AUC = 0.98, TSS = 0.93, COR = 0.91), followed by BT-MARS (AUC = 0.97, TSS = 0.91, COR = 0.91) and BT-BRT model (AUC = 0.95, TSS = 0.79, COR = 0.79). Results demonstrated that the proposed models outperformed the benchmark models such as Standalone GAM, MARS, BRT, multilayer perceptron (MLP) and support vector machine (SVM). Given the admirable performance of the proposed models in a large scale area, the promising results can be expected from these models for other regions.
基于单机模型的洪水易感性预测,采用一次性的训练-测试数据分割进行模型校准,会产生有偏的结果。本研究提出了基于多次重采样方法的新型综合洪水易感性预测模型,包括随机抽样(RS)和自举法(BT)算法,以及机器学习模型:广义加性模型(GAM)、提升回归树(BTR)和多元自适应回归样条(MARS)。RS 和 BT 算法提供了 10 次数据重采样,用于模型的学习和验证。然后,使用 10 次预测的平均值生成洪水易感性图(FSM)。该方法应用于里海沿海边缘的阿尔达比勒省,该地区曾遭受破坏性洪水。采用曲线下面积(AUC)、接收者操作特征(ROC)的真实技能统计(TSS)和相关系数(COR)来评估所提出模型的预测精度。结果表明,重采样算法提高了单机 GAM、MARS 和 BRT 模型的性能。结果还表明,与 RS 算法相比,BT 算法下的单机模型具有更好的性能。BT-GAM 模型在统计指标方面表现出色(AUC=0.98,TSS=0.93,COR=0.91),其次是 BT-MARS(AUC=0.97,TSS=0.91,COR=0.91)和 BT-BRT 模型(AUC=0.95,TSS=0.79,COR=0.79)。结果表明,所提出的模型优于基准模型,如单机 GAM、MARS、BRT、多层感知机(MLP)和支持向量机(SVM)。鉴于这些模型在大规模区域的卓越性能,可以预期这些模型在其他地区也会有良好的表现。