Lee Eunmi, You Young-Wook, Jung Young-Hun, Kam Jonghun
Division of Environmental Science and Engineering, Pohang University of Science and Technology, Pohang, 37673, South Korea.
Department of Advanced Science and Technology Convergence, Kyungpook National University, Sangju, 37224, South Korea.
J Environ Manage. 2025 Jun;385:125640. doi: 10.1016/j.jenvman.2025.125640. Epub 2025 May 6.
Analytic Hierarchy Process (AHP) of pluvial flood risk assessment has been widely used, incorporating multiple assessment indices. However, uncertainty assessment of expert judgement-based flood risk remains limited. This study proposes a Machine Learning (ML) model-based AHP approach, using pluvial flood-related data of hazard, exposure, vulnerability, and capacity of South Korea over 2002-2021. In this study, the trained eXtreme Gradient Boosting (XGBoost) and Random Forest (RF) models successfully predict flood economic losses using 21 flood-related variables, outperforming LightGBM and CatBoost. Permutation importance scores from the trained XGBoost and RF models are used to estimate the mean and 95 % confidence intervals of the assessment factor weights. Both models show that rainfall amount, river area, population density, and green belt area are important factors for flood damage prediction, but the XGBoost (RF) model identifies impermeable areal fraction (river area) as the most important component in exposure, resulting in disparity in the uncertainty range in major cities over South Korea where the XGBoost and RF models show a high risk consistently. This study substantiates the practical application of the proposed ML based-AHP approach for uncertainty assessment of flood risk, highlighting the need for balanced land development and green infrastructure for flood mitigation.
暴雨洪水风险评估的层次分析法(AHP)已被广泛应用,纳入了多个评估指标。然而,基于专家判断的洪水风险不确定性评估仍然有限。本研究提出了一种基于机器学习(ML)模型的层次分析法,使用了2002年至2021年韩国暴雨洪水相关的灾害、暴露、脆弱性和应对能力数据。在本研究中,经过训练的极端梯度提升(XGBoost)和随机森林(RF)模型使用21个与洪水相关的变量成功预测了洪水经济损失,优于LightGBM和CatBoost。从经过训练的XGBoost和RF模型中得到的排列重要性得分用于估计评估因子权重的均值和95%置信区间。两个模型均表明,降雨量、河流面积、人口密度和绿化带面积是洪水损失预测的重要因素,但XGBoost(RF)模型将不透水面积比例(河流面积)确定为暴露方面最重要的组成部分,导致在韩国主要城市中XGBoost和RF模型一致显示高风险的不确定性范围存在差异。本研究证实了所提出的基于机器学习的层次分析法在洪水风险不确定性评估中的实际应用,强调了平衡土地开发和绿色基础设施以减轻洪水灾害的必要性。