He Fei, Liu Suxia, Mo Xingguo, Wang Zhonggen
Key Laboratory of Water Cycle and Related Land Surface Processes, Institute of Geographic Sciences and Natural Resources Research (IGSNRR), Chinese Academy of Sciences (CAS), Beijing, 100101, China.
College of Resources and Environment/Sino-Danish College, University of Chinese Academy of Sciences (UCAS), Beijing, 100190, China.
Sci Rep. 2025 Jan 11;15(1):1702. doi: 10.1038/s41598-024-84655-y.
Flash flood susceptibility mapping is essential for identifying areas prone to flooding events and aiding decision-makers in formulating effective prevention measures. This study aims to evaluate the flash flood susceptibility in the Yarlung Tsangpo River Basin (YTRB) using multiple machine learning (ML) models facilitated by the H2O automated ML platform. The best-performing model was used to generate a flash flood susceptibility map, and its interpretability was analyzed using the Shapley Additive Explanations (SHAP) tree interpretation method. The results revealed that the top four models, including both single and ensemble models, demonstrated high accuracy in the tests. The flash flood susceptibility map generated by the best-performing eXtreme Randomized Trees (XRT) model showed that 8.92%, 12.95%, 15.42%, 31.34%, and 31.37% of the study area exhibited very high, high, moderate, low, and very low flash flood susceptibility, respectively, with approximately 74.9% of the historical flash floods occurring in areas classified as moderate to very high susceptibility. The SHAP plot identified topographic factors as the primary drivers of flash floods, with the importance analysis ranking the most influential factors in such descending order as DEM, topographic wetness index, topographic position index, normalized difference vegetation index, and average multi-year precipitation. This study demonstrates the benefits of interpretable machine learning, which can provide guidance for flash flood mitigation.
山洪灾害易发性制图对于识别易发生洪水的地区以及帮助决策者制定有效的预防措施至关重要。本研究旨在利用由H2O自动化机器学习平台推动的多种机器学习(ML)模型,评估雅鲁藏布江流域(YTRB)的山洪灾害易发性。使用性能最佳的模型生成山洪灾害易发性地图,并采用夏普利值加法解释(SHAP)树解释方法分析其可解释性。结果显示,包括单一模型和集成模型在内的前四个模型在测试中表现出较高的准确性。性能最佳的极端随机树(XRT)模型生成的山洪灾害易发性地图显示,研究区域分别有8.92%、12.95%、15.42%、31.34%和31.37%的区域表现出极高、高、中、低和极低的山洪灾害易发性,约74.9%的历史山洪发生在被归类为中度到极高易发性的区域。SHAP图将地形因素确定为山洪的主要驱动因素,重要性分析将最具影响力的因素按降序排列为数字高程模型(DEM)、地形湿度指数、地形位置指数、归一化植被指数和多年平均降水量。本研究证明了可解释机器学习的好处,可为减轻山洪灾害提供指导。