Lv Ming-Zhou, Li Kun-Lun, Cai Jia-Zeng, Mao Jun, Gao Jia-Jun, Xu Hui
School of Civil Engineering and Architecture, Zhejiang Sci-Tech University, Hangzhou, China.
Zhenan Comprehensive Engineering Surveying and Mapping Institute, Lishui, China.
PLoS One. 2025 May 21;20(5):e0323487. doi: 10.1371/journal.pone.0323487. eCollection 2025.
Landslides are frequent and hazardous geological disasters, posing significant risks to human safety and infrastructure. Accurate assessments of landslide susceptibility are crucial for risk management and mitigation. However, geological surveys of landslide areas are typically conducted at the township level, have lowsample sizes, and rely on experience. This study proposes a framework for assessing landslide susceptibility in Taiping Township, Zhejiang Province, China, using data balancing, machine learning, and data from 1,325 slope units with nine slope characteristics. The dataset was balanced using the Synthetic Minority Oversampling Technique and the Tomek link undersampling method (SMOTE-Tomek). A comparative analysis of six machine learning models was performed, and the SHapley Additive exPlanation (SHAP) method was used to assess the influencing factors. The results indicate that the machine learning algorithms provide high accuracy, and the random forest (RF) algorithm achieves the optimum model accuracy (0.791, F1 = 0.723). The very low, low, medium, and high sensitivity zones account for 92.27%, 5.12%, 1.78%, and 0.83% of the area, respectively. The height of cut slopes has the most significant impact on landslide sensitivity, whereas the altitude has a minor impact. The proposed model accurately assesses landslide susceptibility at the township scale, providing valuable insights for risk management and mitigation.
山体滑坡是频繁且危险的地质灾害,对人类安全和基础设施构成重大风险。准确评估山体滑坡易发性对于风险管理和减灾至关重要。然而,山体滑坡地区的地质调查通常在乡镇层面进行,样本量小,且依赖经验。本研究提出了一个利用数据平衡、机器学习以及来自1325个具有九个斜坡特征的斜坡单元的数据,评估中国浙江省太平乡山体滑坡易发性的框架。使用合成少数过采样技术和托梅克链接欠采样方法(SMOTE - Tomek)对数据集进行平衡。对六种机器学习模型进行了比较分析,并使用夏普利加法解释(SHAP)方法评估影响因素。结果表明,机器学习算法具有较高的准确性,随机森林(RF)算法实现了最优模型精度(0.791,F1 = 0.723)。极低、低、中、高敏感区分别占该区域面积的92.27%、5.12%、1.78%和0.83%。挖方边坡高度对滑坡敏感性影响最为显著,而海拔高度影响较小。所提出的模型准确评估了乡镇尺度的山体滑坡易发性,为风险管理和减灾提供了有价值的见解。