School of Geography and Information Engineering, China University of Geosciences, Wuhan, 430074, China.
Class 3 Grade3, Wuhan No.11 High School, Wuhan, 430030, China.
Sci Rep. 2019 Oct 25;9(1):15369. doi: 10.1038/s41598-019-51941-z.
Landslide disasters cause huge casualties and economic losses every year, how to accurately forecast the landslides has always been an important issue in geo-environment research. In this paper, a hybrid machine learning approach RSLMT is firstly proposed by coupling Random Subspace (RS) and Logistic Model Tree (LMT) for producing a landslide susceptibility map (LSM). With this method, the uncertainty introduced by input features is considered, the problem of overfitting is solved by reducing dimensions to increase the prediction rate of landslide occurrence. Moreover, the uncertainty of prediction will be deeply discussed with the rank probability score (RPS) series, which is an important evaluation of uncertainty but rarely used in LSM. Qingchuan county, China was taken as a study area. 12 landslide causal factors were selected and their contribution on landslide occurrence was evaluated by ReliefF method. In addition, Logistic Model Tree (LMT), Naive Bayes (NB) and Logistic Regression (LR) were researched for comparison. The results showed that RSLMT (AUC = 0.815) outperformed LMT (AUC = 0.805), NB (AUC = 0.771), LR (AUC = 0.785). LSM of Qingchuan county was produced using the novel model, it indicated that landslides tend to occur along with the fault belts and the middle-low mountain area that is strongly influenced by the large numbers of human engineering activities.
滑坡灾害每年都会造成巨大的人员伤亡和经济损失,如何准确预测滑坡一直是地质环境研究的重要问题。本文首次提出了一种混合机器学习方法 RSLMT,该方法通过随机子空间(RS)和逻辑模型树(LMT)的耦合,生成滑坡易发性图(LSM)。该方法考虑了输入特征引入的不确定性,通过降维来解决过拟合问题,从而提高滑坡发生的预测率。此外,还将通过等级概率得分(RPS)系列深入讨论预测的不确定性,这是不确定性的一个重要评估指标,但在 LSM 中很少使用。中国青川县被选为研究区。选择了 12 个滑坡成因因素,并通过 ReliefF 方法评估它们对滑坡发生的贡献。此外,还研究了逻辑模型树(LMT)、朴素贝叶斯(NB)和逻辑回归(LR)进行比较。结果表明,RSLMT(AUC=0.815)优于 LMT(AUC=0.805)、NB(AUC=0.771)和 LR(AUC=0.785)。利用该模型生成了青川县的 LSM,表明滑坡易沿断裂带和中低山区发生,这些地区受大量人类工程活动的强烈影响。