Onyelowe Kennedy C, Kamchoom Viroon, Gnananandarao Tammineni, Arunachalam Krishna P
Department of Civil Engineering, Michael Okpara University of Agriculture, Umudike, 440109, Nigeria.
Department of Civil Engineering, Kampala International University, Kampala, Uganda.
Sci Rep. 2025 Jul 1;15(1):21509. doi: 10.1038/s41598-025-07494-5.
Earthquake-inducedliquefaction of soils poses a serious georisk in geotechnical designs, construction and the application of geotechnical structures around the world. In this study, the applicability of three soft computing models for liquefaction classification, a topic of significant importance within the fields of geotechnical and earthquake engineering has been evaluated. Twelve input parameters are used to classify the liquefaction potential for 234 data sets collected from an earthquake-induced liquefaction prone granular material environment. For developing the SVM_Poly, SVM_RBK models, an extensive number of trials were conducted using various combinations of C and d for polynomial kernels and C and ∂ for radial basis function kernel-based support vector machines (SVMs) utilizing user-defined parameters. In the same way, several experiments were conducted with a fixed value of C and ∂ kernel specific parameters in order to determine an appropriate value of error-insensitive zone (∋).Similarly, for the random forest classifier (RFC) model, the number of variables used (m) and the number of trees to be grown (k) are two user-defined parameters. These optimum values of m and k parameters are fixed using trial and error process and the same fixed values. The best model was developed as evidence from the confusion matrixes and statistical indicators. The calculated values of confusion matrixes and statistical indicators for training and testing shows that an accuracy of 0.89 indicates the model is correct in its predictions 89% of the time. A sensitivity of 0.85 signifies the model correctly identifies 85% of actual positive instances, while a specificity of 0.94 implies correct identification of 94% of actual negative instances. A precision of 0.94 suggests that when the model predicts a positive instance, it is correct 94% of the time. The Phi Correlation Coefficient, with a value of 0.82, indicates a strong positive correlation between predicted and actual values.Furthermore, the model exhibits a Mean Absolute Error (MAE) of 0.2351, reflecting a relatively low average error in predictions. The Root Mean Squared Error (RMSE) value of 0.3115 indicates better accuracy in predicting the target variable.Finally, all the developed models exhibit promising performance across various evaluation metrics, with low error measures (MAE and RMSE), high accuracy, and strong performance in correctly identifying both positive and negative instances, as evidenced by sensitivity and specificity. The high precision and Phi Correlation Coefficient further affirm the reliability and accuracy of the model's predictions. However, among the three models FRC model is the best for classifying the liquefaction. The novelty of this research lies in its comparative evaluation and optimization of SVM_Poly, SVM_RBK, and RFC models using a comprehensive set of seismic and soil parameters to accurately classify earthquake-induced liquefaction potential, with the RFC model demonstrating superior predictive performance.
地震引发的土壤液化在世界各地的岩土工程设计、施工以及岩土结构应用中构成了严重的地质风险。在本研究中,对三种软计算模型在液化分类中的适用性进行了评估,这是岩土工程和地震工程领域中一个非常重要的课题。使用十二个输入参数对从地震易引发液化的粒状材料环境中收集的234个数据集的液化潜力进行分类。为了开发支持向量机多项式(SVM_Poly)和支持向量机径向基核(SVM_RBK)模型,利用用户定义的参数,针对多项式核的C和d以及基于径向基函数核的支持向量机的C和∂的各种组合进行了大量试验。同样,为了确定误差不敏感区(∋)的合适值,对C和∂核特定参数的固定值进行了多次实验。类似地,对于随机森林分类器(RFC)模型,使用的变量数量(m)和要生长的树的数量(k)是两个用户定义的参数。通过试错过程和相同的固定值来确定m和k参数的这些最佳值。根据混淆矩阵和统计指标证明开发出了最佳模型。训练和测试的混淆矩阵和统计指标的计算值表明,0.89的准确率表明该模型在89%的时间内预测正确。0.85的灵敏度表示该模型正确识别了85%的实际正例,而0.94的特异性意味着正确识别了94%的实际负例。0.94的精度表明当模型预测一个正例时,它在94%的时间内是正确的。Phi相关系数值为0.82,表明预测值与实际值之间存在强正相关。此外,该模型的平均绝对误差(MAE)为0.2351,反映出预测中的平均误差相对较低。均方根误差(RMSE)值为0.3115表明在预测目标变量方面具有更好的准确性。最后,所有开发的模型在各种评估指标上都表现出良好的性能,具有低误差度量(MAE和RMSE)、高精度以及在正确识别正例和负例方面的强大性能,灵敏度和特异性证明了这一点。高精度和Phi相关系数进一步证实了模型预测的可靠性和准确性。然而,在这三个模型中,FRC模型在液化分类方面是最好的。本研究的新颖之处在于使用一套全面的地震和土壤参数对SVM_Poly、SVM_RBK和RFC模型进行比较评估和优化,以准确分类地震引发的液化潜力,其中RFC模型表现出卓越的预测性能。