Liu Jiarun, Yang Zihang, Li Lin, Chu Xiaoxue, Wei Shiguang, Lian Juyu
School of Life & Environmental Sciences Guilin University of Electronic Technology Guilin Guangxi China.
Key Laboratory of Ecology of Rare and Endangered Species and Environmental Protection, Ministry of Education - Guangxi Key Laboratory of Landscape Resources Conservation and Sustainable Utilization in Lijiang River Basin Guangxi Normal University Guilin Guangxi China.
Ecol Evol. 2025 May 26;15(5):e71499. doi: 10.1002/ece3.71499. eCollection 2025 May.
Motivated by the need to enhance the accuracy of forest aboveground carbon storage (ACS) assessments, this study aimed to explore the effectiveness of different machine learning models in predicting ACS across various subtropical forest types in southern China. The study was conducted in southern China, focusing on different types of subtropical forests. This region harbors several types of subtropical forests, which are rarely found at similar latitudes in the world. Variance inflation factor was employed to screen independent variables, resulting in the selection of 13 significant predictors. Four machine learning models-support vector machine (SVM), random forest (RF), multi-layer perceptron (MLP), and extreme gradient boosting (XGB)-were constructed to estimate carbon storage. Model performance was evaluated using root mean square error, coefficient of determination ( ), and mean absolute error. The model with the best generalization ability was selected to calculate SHAP values for each predictor. The XGB model demonstrated superior performance across all forest types, with values ranging from 0.898 to 0.974. In mountainous evergreen broad-leaved forests, the prediction accuracy followed the order of XGB>MLP>SVM>RF. In valley rainforests, MLP showed the highest value, but with higher MAE and RMSE, making it the second-best choice. The RF model performed moderately, while the SVM model showed the poorest performance. The SHAP values indicated that maximum diameter at breast height, slope, mean DBH, species evenness, altitude, and maximum tree height had significant effects on ACS. XGB model exhibits the best prediction performance and strongest adaptability for estimating ACS in subtropical southern China forests. Additionally, the MLP model can serve as an effective model for assessing carbon storage in valley rainforests within this region. Machine learning methods provide valuable references for predicting and assessing ACS in different types of zonal forests.
出于提高森林地上碳储量(ACS)评估准确性的需要,本研究旨在探讨不同机器学习模型在中国南方各种亚热带森林类型中预测ACS的有效性。该研究在中国南方进行,重点关注不同类型的亚热带森林。该地区拥有多种亚热带森林类型,这在世界上相似纬度地区很少见。采用方差膨胀因子筛选自变量,最终选择了13个显著预测因子。构建了四种机器学习模型——支持向量机(SVM)、随机森林(RF)、多层感知器(MLP)和极端梯度提升(XGB)——来估算碳储量。使用均方根误差、决定系数( )和平均绝对误差评估模型性能。选择具有最佳泛化能力的模型来计算每个预测因子的SHAP值。XGB模型在所有森林类型中均表现出卓越性能, 值范围为0.898至0.974。在山地常绿阔叶林中,预测准确性顺序为XGB>MLP>SVM>RF。在山谷雨林中,MLP显示出最高的 值,但MAE和RMSE较高,使其成为第二优选择。RF模型表现适中,而SVM模型表现最差。SHAP值表明,胸径最大、坡度、平均胸径、物种均匀度、海拔和最大树高对ACS有显著影响。XGB模型在估算中国南方亚热带森林的ACS方面表现出最佳预测性能和最强适应性。此外,MLP模型可作为评估该地区山谷雨林碳储量的有效模型。机器学习方法为预测和评估不同类型地带性森林的ACS提供了有价值的参考。