Liu Baoyang, Guo Baofeng, Zhuo Renxiong, Dai Fan
School of Automation, Hangzhou Dianzi University, Hangzhou 310018, China; Communications Information Transmission and Convergence Technology Laboratory, Hangzhou 310018, China.
Spectrochim Acta A Mol Biomol Spectrosc. 2024 Nov 15;321:124687. doi: 10.1016/j.saa.2024.124687. Epub 2024 Jun 19.
Soil Organic Carbon (SOC) is crucial for determining soil fertility and environmental quality. The problem with traditional SOC chemical analysis methods is that they are time-consuming and resource-intensive. In recent years, visible-near infrared (Vis-NIR) spectroscopy has been employed as an alternative method for SOC determination. However, when applied on a larger scale, the prediction accuracy of soil properties decreases due to the heterogeneity of samples. Therefore, this study compared and analyzed the performance of partial least squares regression (PLSR), support vector regression (SVR), random forest (RF), and gaussian process regression (GPR) in predicting SOC. On this basis, a GPR model based on a hybrid kernel function (HKF-GPR) was proposed for SOC prediction. This hybrid kernel function was designed according to the properties of single kernel functions and the characteristics of soil spectral data. Results indicate that in large soil spectral databases, the GPR model outperforms other models in estimating SOC. The HKF-GPR model achieved the best SOC estimation accuracy, with an R of 0.7671, RMSE of 5.2934 g/kg, RPD of 2.0721, and RPIQ of 2.5789. Compared to other regression models, the HKF-GPR model proposed in this paper offers broader applicability and superior performance, enabling SOC estimation in large soil spectral libraries.
土壤有机碳(SOC)对于确定土壤肥力和环境质量至关重要。传统的SOC化学分析方法存在耗时且资源密集的问题。近年来,可见-近红外(Vis-NIR)光谱法已被用作测定SOC的替代方法。然而,在大规模应用时,由于样品的异质性,土壤性质的预测准确性会降低。因此,本研究比较并分析了偏最小二乘回归(PLSR)、支持向量回归(SVR)、随机森林(RF)和高斯过程回归(GPR)在预测SOC方面的性能。在此基础上,提出了一种基于混合核函数的GPR模型(HKF-GPR)用于SOC预测。这种混合核函数是根据单核函数的性质和土壤光谱数据的特征设计的。结果表明,在大型土壤光谱数据库中,GPR模型在估计SOC方面优于其他模型。HKF-GPR模型实现了最佳的SOC估计精度,R为0.7671,RMSE为5.2934 g/kg,RPD为2.0721,RPIQ为2.5789。与其他回归模型相比,本文提出的HKF-GPR模型具有更广泛的适用性和卓越的性能,能够在大型土壤光谱库中进行SOC估计。