Institute of Agricultural Remote Sensing and Information Technology Application, College of Environmental and Resource Sciences, Zhejiang University, Hangzhou 310058, China.
Department of Environmental Engineering, Yuzhang Normal University, Nanchang 330103, China.
Sensors (Basel). 2019 Jan 11;19(2):263. doi: 10.3390/s19020263.
Soil organic matter (SOM) and pH are essential soil fertility indictors of paddy soil in the middle-lower Yangtze Plain. Rapid, non-destructive and accurate determination of SOM and pH is vital to preventing soil degradation caused by inappropriate land management practices. Visible-near infrared (vis-NIR) spectroscopy with multivariate calibration can be used to effectively estimate soil properties. In this study, 523 soil samples were collected from paddy fields in the Yangtze Plain, China. Four machine learning approaches-partial least squares regression (PLSR), least squares-support vector machines (LS-SVM), extreme learning machines (ELM) and the Cubist regression model (Cubist)-were used to compare the prediction accuracy based on vis-NIR full bands and bands reduced using the genetic algorithm (GA). The coefficient of determination (R²), root mean square error (RMSE), and ratio of performance to inter-quartile distance (RPIQ) were used to assess the prediction accuracy. The ELM with GA reduced bands was the best model for SOM (SOM: R² = 0.81, RMSE = 5.17, RPIQ = 2.87) and pH (R² = 0.76, RMSE = 0.43, RPIQ = 2.15). The performance of the LS-SVM for pH prediction did not differ significantly between the model with GA (R² = 0.75, RMSE = 0.44, RPIQ = 2.08) and without GA (R² = 0.74, RMSE = 0.45, RPIQ = 2.07). Although a slight increase was observed when ELM were used for prediction of SOM and pH using reduced bands (SOM: R² = 0.81, RMSE = 5.17, RPIQ = 2.87; pH: R² = 0.76, RMSE = 0.43, RPIQ = 2.15) compared with full bands (R² = 0.81, RMSE = 5.18, RPIQ = 2.83; pH: R² = 0.76, RMSE = 0.45, RPIQ = 2.07), the number of wavelengths was greatly reduced (SOM: 201 to 44; pH: 201 to 32). Thus, the ELM coupled with reduced bands by GA is recommended for prediction of properties of paddy soil (SOM and pH) in the middle-lower Yangtze Plain.
土壤有机质(SOM)和 pH 值是长江中下游平原稻田土壤肥力的重要指标。快速、无损和准确地测定 SOM 和 pH 值对于防止因土地管理不当而导致的土壤退化至关重要。多元校正的可见-近红外(vis-NIR)光谱可用于有效估计土壤性质。本研究采集了中国长江中下游平原稻田的 523 个土壤样本。采用偏最小二乘回归(PLSR)、最小二乘支持向量机(LS-SVM)、极限学习机(ELM)和 Cubist 回归模型(Cubist)4 种机器学习方法,比较了基于 vis-NIR 全波段和遗传算法(GA)降维后波段的预测精度。采用决定系数(R²)、均方根误差(RMSE)和四分位距内性能比(RPIQ)评估预测精度。结果表明,GA 降维后的 ELM 模型对 SOM(R² = 0.81,RMSE = 5.17,RPIQ = 2.87)和 pH(R² = 0.76,RMSE = 0.43,RPIQ = 2.15)的预测效果最好。GA 对 LS-SVM 模型预测 pH 值的性能没有显著影响,GA 模型(R² = 0.75,RMSE = 0.44,RPIQ = 2.08)与无 GA 模型(R² = 0.74,RMSE = 0.45,RPIQ = 2.07)的预测效果相同。与全波段(R² = 0.81,RMSE = 5.18,RPIQ = 2.83;pH:R² = 0.76,RMSE = 0.45,RPIQ = 2.07)相比,ELM 模型使用降维波段(SOM:R² = 0.81,RMSE = 5.17,RPIQ = 2.87;pH:R² = 0.76,RMSE = 0.43,RPIQ = 2.15)时,SOM 和 pH 值的预测精度略有提高,但所用的波长数量大大减少(SOM:201 到 44;pH:201 到 32)。因此,建议采用 GA 与降维波段相结合的 ELM 模型来预测长江中下游平原稻田土壤(SOM 和 pH 值)的性质。