College of Energy and Power Engineering, Lanzhou University of Technology, Lanzhou, China.
JiuQuan Vocational Technical College, JiuQuan, China.
PeerJ. 2024 Sep 26;12:e18186. doi: 10.7717/peerj.18186. eCollection 2024.
Timely and accurate monitoring of soil salinity content (SSC) is essential for precise irrigation management of large-scale farmland. Uncrewed aerial vehicle (UAV) low-altitude remote sensing with high spatial and temporal resolution provides a scientific and effective technical means for SSC monitoring. Many existing soil salinity inversion models have only been tested by a single variable selection method or machine learning algorithm, and the influence of variable selection method combined with machine learning algorithm on the accuracy of soil salinity inversion remain further studied.
Firstly, based on UAV multispectral remote sensing data, by extracting the spectral reflectance of each sampling point to construct 30 spectral indexes, and using the pearson correlation coefficient (PCC), gray relational analysis (GRA), variable projection importance (VIP), and support vector machine-recursive feature elimination (SVM-RFE) to screen spectral index and realize the selection of sensitive variables. Subsequently, screened and unscreened variables as model input independent variables, constructed 20 soil salinity inversion models based on the support vector machine regression (SVM), back propagation neural network (BPNN), extreme learning machine (ELM), and random forest (RF) machine learning algorithms, the aim is to explore the feasibility of different variable selection methods combined with machine learning algorithms in SSC inversion of crop-covered farmland. To evaluate the performance of the soil salinity inversion model, the determination coefficient (R), root mean square error (RMSE) and performance deviation ratio (RPD) were used to evaluate the model performance, and determined the best variable selection method and soil salinity inversion model by taking alfalfa covered farmland in arid oasis irrigation areas of China as the research object.
The variable selection combined with machine learning algorithm can significantly improve the accuracy of remote sensing inversion of soil salinity. The performance of the models has been improved markedly using the four variable selection methods, and the applicability varied among the four methods, the GRA variable selection method is suitable for SVM, BPNN, and ELM modeling, while the PCC method is suitable for RF modeling. The GRA-SVM is the best soil salinity inversion model in alfalfa cover farmland, with R of 0.8888, RMSE of 0.1780, and RPD of 1.8115 based on the model verification dataset, and the spatial distribution map of soil salinity can truly reflect the degree of soil salinization in the study area.
Based on our findings, the variable selection combined with machine learning algorithm is an effective method to improve the accuracy of soil salinity remote sensing inversion, which provides a new approach for timely and accurate acquisition of crops covered farmland soil salinity information.
及时、准确地监测土壤盐分含量(SSC)对于大规模农田的精确灌溉管理至关重要。无人飞行器(UAV)低空遥感具有高时空分辨率,为 SSC 监测提供了科学有效的技术手段。许多现有的土壤盐分反演模型仅通过单一变量选择方法或机器学习算法进行了测试,变量选择方法与机器学习算法的结合对土壤盐分反演精度的影响仍需进一步研究。
首先,基于 UAV 多光谱遥感数据,通过提取每个采样点的光谱反射率来构建 30 个光谱指标,利用皮尔逊相关系数(PCC)、灰色关联分析(GRA)、变量投影重要性(VIP)和支持向量机递归特征消除(SVM-RFE)筛选光谱指标,实现敏感变量的选择。随后,将筛选和未筛选的变量作为模型输入的自变量,基于支持向量机回归(SVM)、反向传播神经网络(BPNN)、极限学习机(ELM)和随机森林(RF)机器学习算法构建 20 个土壤盐分反演模型,旨在探讨不同变量选择方法与机器学习算法相结合在作物覆盖农田 SSC 反演中的可行性。为了评估土壤盐分反演模型的性能,采用决定系数(R)、均方根误差(RMSE)和性能偏差比(RPD)来评估模型性能,并以中国干旱绿洲灌区的苜蓿覆盖农田为研究对象,确定最佳的变量选择方法和土壤盐分反演模型。
变量选择与机器学习算法的结合可以显著提高土壤盐分遥感反演的精度。四种变量选择方法均显著提高了模型的性能,且四种方法的适用性不同,GRA 变量选择方法适用于 SVM、BPNN 和 ELM 建模,而 PCC 方法适用于 RF 建模。在苜蓿覆盖农田中,GRA-SVM 是最佳的土壤盐分反演模型,基于模型验证数据集,R 为 0.8888、RMSE 为 0.1780、RPD 为 1.8115,土壤盐分的空间分布图能真实反映研究区土壤盐渍化程度。
基于研究结果,变量选择与机器学习算法的结合是提高土壤盐分遥感反演精度的有效方法,为及时、准确获取作物覆盖农田土壤盐分信息提供了新途径。