Faculty of Electrical Engineering, Trzaska cesta 25, 1000 Ljubljana, Slovenia.
Sensors (Basel). 2021 Jun 19;21(12):4208. doi: 10.3390/s21124208.
The research presented in this paper is based on the hypothesis that the machine learning approach improves the accuracy of soil properties prediction. The correlations obtained in this research are important for understanding the overall strategy for soil properties prediction using optical spectroscopy sensors. Several research results have been stated and investigated. A comparison is made between six commonly used techniques: Random Forest, Decision Tree, Naïve Bayes, Support Vector Machine, Least-Square Support Vector Machine and Artificial Neural Network, showing that the best prediction accuracy cannot always be achieved by the most common and complicated method. The influence of the chosen category for nutrient characterization was investigated, indicating better prediction when a multi-component strategy was used. In contrast, the prediction of single-component soil properties was less accurate. In addition, the influence of category levels was not as significant as expected when choosing between 3-level, 5-level or 13-level nutrient characterization for some nutrients, which can be used for a more precise nutrient characterization strategy. A comparative analysis was performed between soil from a local farm with similar texture and soils collected from different locations in Slovenia, which gave a better prediction for a local farm. Finally, the influence of principal component analysis was validated using 5, 10, 20 and 50 first principal components, indicating the better performance of machine learning when using the 50 principal components.
本文提出的研究基于这样一个假设,即机器学习方法可以提高土壤性质预测的准确性。本研究中获得的相关性对于理解使用光学光谱传感器进行土壤性质预测的整体策略非常重要。已经提出并研究了几项研究结果。比较了六种常用技术:随机森林、决策树、朴素贝叶斯、支持向量机、最小二乘支持向量机和人工神经网络,结果表明,最常用和最复杂的方法并不总是能达到最佳预测精度。研究了选择养分特征化类别对预测的影响,结果表明,使用多组分策略时预测效果更好。相反,对于单一组分土壤性质的预测,精度则较低。此外,当在 3 级、5 级或 13 级营养特征化之间进行选择时,一些营养物质的类别水平的影响并不像预期的那样显著,这可以用于更精确的营养特征化策略。对具有相似质地的当地农场土壤和从斯洛文尼亚不同地点采集的土壤进行了比较分析,结果表明,当地农场的预测效果更好。最后,通过使用 5、10、20 和 50 个第一主成分验证了主成分分析的影响,结果表明,使用 50 个主成分时,机器学习的性能更好。