Department of Soils and Agrifood Engineering, Université Laval, Québec, Canada.
PLoS One. 2021 Jul 20;16(7):e0233242. doi: 10.1371/journal.pone.0233242. eCollection 2021.
Accuracy of infrared (IR) models to measure soil particle-size distribution (PSD) depends on soil preparation, methodology (sedimentation, laser), settling times and relevant soil features. Compositional soil data may require log ratio (ilr) transformation to avoid numerical biases. Machine learning can relate numerous independent variables that may impact on NIR spectra to assess particle-size distribution. Our objective was to reach high IRS prediction accuracy across a large range of PSD methods and soil properties. A total of 1298 soil samples from eastern Canada were IR-scanned. Spectra were processed by Stochastic Gradient Boosting (SGB) to predict sand, silt, clay and carbon. Slope and intercept of the log-log relationships between settling time and suspension density function (SDF) (R2 = 0.84-0.92) performed similarly to NIR spectra using either ilr-transformed (R2 = 0.81-0.93) or raw percentages (R2 = 0.76-0.94). Settling times of 0.67-min and 2-h were the most accurate for NIR predictions (R2 = 0.49-0.79). The NIR prediction of sand sieving method (R2 = 0.66) was more accurate than sedimentation method(R2 = 0.53). The NIR 2X gain was less accurate (R2 = 0.69-0.92) than 4X (R2 = 0.87-0.95). The MIR (R2 = 0.45-0.80) performed better than NIR (R2 = 0.40-0.71) spectra. Adding soil carbon, reconstituted bulk density, pH, red-green-blue color, oxalate and Mehlich3 extracts returned R2 value of 0.86-0.91 for texture prediction. In addition to slope and intercept of the SDF, 4X gain, method and pre-treatment classes, soil carbon and color appeared to be promising features for routine SGB-processed NIR particle-size analysis. Machine learning methods support cost-effective soil texture NIR analysis.
红外(IR)模型测量土壤颗粒大小分布(PSD)的准确性取决于土壤准备、方法(沉降、激光)、沉降时间和相关土壤特征。组成土壤数据可能需要对数比(ilr)转换以避免数值偏差。机器学习可以将可能影响近红外光谱的大量独立变量联系起来,以评估颗粒大小分布。我们的目标是在广泛的 PSD 方法和土壤特性范围内达到高 IRS 预测精度。总共对来自加拿大东部的 1298 个土壤样本进行了红外扫描。使用 Stochastic Gradient Boosting(SGB)对光谱进行处理,以预测砂、粉土、粘土和碳。沉降时间与悬浮密度函数(SDF)之间的对数对数关系的斜率和截距(R2 = 0.84-0.92)与使用 ilr 转换的近红外光谱(R2 = 0.81-0.93)或原始百分比(R2 = 0.76-0.94)表现相似。沉降时间为 0.67 分钟和 2 小时的近红外预测最准确(R2 = 0.49-0.79)。砂筛方法的近红外预测(R2 = 0.66)比沉降法(R2 = 0.53)更准确。近红外 2X 增益(R2 = 0.69-0.92)不如 4X (R2 = 0.87-0.95)准确。MIR(R2 = 0.45-0.80)的性能优于近红外(R2 = 0.40-0.71)光谱。添加土壤碳、重构体密度、pH 值、红-绿-蓝颜色、草酸盐和 Mehlich3 提取物后,纹理预测的 R2 值为 0.86-0.91。除了 SDF 的斜率和截距、4X 增益、方法和预处理类别外,土壤碳和颜色似乎是常规 SGB 处理近红外粒度分析有前途的特征。机器学习方法支持具有成本效益的土壤质地近红外分析。