Angel Yoseline, McCabe Matthew F
Hydrology, Agriculture and Land Observation Group, Water Desalination and Reuse Center, King Abdullah University of Science and Technology, Thuwal, Saudi Arabia.
Front Plant Sci. 2022 Mar 11;13:722442. doi: 10.3389/fpls.2022.722442. eCollection 2022.
Monitoring leaf Chlorophyll (Chl) is labor-intensive, limiting representative sampling for detailed mapping of Chl variability at field scales across time. Unmanned aeria-l vehicles (UAV) and hyperspectral cameras provide flexible platforms for observing agricultural systems, overcoming this spatio-temporal sampling constraint. Here, we evaluate a customized machine learning (ML) workflow to retrieve multi-temporal leaf-Chl levels, combining sub-centimeter resolution UAV-hyperspectral imagery (400-1,000 nm) with leaf-level reflectance spectra and SPAD measurements, capturing temporal correlations, selecting relevant predictors, and retrieving accurate results under different conditions. The study is performed within a phenotyping experiment to monitor wild tomato plants' development. Several analyses were conducted to evaluate multiple ML strategies, including: (1) exploring sequential versus retraining learning; (2) comparing insights gained from using 272 spectral bands versus 60 pigment-based vegetation indices (VIs); and (3) assessing six regression methods (linear, partial-least-square regression; PLSR, decision trees, support vector, ensemble trees, and Gaussian process; GPR). Goodness-of-fit ( ) and accuracy metrics (MAE, RMSE) were determined using training/testing and validation data subsets to assess the models' performance. Overall, while equally good performance was obtained using either PLSR, GPR, or random forest, results show: (1) the retraining strategy improved the ability of most of the approaches to model SPAD-based Chl dynamics; (2) comparative analysis between retrievals and validation data distributions informed the models' ability to capture Chl dynamics through SPAD levels; (3) VI predictors slightly improved (e.g., from 0.59 to 0.74 units for GPR) and accuracy (e.g., MAE and RMSE differences of up to 2 SPAD units) in specific algorithms; (4) feature importance examined through these methods, revealed strong overlaps between relevant bands and VI predictors, highlighting a few decisive spectral ranges and indices useful for retrieving leaf-Chl levels. The proposed ML framework allows the retrieval of high-quality spatially distributed and multi-temporal SPAD-based chlorophyll maps at an ultra-high pixel resolution (e.g., 7 mm).
监测叶片叶绿素(Chl)需要耗费大量人力,这限制了在田间尺度上随时间对Chl变异性进行详细制图的代表性采样。无人机(UAV)和高光谱相机为观察农业系统提供了灵活的平台,克服了这种时空采样限制。在此,我们评估了一种定制的机器学习(ML)工作流程,以检索多时间尺度的叶片Chl水平,该流程将亚厘米分辨率的无人机高光谱图像(400 - 1000nm)与叶片水平的反射光谱和SPAD测量相结合,捕捉时间相关性,选择相关预测因子,并在不同条件下获取准确结果。该研究在一个表型实验中进行,以监测野生番茄植株的发育。进行了多项分析以评估多种ML策略,包括:(1)探索顺序学习与重新训练学习;(2)比较使用272个光谱带与60个基于色素的植被指数(VIs)所获得的见解;(3)评估六种回归方法(线性回归、偏最小二乘回归;PLSR、决策树、支持向量机、集成树和高斯过程;GPR)。使用训练/测试和验证数据子集确定拟合优度( )和准确性指标(MAE、RMSE),以评估模型的性能。总体而言,虽然使用PLSR、GPR或随机森林获得了同样良好的性能,但结果表明:(1)重新训练策略提高了大多数方法对基于SPAD的Chl动态进行建模的能力;(2)检索数据与验证数据分布之间的比较分析有助于模型通过SPAD水平捕捉Chl动态的能力;(3)在特定算法中,VI预测因子略微提高了拟合优度(例如,GPR的拟合优度从0.59提高到0.74单位)和准确性(例如,MAE和RMSE差异高达2个SPAD单位);(4)通过这些方法检查的特征重要性揭示了相关波段与VI预测因子之间的强烈重叠,突出了一些对检索叶片Chl水平有用的决定性光谱范围和指数。所提出的ML框架允许以超高像素分辨率(例如7mm)检索基于SPAD的高质量空间分布和多时间尺度的叶绿素图。