Thorp Kelly R, Thompson Alison L, Herritt Matthew T
United States Department of Agriculture (USDA), Agricultural Research Service (ARS), Grassland Soil and Water Research Laboratory, Temple, TX, United States.
United States Department of Agriculture (USDA), Agricultural Research Service (ARS), U.S. Arid-Land Agricultural Research Center, Maricopa, AZ, United States.
Front Plant Sci. 2024 Nov 21;15:1495593. doi: 10.3389/fpls.2024.1495593. eCollection 2024.
Cotton ( L.) leaf chlorophyll (Chl) has been targeted as a phenotype for breeding selection to improve cotton tolerance to environmental stress. However, high-throughput phenotyping methods based on hyperspectral reflectance sensing are needed to rapidly screen cultivars for chlorophyll in the field. The objectives of this study were to deploy a cart-based field spectroradiometer to measure cotton leaf reflectance in two field experiments over four growing seasons at Maricopa, Arizona and to evaluate 148 spectral vegetation indices (SVI's) and 14 machine learning methods (MLM's) for estimating leaf chlorophyll from spectral information. Leaf tissue was sampled concurrently with reflectance measurements, and laboratory processing provided leaf Chl , Chl , and Chl as both areas-basis (µg cm) and mass-basis (mg g) measurements. Leaf reflectance along with several data transformations involving spectral derivatives, log-inverse reflectance, and SVI's were evaluated as MLM input. Models trained with 2019-2020 data performed poorly in tests with 2021-2022 data (e.g., RMSE=23.7% and r = 0.46 for area-basis Chl ), indicating difficulty transferring models between experiments. Performance was more satisfactory when training and testing data were based on a random split of all data from both experiments (e.g., RMSE=10.5% and r = 0.88 for area basis Chl ), but performance beyond the conditions of the present study cannot be guaranteed. Performance of SVI's was in the middle (e.g., RMSE=16.2% and r = 0.69 for area-basis Chl ), and SVI's provided more consistent error metrics compared to MLM's. Ensemble MLM's which combined estimates from several base estimators (e.g., random forest, gradient booting, and AdaBoost regressors) and a multi-layer perceptron neural network method performed best among MLM's. Input features based on spectral derivatives or SVI's improved MLM's performance compared to inputting reflectance data. Spectral reflectance data and SVI's involving red edge radiation were the most important inputs to MLM's for estimation of cotton leaf chlorophyll. Because MLM's struggled to perform beyond the constraints of their training data, SVI's should not be overlooked as practical plant trait estimators for high-throughput phenotyping, whereas MLM's offer great opportunity for data mining to develop more robust indices.
棉花(L.)叶片叶绿素(Chl)已成为育种选择的一个表型目标,以提高棉花对环境胁迫的耐受性。然而,需要基于高光谱反射传感的高通量表型分析方法来在田间快速筛选叶绿素含量不同的品种。本研究的目的是在亚利桑那州马里科帕的四个生长季节进行的两个田间试验中,使用基于推车的田间光谱辐射仪测量棉花叶片反射率,并评估148种光谱植被指数(SVI)和14种机器学习方法(MLM),以从光谱信息中估计叶片叶绿素含量。在测量反射率的同时采集叶片组织样本,实验室处理提供了以面积为基础(µg/cm²)和以质量为基础(mg/g)的叶片叶绿素a、叶绿素b和总叶绿素含量测量值。将叶片反射率以及涉及光谱导数、对数倒数反射率和SVI的几种数据转换作为MLM输入进行评估。用2019 - 2020年数据训练的模型在2021 - 2022年数据测试中表现不佳(例如,以面积为基础的叶绿素a的RMSE = 23.7%,r = 0.46),这表明在不同实验之间转移模型存在困难。当训练和测试数据基于两个实验所有数据的随机划分时,性能更令人满意(例如,以面积为基础的叶绿素a的RMSE = 10.5%,r = 0.88),但无法保证在本研究条件之外的性能。SVI的性能处于中等水平(例如,以面积为基础的叶绿素a的RMSE = 16.2%,r = 0.69),并且与MLM相比,SVI提供了更一致的误差指标。组合了几种基本估计器(例如随机森林、梯度提升和AdaBoost回归器)估计值的集成MLM以及多层感知器神经网络方法在MLM中表现最佳。与输入反射率数据相比,基于光谱导数或SVI的输入特征提高了MLM的性能。涉及红边辐射的光谱反射率数据和SVI是MLM估计棉花叶片叶绿素的最重要输入。由于MLM在超出其训练数据的约束条件下表现不佳,在高通量表型分析中,SVI作为实用的植物性状估计器不应被忽视,而MLM为数据挖掘以开发更强大的指数提供了巨大机会。