Selvaraj Michael Gomez, Valderrama Manuel, Guzman Diego, Valencia Milton, Ruiz Henry, Acharjee Animesh
International Center for Tropical Agriculture (CIAT), A.A. 6713 Cali, Colombia.
Department of Soil and Crop Sciences, Texas A&M University, College Station, TX USA.
Plant Methods. 2020 Jun 14;16:87. doi: 10.1186/s13007-020-00625-1. eCollection 2020.
Rapid non-destructive measurements to predict cassava root yield over the full growing season through large numbers of germplasm and multiple environments is a huge challenge in Cassava breeding programs. As opposed to waiting until the harvest season, multispectral imagery using unmanned aerial vehicles (UAV) are capable of measuring the canopy metrics and vegetation indices (VIs) traits at different time points of the growth cycle. This resourceful time series aerial image processing with appropriate analytical framework is very important for the automatic extraction of phenotypic features from the image data. Many studies have demonstrated the usefulness of advanced remote sensing technologies coupled with machine learning (ML) approaches for accurate prediction of valuable crop traits. Until now, Cassava has received little to no attention in aerial image-based phenotyping and ML model testing.
To accelerate image processing, an automated image-analysis framework called CIAT Pheno-i was developed to extract plot level vegetation indices/canopy metrics. Multiple linear regression models were constructed at different key growth stages of cassava, using ground-truth data and vegetation indices obtained from a multispectral sensor. Henceforth, the spectral indices/features were combined to develop models and predict cassava root yield using different Machine learning techniques. Our results showed that (1) Developed CIAT pheno-i image analysis framework was found to be easier and more rapid than manual methods. (2) The correlation analysis of four phenological stages of cassava revealed that elongation (EL) and late bulking (LBK) were the most useful stages to estimate above-ground biomass (AGB), below-ground biomass (BGB) and canopy height (CH). (3) The multi-temporal analysis revealed that cumulative image feature information of EL + early bulky (EBK) stages showed a higher significant correlation ( = 0.77) for Green Normalized Difference Vegetation indices (GNDVI) with BGB than individual time points. Canopy height measured on the ground correlated well with UAV (CHuav)-based measurements ( = 0.92) at late bulking (LBK) stage. Among different image features, normalized difference red edge index (NDRE) data were found to be consistently highly correlated ( = 0.65 to 0.84) with AGB at LBK stage. (4) Among the four ML algorithms used in this study, k-Nearest Neighbours (kNN), Random Forest (RF) and Support Vector Machine (SVM) showed the best performance for root yield prediction with the highest accuracy of R = 0.67, 0.66 and 0.64, respectively.
UAV platforms, time series image acquisition, automated image analytical framework (CIAT Pheno-i), and key vegetation indices (VIs) to estimate phenotyping traits and root yield described in this work have great potential for use as a selection tool in the modern cassava breeding programs around the world to accelerate germplasm and varietal selection. The image analysis software (CIAT Pheno-i) developed from this study can be widely applicable to any other crop to extract phenotypic information rapidly.
在木薯育种计划中,通过大量种质和多种环境在整个生长季节快速进行无损测量以预测木薯根产量是一项巨大挑战。与等到收获季节不同,使用无人机(UAV)的多光谱图像能够在生长周期的不同时间点测量冠层指标和植被指数(VIs)特征。这种借助适当分析框架的丰富时间序列航空图像处理对于从图像数据中自动提取表型特征非常重要。许多研究已经证明了先进的遥感技术与机器学习(ML)方法相结合对于准确预测有价值的作物性状的有用性。到目前为止,木薯在基于航空图像的表型分析和ML模型测试中很少受到关注。
为了加速图像处理,开发了一个名为CIAT Pheno-i的自动图像分析框架来提取地块级植被指数/冠层指标。在木薯的不同关键生长阶段构建了多元线性回归模型,使用地面真值数据和从多光谱传感器获得的植被指数。此后,将光谱指数/特征组合起来开发模型,并使用不同的机器学习技术预测木薯根产量。我们的结果表明:(1)发现开发的CIAT pheno-i图像分析框架比手动方法更简单、更快速。(2)木薯四个物候期的相关性分析表明,伸长(EL)和后期膨大(LBK)是估计地上生物量(AGB)、地下生物量(BGB)和冠层高度(CH)最有用的阶段。(3)多时间分析表明,EL + 早期膨大(EBK)阶段的累积图像特征信息对于绿色归一化差异植被指数(GNDVI)与BGB的相关性比单个时间点更高(= 0.77)。在后期膨大(LBK)阶段,地面测量的冠层高度与基于无人机(CHuav)的测量高度相关性良好(= 0.92)。在不同的图像特征中,发现归一化差异红边指数(NDRE)数据在LBK阶段与AGB始终具有高度相关性(= 0.65至0.84)。(4)在本研究中使用的四种ML算法中,k近邻(kNN)、随机森林(RF)和支持向量机(SVM)在根产量预测方面表现最佳,最高准确率分别为R = 0.67、0.66和0.64。
本文描述的无人机平台、时间序列图像采集、自动图像分析框架(CIAT Pheno-i)以及用于估计表型性状和根产量的关键植被指数(VIs)在作为世界各地现代木薯育种计划中的选择工具以加速种质和品种选择方面具有巨大潜力。本研究开发的图像分析软件(CIAT Pheno-i)可广泛应用于任何其他作物,以快速提取表型信息。