College of Civil Engineering, Wuhan City Polytechnic, Wuhan, 430074, Hubei, China.
School of Geography and Information Engineering, China University of Geosciences, Wuhan, 430074, Hubei, China.
Sci Rep. 2024 Oct 27;14(1):25598. doi: 10.1038/s41598-024-76872-2.
Leaf chlorophyll content (LCC) is crucial for monitoring the physiological processes of crops. Many studies have utilized spectral features to develop regression models for accurate LCC estimation, enabling the quantitative assessment and evaluation of crop growth status. The selection of optimal spectral features and regression algorithms significantly affects the precision of LCC estimation. In this study, we compared and analyzed the optimal spectral features for LCC estimation, as well as the consistency of machine learning methods across different crop types, phenology periods, and sensors. First, we extracted various spectral features, including the original spectral features (OS), first-order derivative spectral features (FDS), original continuum-removed spectra (CR) along with their four related derivative spectral features, principal component variables derived from different spectral features, and highly correlated spectral features with LCC. These extracted spectral features were then employed to construct the LCC models using six common regression algorithms on different datasets. Finally, we analyzed the optimal combination of spectral features and regression algorithms for accurate LCC estimation considering various dimensions, such as crop type, phenological period, and sensor used in observation conditions. The results demonstrate that the combinations of the principal component variables of continuum-removed derivative reflectance with the top 10 correlations with LCC (PCA_CRDR_R) combined with Gaussian process regression (GPR) can be considered as the optimal choice for estimating LCC under diverse observation conditions at a canopy scale, and its R can reach 0.62 for sugar beet LCC estimation; thus providing valuable theoretical guidance for selecting appropriate spectral features for LCC estimation.
叶片叶绿素含量(LCC)是监测作物生理过程的关键。许多研究利用光谱特征来开发回归模型,以实现对 LCC 的准确估计,从而对作物生长状况进行定量评估和评价。选择最佳的光谱特征和回归算法会显著影响 LCC 估计的精度。在本研究中,我们比较和分析了用于 LCC 估计的最佳光谱特征,以及机器学习方法在不同作物类型、物候期和传感器之间的一致性。首先,我们提取了各种光谱特征,包括原始光谱特征(OS)、一阶导数光谱特征(FDS)、原始连续体去除光谱(CR)及其四个相关的导数光谱特征、从不同光谱特征衍生的主成分变量,以及与 LCC 高度相关的光谱特征。然后,我们使用六种常见的回归算法在不同的数据集上构建了基于这些提取的光谱特征的 LCC 模型。最后,我们从作物类型、物候期和观测条件下使用的传感器等多个维度分析了用于准确估计 LCC 的最佳光谱特征和回归算法的组合。结果表明,在冠层尺度下,使用连续体去除导数的主成分变量与与 LCC 高度相关的前 10 个光谱特征(PCA_CRDR_R)的组合结合高斯过程回归(GPR)可以被认为是在各种观测条件下估计 LCC 的最佳选择,其 R 可以达到 0.62,用于甜菜 LCC 估计;因此,为选择用于 LCC 估计的适当光谱特征提供了有价值的理论指导。