Li Yanru, Yang Keming, Wu Bing
College of Geoscience and Surveying Engineering, China University of Mining and Technology, Beijing, China.
Appl Spectrosc. 2025 Feb;79(2):306-319. doi: 10.1177/00037028241279328. Epub 2024 Sep 23.
This study aims to identify different types of stress on maize leaves using feature selection and spectral index methods. Spectral data were collected from leaves under heavy metal, water, fertilizer stress, as well as under normal healthy conditions. Preprocessing steps such as continuum removal (CR), standard normal variable (SNV) transformation, multiple scattering correction (MSC), detrend correction (DT), and first-order derivative (FOD) were applied to the raw spectra. Various feature selection methods including ReliefF, chi-square test, recursive feature elimination (FRE), mutual information (MI), random forest (RF), and gradient boosting tree (GBT) were employed to determine the importance scores of different spectral bands, thus identifying sensitive spectral features capable of distinguishing various stress types. Spectral indices for stress type differentiation were constructed using label correlation method. Classification models were built using support vector machine (SVM), K-nearest neighbors (KNN), Gaussian naive Bayes (GNB), extreme gradient boosting (XGBoost), RF, and adaptive boosting (AdaBoost) algorithms. Results indicate that the characteristic spectral bands for differentiating stress types are primarily distributed around the red edge (near 700-800 nm) and water absorption valley (near 1900 nm). Spectral indices constructed using combinations of spectral bands around the near-infrared plateau absorption valley (near 1185 nm) and water absorption valley (near 1460 nm) effectively differentiate maize stress types. Among the modeling classification algorithms, RF and AdaBoost algorithms exhibited optimal performance, demonstrating high classification accuracy on both training and validation sets. These findings hold promise for providing new technical support for maize stress monitoring and diagnosis in agricultural production.
本研究旨在利用特征选择和光谱指数方法识别玉米叶片上不同类型的胁迫。光谱数据是在重金属、水分、肥料胁迫以及正常健康条件下从叶片收集的。对原始光谱应用了诸如连续统去除(CR)、标准正态变量(SNV)变换、多元散射校正(MSC)、去趋势校正(DT)和一阶导数(FOD)等预处理步骤。采用了各种特征选择方法,包括ReliefF、卡方检验、递归特征消除(FRE)、互信息(MI)、随机森林(RF)和梯度提升树(GBT),以确定不同光谱波段的重要性得分,从而识别能够区分各种胁迫类型的敏感光谱特征。使用标签相关方法构建了用于胁迫类型区分的光谱指数。使用支持向量机(SVM)、K近邻(KNN)、高斯朴素贝叶斯(GNB)、极端梯度提升(XGBoost)、RF和自适应提升(AdaBoost)算法建立了分类模型。结果表明,用于区分胁迫类型的特征光谱波段主要分布在红边(700 - 800 nm附近)和吸水谷(1900 nm附近)周围。利用近红外高原吸收谷(1185 nm附近)和吸水谷(1460 nm附近)周围的光谱波段组合构建的光谱指数能够有效区分玉米的胁迫类型。在建模分类算法中,RF和AdaBoost算法表现出最佳性能,在训练集和验证集上均显示出较高的分类准确率。这些发现有望为农业生产中玉米胁迫监测和诊断提供新的技术支持。