College of Electronics Engineering, Heilongjiang University, Harbin, Heilongjiang 150080, China.
Institute of Crop Science, Chinese Academy of Agricultural Sciences, Beijing 100081, China.
Spectrochim Acta A Mol Biomol Spectrosc. 2025 Feb 5;326:125212. doi: 10.1016/j.saa.2024.125212. Epub 2024 Sep 24.
Accurate, fast and non-destructive identification of varieties of common bean seeds is important for the cultivation and efficient utilization of common beans. This study is based on hyperspectral and deep learning to identify the varieties of common bean seeds non-destructively. In this study, the average spectrum of 3078 hyperspectral images from 500 varieties was obtained after image segmentation and sensitive region extraction, and the Synthetic Minority Over-sampling Technique (SMOTE) was used to achieve the equilibrium of the samples of various varieties. A one-dimensional convolutional neural network model (IResCNN) incorporating Inception module and residual structure was proposed to identify seed varieties, and Support Vector Machine (SVM), K-Nearest Neighbor (KNN), VGG19, AlexNet, ResNet50 were established to compare the identification effect. After analyzing the effects of multiple spectral preprocessing methods on the model, the study selected Savitzky-Golay smoothing correction (SG) for spectral preprocessing and extracted 66 characteristic wavelengths using Successive Projections Algorithm (SPA) as inputs to the discriminative model. Ultimately, the IResCNN model achieved the highest accuracy of 93.06 % on the test set, indicating that hyperspectral technology can accurately identify bean varieties, and the study provides a correct method of thinking for the non-destructive classification of multi-species small-sample bean varieties.
准确、快速、无损地识别普通豆种是普通豆种栽培和高效利用的重要前提。本研究基于高光谱和深度学习,实现了对普通豆种的无损品种识别。本研究通过图像分割和敏感区域提取,获得了 500 个品种 3078 张高光谱图像的平均光谱,采用过采样技术(SMOTE)实现了各品种样本的均衡。提出了一种结合 Inception 模块和残差结构的一维卷积神经网络模型(IResCNN)来识别种子品种,并建立了支持向量机(SVM)、K-最近邻(KNN)、VGG19、AlexNet、ResNet50 模型进行识别效果比较。在分析了多种光谱预处理方法对模型的影响后,本研究选择 Savitzky-Golay 平滑校正(SG)进行光谱预处理,并采用连续投影算法(SPA)提取 66 个特征波长作为判别模型的输入。最终,IResCNN 模型在测试集上达到了 93.06%的最高准确率,表明高光谱技术可以准确识别豆种,为多品种小样本豆种的无损分类提供了正确的思路。