College of Mechanical and Electrical Engineering, Sichuan Agricultural University, Ya'an 625014, China.
College of Food Science, Sichuan Agricultural University, Ya'an 625014, China.
Spectrochim Acta A Mol Biomol Spectrosc. 2024 Dec 5;322:124816. doi: 10.1016/j.saa.2024.124816. Epub 2024 Jul 17.
The variety and quality of corn seeds are crucial factors affecting crop yield and farmers' economic benefits. This study adopts an innovative method based on a hyperspectral imaging system combined with stacked ensemble learning, aiming to achieve varieties classification and mildew detection of sweet-waxy corn seeds. First, data interference is eliminated by extracting the spectral and texture information of each corn sample and preprocessing the data. Secondly, a stacked ensemble learning model (Stack) was constructed by stacking base models and meta-models. Its results were compared with those of the base models, including Gradient Boosting Decision Tree (GBDT), Extreme Gradient Boosting (XGBoost), Light Gradient Boosting Machine (LightGBM), and Random Forest (RF).Finally, the overall performance of the model is improved through the information fusion strategy of hyperspectral data and texture information. The research results indicate that the GBDT-Stack model, which integrates spectral and texture data, demonstrated optimal performance in the comprehensive classification of both corn seed varieties and mold detection. On the test set, the model achieved an average prediction accuracy of 97.01%. Specifically, the model achieved a test set accuracy ranging from 94.49% to 97.58% for different corn seed varieties and a test set accuracy of 98.89% for mildew detection. This model not only classifies corn seed varieties but also accurately detects mildew, demonstrating its wide applicability. The method has huge potential and is of great significance for improving crop yield and quality.
玉米种子的种类和质量是影响作物产量和农民经济效益的关键因素。本研究采用基于高光谱成像系统结合堆叠集成学习的创新方法,旨在实现甜糯玉米种子的品种分类和霉变检测。首先,通过提取每个玉米样本的光谱和纹理信息并对数据进行预处理,消除数据干扰。其次,通过堆叠基础模型和元模型构建堆叠集成学习模型(Stack)。并将其结果与基础模型(包括梯度提升决策树(GBDT)、极端梯度提升(XGBoost)、轻梯度提升机(LightGBM)和随机森林(RF))进行比较。最后,通过高光谱数据和纹理信息的信息融合策略来提高模型的整体性能。研究结果表明,集成光谱和纹理数据的 GBDT-Stack 模型在玉米种子品种综合分类和霉变检测方面表现出最佳性能。在测试集上,该模型的平均预测准确率达到 97.01%。具体来说,该模型对不同玉米种子品种的测试集准确率范围为 94.49%至 97.58%,对霉变的测试集准确率为 98.89%。该模型不仅可以对玉米种子品种进行分类,还可以准确地检测霉变,具有广泛的适用性。该方法具有巨大的潜力,对提高作物产量和质量具有重要意义。