Huang Peng, Yuan Jinfu, Yang Pan, Xiao Futong, Zhao Yongpeng
College of Mechanical and Electrical Engineering, Sichuan Agriculture University, Ya'an 625014, China.
Foods. 2024 Apr 25;13(9):1320. doi: 10.3390/foods13091320.
Sunflower is an important crop, and the vitality and moisture content of sunflower seeds have an important influence on the sunflower's planting and yield. By employing hyperspectral technology, the spectral characteristics of sunflower seeds within the wavelength range of 384-1034 nm were carefully analyzed with the aim of achieving effective prediction of seed vitality and moisture content. Firstly, the original hyperspectral data were subjected to preprocessing techniques such as Savitzky-Golay smoothing, standard normal variable correction (SNV), and multiplicative scatter correction (MSC) to effectively reduce noise interference, ensuring the accuracy and reliability of the data. Subsequently, principal component analysis (PCA), extreme gradient boosting (XGBoost), and stacked autoencoders (SAE) were utilized to extract key feature bands, enhancing the interpretability and predictive performance of the data. During the modeling phase, random forests (RFs) and LightGBM algorithms were separately employed to construct classification models for seed vitality and prediction models for moisture content. The experimental results demonstrated that the SG-SAE-LightGBM model exhibited outstanding performance in the classification task of sunflower seed vitality, achieving an accuracy rate of 98.65%. Meanwhile, the SNV-XGBoost-LightGBM model showed remarkable achievement in moisture content prediction, with a coefficient of determination (R2) of 0.9715 and root mean square error (RMSE) of 0.8349. In conclusion, this study confirms that the fusion of hyperspectral technology and multivariate data analysis algorithms enables the accurate and rapid assessment of sunflower seed vitality and moisture content, providing robust tools and theoretical support for seed quality evaluation and agricultural production practices. Furthermore, this research not only expands the application of hyperspectral technology in unraveling the intrinsic vitality characteristics of sunflower seeds but also possesses significant theoretical and practical value.
向日葵是一种重要的作物,向日葵种子的活力和水分含量对向日葵的种植和产量有着重要影响。通过采用高光谱技术,仔细分析了波长范围在384 - 1034 nm内的向日葵种子的光谱特征,旨在实现对种子活力和水分含量的有效预测。首先,对原始高光谱数据进行Savitzky - Golay平滑、标准正态变量校正(SNV)和多元散射校正(MSC)等预处理技术,以有效降低噪声干扰,确保数据的准确性和可靠性。随后,利用主成分分析(PCA)、极端梯度提升(XGBoost)和堆叠自编码器(SAE)提取关键特征波段,增强数据的可解释性和预测性能。在建模阶段,分别采用随机森林(RFs)和LightGBM算法构建种子活力分类模型和水分含量预测模型。实验结果表明,SG - SAE - LightGBM模型在向日葵种子活力分类任务中表现出色,准确率达到98.65%。同时,SNV - XGBoost - LightGBM模型在水分含量预测方面取得显著成果,决定系数(R2)为0.9715,均方根误差(RMSE)为0.8349。总之,本研究证实高光谱技术与多元数据分析算法的融合能够准确、快速地评估向日葵种子的活力和水分含量,为种子质量评价和农业生产实践提供了有力工具和理论支持。此外,本研究不仅拓展了高光谱技术在揭示向日葵种子内在活力特征方面的应用,还具有重要的理论和实践价值。