Jia Zhicheng, Ou Chengming, Sun Shoujiang, Wang Juan, Liu Jingyu, Sun Ming, Ma Wen, Li Manli, Jia Shangang, Mao Peisheng
College of Grassland Science and Technology, China Agricultural University, Beijing, China.
Front Plant Sci. 2023 Apr 20;14:1170947. doi: 10.3389/fpls.2023.1170947. eCollection 2023.
Advances in optical imaging technology using rapid and non-destructive methods have led to improvements in the efficiency of seed quality detection. Accurately timing the harvest is crucial for maximizing the yield of higher-quality Siberian wild rye seeds by minimizing excessive shattering during harvesting. This research applied integrated optical imaging techniques and machine learning algorithms to develop different models for classifying Siberian wild rye seeds based on different maturity stages and grain positions. The multi-source fusion of morphological, multispectral, and autofluorescence data provided more comprehensive information but also increases the performance requirements of the equipment. Therefore, we employed three filtering algorithms, namely minimal joint mutual information maximization (JMIM), information gain, and Gini impurity, and set up two control methods (feature union and no-filtering) to assess the impact of retaining only 20% of the features on the model performance. Both JMIM and information gain revealed autofluorescence and morphological features (CIELab A, CIELab B, hue and saturation), with these two filtering algorithms showing shorter run times. Furthermore, a strong correlation was observed between shoot length and morphological and autofluorescence spectral features. Machine learning models based on linear discriminant analysis (LDA), random forests (RF) and support vector machines (SVM) showed high performance (>0.78 accuracies) in classifying seeds at different maturity stages. Furthermore, it was found that there was considerable variation in the different grain positions at the maturity stage, and the K-means approach was used to improve the model performance by 5.8%-9.24%. In conclusion, our study demonstrated that feature filtering algorithms combined with machine learning algorithms offer high performance and low cost in identifying seed maturity stages and that the application of k-means techniques for inconsistent maturity improves classification accuracy. Therefore, this technique could be employed classification of seed maturity and superior physiological quality for Siberian wild rye seeds.
利用快速且无损方法的光学成像技术进展,已提高了种子质量检测的效率。准确把握收获时机对于通过尽量减少收获期间的过度落粒来最大化优质西伯利亚披碱草种子的产量至关重要。本研究应用集成光学成像技术和机器学习算法,开发了基于不同成熟阶段和籽粒位置对西伯利亚披碱草种子进行分类的不同模型。形态学、多光谱和自发荧光数据的多源融合提供了更全面的信息,但也提高了设备的性能要求。因此,我们采用了三种滤波算法,即最小联合互信息最大化(JMIM)、信息增益和基尼杂质,并设置了两种控制方法(特征联合和无滤波),以评估仅保留20%的特征对模型性能的影响。JMIM和信息增益都揭示了自发荧光和形态学特征(CIELab A、CIELab B、色调和饱和度),这两种滤波算法的运行时间较短。此外,观察到芽长与形态学和自发荧光光谱特征之间存在很强的相关性。基于线性判别分析(LDA)、随机森林(RF)和支持向量机(SVM)的机器学习模型在对不同成熟阶段的种子进行分类时表现出高性能(准确率>0.78)。此外,发现成熟阶段不同籽粒位置存在相当大的差异,并且使用K均值方法可将模型性能提高5.8% - 9.24%。总之,我们的研究表明,特征滤波算法与机器学习算法相结合在识别种子成熟阶段方面具有高性能和低成本,并且应用K均值技术处理成熟度不一致的情况可提高分类准确率。因此,该技术可用于西伯利亚披碱草种子成熟度和优良生理质量的分类。