Zhou Lei, Zhang Chu, Taha Mohamed Farag, Wei Xinhua, He Yong, Qiu Zhengjun, Liu Yufei
College of Biosystems Engineering and Food Science, Zhejiang University, Hangzhou, China.
Key Laboratory of Spectroscopy Sensing, Ministry of Agriculture and Rural Affairs, Hangzhou, China.
Front Plant Sci. 2020 Nov 10;11:575810. doi: 10.3389/fpls.2020.575810. eCollection 2020.
Near-infrared (NIR) hyperspectroscopy becomes an emerging nondestructive sensing technology for inspection of crop seeds. A large spectral dataset of more than 140,000 wheat kernels in 30 varieties was prepared for classification. Feature selection is a critical segment in large spectral data analysis. A novel convolutional neural network-based feature selector (CNN-FS) was proposed to screen out deeply target-related spectral channels. A convolutional neural network with attention (CNN-ATT) framework was designed for one-dimension data classification. Popular machine learning models including support vector machine (SVM) and partial least square discrimination analysis were used as the benchmark classifiers. Features selected by conventional feature selection algorithms were considered for comparison. Results showed that the designed CNN-ATT produced a higher performance than the compared classifier. The proposed CNN-FS found a subset of features, which made a better representation of raw dataset than conventional selectors did. The CNN-ATT achieved an accuracy of 93.01% using the full spectra and keep its high precision (90.20%) by training on the 60-channel features obtained via the CNN-FS method. The proposed methods have great potential for handling the analyzing tasks on other large spectral datasets. The proposed feature selection structure can be extended to design other new model-based selectors. The combination of NIR hyperspectroscopic technology and the proposed models has great potential for automatic nondestructive classification of single wheat kernels.
近红外(NIR)高光谱成像技术成为一种新兴的用于农作物种子检测的无损传感技术。我们准备了一个包含30个品种、超过140000粒小麦籽粒的大型光谱数据集用于分类。特征选择是大型光谱数据分析中的关键环节。我们提出了一种基于卷积神经网络的新型特征选择器(CNN-FS),以筛选出与目标高度相关的光谱通道。我们设计了一种带有注意力机制的卷积神经网络(CNN-ATT)框架用于一维数据分类。包括支持向量机(SVM)和偏最小二乘判别分析在内的常用机器学习模型被用作基准分类器。我们还考虑了用传统特征选择算法选择的特征进行比较。结果表明,设计的CNN-ATT比所比较的分类器具有更高的性能。所提出的CNN-FS找到了一个特征子集,该子集比传统选择器能更好地表示原始数据集。CNN-ATT使用全光谱时的准确率达到了93.01%,并且通过在经CNN-FS方法获得的60通道特征上进行训练,保持了其高精度(90.20%)。所提出的方法在处理其他大型光谱数据集的分析任务方面具有巨大潜力。所提出的特征选择结构可以扩展用于设计其他基于新模型的选择器。近红外高光谱技术与所提出的模型相结合,在单粒小麦的自动无损分类方面具有巨大潜力。