Gao Pan, Xu Wei, Yan Tianying, Zhang Chu, Lv Xin, He Yong
College of Information Science and Technology, Shihezi University, Shihezi 832000, China.
Key Laboratory of Oasis Ecology Agriculture, Shihezi University, Shihezi 832003, China.
Foods. 2019 Nov 27;8(12):620. doi: 10.3390/foods8120620.
Narrow-leaved oleaster () fruit is a kind of natural product used as food and traditional medicine. Narrow-leaved oleaster fruits from different geographical origins vary in chemical and physical properties and differ in their nutritional and commercial values. In this study, near-infrared hyperspectral imaging covering the spectral range of 874-1734 nm was used to identify the geographical origins of dry narrow-leaved oleaster fruits with machine learning methods. Average spectra of each single narrow-leaved oleaster fruit were extracted. Second derivative spectra were used to identify effective wavelengths. Partial least squares discriminant analysis (PLS-DA) and support vector machine (SVM) were used to build discriminant models for geographical origin identification using full spectra and effective wavelengths. In addition, deep convolutional neural network (CNN) models were built using full spectra and effective wavelengths. Good classification performances were obtained by these three models using full spectra and effective wavelengths, with classification accuracy of the calibration, validation, and prediction set all over 90%. Models using effective wavelengths obtained close results to models using full spectra. The performances of the PLS-DA, SVM, and CNN models were close. The overall results illustrated that near-infrared hyperspectral imaging coupled with machine learning could be used to trace geographical origins of dry narrow-leaved oleaster fruits.
沙枣果实是一种用作食品和传统药物的天然产物。不同地理来源的沙枣果实在化学和物理性质上存在差异,其营养和商业价值也有所不同。在本研究中,利用覆盖874 - 1734 nm光谱范围的近红外高光谱成像技术,结合机器学习方法来识别干燥沙枣果实的地理来源。提取了每个单颗沙枣果实的平均光谱。利用二阶导数光谱来识别有效波长。采用偏最小二乘判别分析(PLS - DA)和支持向量机(SVM),使用全光谱和有效波长构建地理来源识别判别模型。此外,还使用全光谱和有效波长构建了深度卷积神经网络(CNN)模型。这三种模型使用全光谱和有效波长均获得了良好的分类性能,校正集、验证集和预测集的分类准确率均超过90%。使用有效波长的模型与使用全光谱的模型结果相近。PLS - DA、SVM和CNN模型的性能相近。总体结果表明,近红外高光谱成像结合机器学习可用于追溯干燥沙枣果实的地理来源。