Zhou Xuan, Wu Xiaohong, Cao Zhihang, Wu Bin
School of Electrical and Information Engineering, Jiangsu University, Zhenjiang 212013, China.
High-Tech Key Laboratory of Agricultural Equipment and Intelligence of Jiangsu Province, Jiangsu University, Zhenjiang 212013, China.
Foods. 2025 May 1;14(9):1601. doi: 10.3390/foods14091601.
Lettuce is a kind of nutritious leafy vegetable. The lettuce storage time has a significant impact on its nutrition and taste. Therefore, to classify lettuce samples with different storage times accurately and non-destructively, this study built classification models by combining several feature extraction methods and categorical boosting (CatBoost). Firstly, the near-infrared (NIR) spectral data of lettuce samples were collected using a NIR spectrometer, and then they were preprocessed using six preprocessing methods. Next, feature extraction was carried out on the spectral data using approximate linear discriminant analysis (ALDA), common-vector linear discriminant analysis (CLDA), maximum-uncertainty linear discriminant analysis (MLDA), and null-space linear discriminant analysis (NLDA). These four feature extraction methods can solve the problem of small sample sizes. Finally, the classification was achieved using classification and regression trees (CARTs) and CatBoost, respectively. The experimental results showed that the classification accuracy of NLDA combined with CatBoost could reach 97.67%. Therefore, the combination of feature extraction methods (NLDA) and CatBoost using NIR spectroscopy is an effective way to classify lettuce storage time.
生菜是一种营养丰富的叶类蔬菜。生菜的储存时间对其营养和口感有显著影响。因此,为了准确、无损地对不同储存时间的生菜样本进行分类,本研究通过结合多种特征提取方法和分类提升(CatBoost)构建了分类模型。首先,使用近红外光谱仪收集生菜样本的近红外(NIR)光谱数据,然后使用六种预处理方法对其进行预处理。接下来,使用近似线性判别分析(ALDA)、公共向量线性判别分析(CLDA)、最大不确定性线性判别分析(MLDA)和零空间线性判别分析(NLDA)对光谱数据进行特征提取。这四种特征提取方法可以解决小样本量的问题。最后,分别使用分类与回归树(CART)和CatBoost进行分类。实验结果表明,NLDA与CatBoost相结合的分类准确率可达97.67%。因此,利用近红外光谱技术将特征提取方法(NLDA)与CatBoost相结合是对生菜储存时间进行分类的有效方法。