Beijing Zhongyan Tongrentang Medicine R&D Co.Ltd, Beijing, 100079, People's Republic of China.
School of Pharmacy, Lanzhou University, Lanzhou, Gansu, 730000, People's Republic of China.
Sci Rep. 2022 Nov 9;12(1):19120. doi: 10.1038/s41598-022-23857-8.
Codonopsis Radix (CR) is an edible food and traditional Chinese herb medicine in China. Various varieties of Codonopsis Radix have different tastes. To make the flavor of processed food stable, two kinds of electronic sensory devices, electronic nose and electronic tongue, were used to establish a discrimination model to identify the botanical origin of each sample. The optimal model built on the 88 batches of samples was selected from the models trained with all combination of two pretreatment methods and three classification methods. A comparison were performed on the models trained on the data collected by electronic nose and electronic tongue. The results showed that the model trained on the fused dataset outperformed the models trained separately on the electronic nose data and electronic tongue data. The two preprocessing approaches could improve the prediction performance of all classification methods. Classification and Regression Tree approach performed better than Partial Least Square Discriminant Analysis and Linear Discriminant Analysis in terms of accuracy. But Classification and Regression Tree tends to assign the samples of minority class to the majority class. Meanwhile, Partial Least Square Discriminant Analysis keeps a good balance between the identification requirements of all the two groups of samples. Taking all the results above, the model built using the Partial Least Square Discriminant Analysis method on the fused data after z-score was used to identify the botanical origin of Codonopsis Radix.
党参(CR)是中国的一种食用食品和传统中药。党参的不同品种具有不同的味道。为了使加工食品的风味稳定,使用两种电子感官设备,电子鼻和电子舌,建立了一个鉴别模型来识别每个样品的植物来源。从使用两种预处理方法和三种分类方法的所有组合训练的模型中选择了 88 批样品的最佳模型。对电子鼻和电子舌采集的数据进行了模型训练的比较。结果表明,在融合数据集上训练的模型优于在电子鼻数据和电子舌数据上分别训练的模型。两种预处理方法可以提高所有分类方法的预测性能。在准确性方面,分类回归树方法优于偏最小二乘判别分析和线性判别分析。但分类回归树倾向于将少数类的样本分配到多数类。同时,偏最小二乘判别分析在两组样本的识别要求之间保持良好的平衡。综合以上所有结果,使用 z 分数后融合数据的偏最小二乘判别分析方法建立的模型用于识别党参的植物来源。