Yang Hai-qing, He Yong, Chen Yong-ming, Lin Ping, Wu Di
College of Biosystems Engineering and Food Science, Zhejiang University, Hangzhou 310029, China.
Guang Pu Xue Yu Guang Pu Fen Xi. 2008 Jun;28(6):1232-6.
The potential of visible/near infrared absorbance spectroscopy as a way for the nondestructive discrimination of various fragrant mushrooms was evaluated. First, the spectral data ranging from 375 to 1025 nm were analyzed by principal component analysis (PCA) for data compression and space clustering. The resulting accumulative credibility of 94.37% based on the first three principle components (PCs) was achieved. This signifies that it is possible to establish a model for the sample discrimination in three dimensional space. Then, a new method in which space division planes were established based on the 3-D PC score plot was proposed. Due to the irregular sample distribution, the division planes for sample discrimination were established through genetic algorithm (GA). The fitness function was evaluated based on the number of the samples that have wrong sign by the division plane function. The goal is to achieve the minimum of the fitness function. Various parameters were predetermined, including population size, selection method, crossover rate, mutation rate and iteration number. Three plane functions were conducted as the model for sample discrimination. In order to evaluate the prediction performance of the new model, another model based on PCA and 3-layer BP-ANN was created and brought into comparison. The three PCs were adopted as the input of the BP-ANN. The number of the neurons in the middle layer was optimized based on the calibration error. The output layer was encoded in binary number. In the test, a total of 195 samples were examined, in which 150 samples were selected randomly for model building and the other 45 for model prediction. Both models adopted the same calibration set and prediction set. The result indicated that the two models established by different methods had similar capability of sorting the same samples out of others. Both models featured more than 91% of sample recognition rate. It can be concluded that while BP-ANN tends to solve high-dimension data analysis, the new method proves reliable and practicable in the three dimensional space so that it could serve as an approach to machine recognition of fragrant mushrooms with various origins.
评估了可见/近红外吸收光谱法作为无损鉴别各种香菇的一种方法的潜力。首先,通过主成分分析(PCA)对375至1025nm范围内的光谱数据进行分析,以进行数据压缩和空间聚类。基于前三个主成分(PC)实现了94.37%的累积可信度。这表明有可能在三维空间中建立样本鉴别模型。然后,提出了一种基于三维PC得分图建立空间划分平面的新方法。由于样本分布不规则,通过遗传算法(GA)建立样本鉴别的划分平面。基于划分平面函数对符号错误的样本数量评估适应度函数。目标是使适应度函数最小化。预先确定了各种参数,包括种群大小、选择方法、交叉率、变异率和迭代次数。进行了三个平面函数作为样本鉴别模型。为了评估新模型的预测性能,创建了另一个基于PCA和三层BP-ANN的模型并进行比较。采用三个PC作为BP-ANN的输入。基于校准误差对中间层的神经元数量进行了优化。输出层采用二进制编码。在测试中,共检查了195个样本,其中随机选择150个样本进行模型构建,另外45个用于模型预测。两个模型采用相同的校准集和预测集。结果表明,通过不同方法建立的两个模型在将相同样本与其他样本区分开来方面具有相似的能力。两个模型的样本识别率均超过91%。可以得出结论,虽然BP-ANN倾向于解决高维数据分析问题,但新方法在三维空间中证明是可靠且可行的,因此可以作为一种对各种来源香菇进行机器识别的方法。