Fang Hui, Zhang Zhao, Wang Hai-long, Yang Xiang-dong, He Yong, Bao Yi-dan
Guang Pu Xue Yu Guang Pu Fen Xi. 2017 Mar;37(3):760-5.
Transgenic technology has enormous significance in increasing food production, protecting biodiversity and reducing the use of chemical pesticides and so on. However, there may be some security risks; therefore, research on genetically modified crop identification technology is attracting more and more attention. Mid-infrared spectroscopy combined with feature extraction methods were used to investigate the feasibility of identifying different kinds of transgenic soybeans in the wavelength range of 3 818~734 cm-1. For this purpose, partial least squares-discriminant analysis (PLS-DA) was employed as pattern recognition methods to classify three non-GMO parent soybeans(HC6, JACK and W82)and their transgenic soybeans. The results of the calibration set were 96.67%, 96.67% and 83.33% for three non-GMO parent soybeans and their transgenic soybeans, and the results of the prediction set were 83.33%, 85% and 85%. X-loading weights, variable importance in the projection (VIP) algorithm and second derivative (2-Der) algorithm were applied to select sensitive wavenumbers. The sensitive wavelengths selected with x-loading weights were used to build PLS-DA model, the classification accuracy of the calibration set were 91.11%, 91.67% and 81.67%, and the results of the prediction set were 80%, 80% and 75%. By using the VIP algorithm, the classification accuracy of the calibration set were 94.44%, 95% and 76.67%, and the results of the prediction set were 80%, 85% and 75%. By using the 2-Der algorithm, the classification accuracy of the calibration set were 88.89%, 81.67% and 80%, and the results of the prediction set were 76.67%, 75% and 75%. Principal components analysis (PCA) and independent component analysis (ICA) were applied to extract feature information. The principal components were combined with PLS-DA model. The classification accuracy of the calibration set were 96.67%, 90% and 80%, and the results of the prediction set were 80%, 90% and 80%. The independent components were combined with PLS-DA model. The classification accuracy of the calibration set were 93.33%, 83.33% and 83.33% while the results of the prediction set were 83.33%, 75% and 75%. The overall results indicated that mid-infrared spectroscopy could accurately identify the varieties of the non-GMO parent soybeans, which provided a new idea for nondestructive testing of transgenic soybeans. Feature extraction methods could be used to build more concise models and reduce the amount of program operations combined with sensitive wavenumbers selection methods.
转基因技术在提高粮食产量、保护生物多样性以及减少化学农药使用等方面具有重大意义。然而,可能存在一些安全风险;因此,转基因作物鉴定技术的研究正吸引着越来越多的关注。采用中红外光谱结合特征提取方法,研究在3818~734 cm-1波长范围内鉴别不同种类转基因大豆的可行性。为此,采用偏最小二乘判别分析(PLS-DA)作为模式识别方法,对三种非转基因亲本大豆(HC6、JACK和W82)及其转基因大豆进行分类。校正集对三种非转基因亲本大豆及其转基因大豆的分类结果分别为96.67%、96.67%和83.33%,预测集的结果分别为83.33%、85%和85%。应用X-载荷权重、投影变量重要性(VIP)算法和二阶导数(2-Der)算法来选择敏感波数。用X-载荷权重选择的敏感波长建立PLS-DA模型,校正集的分类准确率分别为91.11%、91.67%和81.67%,预测集的结果分别为80%、80%和75%。采用VIP算法时,校正集的分类准确率分别为94.44%、95%和76.67%,预测集的结果分别为80%、85%和75%。采用2-Der算法时,校正集的分类准确率分别为88.89%、81.67%和80%,预测集的结果分别为76.67%、75%和75%。应用主成分分析(PCA)和独立成分分析(ICA)提取特征信息。将主成分与PLS-DA模型相结合。校正集的分类准确率分别为96.67%、90%和80%,预测集的结果分别为80%、90%和80%。将独立成分与PLS-DA模型相结合。校正集的分类准确率分别为93.33%、83.33%和83.33%,预测集的结果分别为83.33%、75%和75%。总体结果表明,中红外光谱能够准确鉴别非转基因亲本大豆的品种,为转基因大豆的无损检测提供了新思路。特征提取方法结合敏感波数选择方法可用于建立更简洁的模型并减少程序运算量。