Yang Yongqin, Harrison Rashaun Candace, Zhang Dun, Shen Binghui, Yan Yanlu, Kang Dingming
Ministry of Education of the People's Republic of China (MOE) Key Laboratory of Crop Heterosis and Utilization, College of Agronomy and Biotechnology, China Agricultural University, Beijing, China.
Department of Applied Physics, College of Science, China Agricultural University, Beijing, China.
Front Plant Sci. 2024 Feb 29;15:1361328. doi: 10.3389/fpls.2024.1361328. eCollection 2024.
NIR spectroscopy combined with chemometric algorithms has been widely used for seed authenticity detection. However, the study of seed genetic distance, an internal feature that affects the discriminative performance of classification models, has rarely been reported.
Therefore, maize seed samples of different genotypes were selected to investigate the effect of genetic distance on the authenticity of single seeds detected by NIR spectroscopy. Firstly, the Support vector machine (SVM) model was established using spectral information combined with a preprocessing algorithm, and then the DNA of the samples was extracted to study the correlation between genetic and relative spectral distances, the model identification performance, and finally to compare the similarities and differences between the results of genetic clustering and relative spectral clustering.
The results were as follows: the average accuracy of the models was 93.6% (inbred lines) and 93.7% (hybrids), respectively; Genetic distance and correlation spectral distance exhibited positive correlation significantly (inbred lines: r=0.177, <0.05; hybrids: r=0.238, p<0.05), likewise genetic distance and model accuracy also showed positive correlation (inbred lines: r=0.611, p<0.01; hybrids: r=0.6158, <0.01); Genetic clustering and spectral clustering results were essentially uniform for 94.3% (inbred lines) and 93.9% (hybrids), respectively.
This study reveals the relationship between the genetic and relative spectral distances of seeds and the accuracy of the model, which provides theoretical basis for the establishment of the standardized system for detecting the authenticity of seeds by NIR spectroscopic techniques.
近红外光谱结合化学计量算法已广泛应用于种子真实性检测。然而,作为影响分类模型判别性能的内部特征——种子遗传距离的研究却鲜有报道。
因此,选取不同基因型的玉米种子样本,研究遗传距离对近红外光谱检测单粒种子真实性的影响。首先,结合预处理算法利用光谱信息建立支持向量机(SVM)模型,然后提取样本DNA,研究遗传距离与相对光谱距离、模型识别性能之间的相关性,最后比较遗传聚类和相对光谱聚类结果的异同。
结果如下:模型的平均准确率分别为93.6%(自交系)和93.7%(杂交种);遗传距离与相关光谱距离呈显著正相关(自交系:r = 0.177,p < 0.05;杂交种:r = 0.238,p < 0.05),同样遗传距离与模型准确率也呈正相关(自交系:r = 0.611,p < 0.01;杂交种:r = 0.6158,p < 0.01);遗传聚类和光谱聚类结果在自交系中分别有94.3%、杂交种中有93.9%基本一致。
本研究揭示了种子遗传距离与相对光谱距离以及模型准确率之间的关系,为建立近红外光谱技术检测种子真实性的标准化体系提供了理论依据。