Tu Keling, Wen Shaozhe, Cheng Ying, Xu Yanan, Pan Tong, Hou Haonan, Gu Riliang, Wang Jianhua, Wang Fengge, Sun Qun
Department of Plant Genetics & Breeding and Seed Science, College of Agronomy and Biotechnology, Ministry of Agriculture and Rural Affairs/Beijing Key Laboratory of Crop Genetic Improvement, China Agricultural University/The Innovation Center (Beijing) of Crop Seeds Whole-Process Technology Research, Beijing, 100193, People's Republic of China.
Beijing Key Laboratory of Vegetable Germplasm Improvement, Beijing Vegetable Research Center, Beijing Academy of Agriculture and Forestry Sciences (BAAFS), Beijing, 100097, People's Republic of China.
Plant Methods. 2022 Jun 11;18(1):81. doi: 10.1186/s13007-022-00918-7.
Variety genuineness and purity are essential indices of maize seed quality that affect yield. However, detection methods for variety genuineness are time-consuming, expensive, require extensive training, or destroy the seeds in the process. Here, we present an accurate, high-throughput, cost-effective, and non-destructive method for screening variety genuineness that uses seed phenotype data with machine learning to distinguish between genetically and phenotypically similar seed varieties. Specifically, we obtained image data of seed morphology and hyperspectral reflectance for Jingke 968 and nine other closely-related varieties (non-Jingke 968). We then compared the robustness of three common machine learning algorithms in distinguishing these varieties based on the phenotypic imaging data.
Our results showed that hyperspectral imaging (HSI) combined with a multilayer perceptron (MLP) or support vector machine (SVM) model could distinguish Jingke 968 from varieties that differed by as few as two loci, with a 99% or higher accuracy, while machine vision imaging provided ~ 90% accuracy. Through model validation and updating with varieties not included in the training data, we developed a genuineness detection model for Jingke 968 that effectively discriminated between genetically similar and distant varieties.
This strategy has potential for wide adoption in large-scale variety genuineness detection operations for internal quality control or governmental regulatory agencies, or for accelerating the breeding of new varieties. Besides, it could easily be extended to other target varieties and other crops.
品种真实性和纯度是影响玉米产量的种子质量的重要指标。然而,品种真实性的检测方法耗时、昂贵,需要大量培训,或者在检测过程中会破坏种子。在此,我们提出一种准确、高通量、经济高效且无损的品种真实性筛选方法,该方法利用种子表型数据和机器学习来区分遗传和表型相似的种子品种。具体而言,我们获取了京科968及其他九个密切相关品种(非京科968)的种子形态和高光谱反射率图像数据。然后,我们基于表型成像数据比较了三种常见机器学习算法在区分这些品种时的稳健性。
我们的结果表明,高光谱成像(HSI)结合多层感知器(MLP)或支持向量机(SVM)模型能够将京科968与仅有两个基因座差异的品种区分开来,准确率达到99%或更高,而机器视觉成像的准确率约为90%。通过使用未包含在训练数据中的品种进行模型验证和更新,我们开发了一个京科968品种真实性检测模型,该模型能够有效地区分遗传相似和差异较大的品种。
该策略有潜力在大规模品种真实性检测操作中广泛应用,用于内部质量控制或政府监管机构,或用于加速新品种的培育。此外,它可以很容易地扩展到其他目标品种和其他作物。