Biological Sciences, Chungnam National University, 99 Daehagro, Youseong, Daejon, 34134, Korea.
Gene Engineering Division, National Institute of Agricultural Sciences, 370 Nongsaengmyeongro, Jeonju, Jeollabuk-do, 54874, Korea.
Plant Cell Rep. 2024 Jun 9;43(7):164. doi: 10.1007/s00299-024-03249-0.
Hyperspectral features enable accurate classification of soybean seeds using linear discriminant analysis and GWAS for novel seed trait genes. Evaluating crop seed traits such as size, shape, and color is crucial for assessing seed quality and improving agricultural productivity. The introduction of the SUnSet toolbox, which employs hyperspectral sensor-derived image analysis, addresses this necessity. In a validation test involving 420 seed accessions from the Korean Soybean Core Collections, the pixel purity index algorithm identified seed- specific hyperspectral endmembers to facilitate segmentation. Various metrics extracted from ventral and lateral side images facilitated the categorization of seeds into three size groups and four shape groups. Additionally, quantitative RGB triplets representing seven seed coat colors, averaged reflectance spectra, and pigment indices were acquired. Machine learning models, trained on a dataset comprising 420 accession seeds and 199 predictors encompassing seed size, shape, and reflectance spectra, achieved accuracy rates of 95.8% for linear discriminant analysis model. Furthermore, a genome-wide association study utilizing hyperspectral features uncovered associations between seed traits and genes governing seed pigmentation and shapes. This comprehensive approach underscores the effectiveness of SUnSet in advancing precision agriculture through meticulous seed trait analysis.
高光谱特征可通过线性判别分析和 GWAS 对大豆种子进行准确分类,为新型种子性状基因提供支持。评估作物种子的大小、形状和颜色等特征对于评估种子质量和提高农业生产力至关重要。SUnSet 工具盒的引入满足了这一需求,它采用高光谱传感器衍生的图像分析。在一项涉及 420 份韩国大豆核心收集品系种子的验证试验中,像素纯度指数算法确定了种子特异性高光谱端元,以促进分割。从腹侧和侧部图像中提取的各种度量标准有助于将种子分为三个大小组和四个形状组。此外,还获得了代表七个种皮颜色的定量 RGB 三元组、平均反射光谱和色素指数。基于包含 420 个品系种子和 199 个涵盖种子大小、形状和反射光谱的预测因子的数据集训练的机器学习模型,线性判别分析模型的准确率达到 95.8%。此外,利用高光谱特征进行的全基因组关联研究揭示了种子性状与控制种子色素沉着和形状的基因之间的关联。这种综合方法强调了 SUnSet 通过细致的种子特征分析在推进精准农业方面的有效性。