Kim Geon Woo, Hong Ju-Pyo, Lee Hea-Young, Kwon Jin-Kyung, Kim Dong-Am, Kang Byoung-Cheorl
Department of Agriculture, Forestry and Bioresources, Research Institute of Agriculture and Life Sciences, Plant Genomics Breeding Institute, College of Agriculture and Life Sciences, Seoul National University, Seoul 08826, Republic of Korea.
R&D Center, Hana Seed Co., Ltd., Anseong 17601, Republic of Korea.
Hortic Res. 2022 Sep 13;9:uhac204. doi: 10.1093/hr/uhac204. eCollection 2022.
Capsaicinoids provide chili peppers ( spp.) with their characteristic pungency. Several structural and transcription factor genes are known to control capsaicinoid contents in pepper. However, many other genes also regulating capsaicinoid contents remain unknown, making it difficult to develop pepper cultivars with different levels of capsaicinoids. Genomic selection (GS) uses genome-wide random markers (including many in undiscovered genes) for a trait to improve selection efficiency. In this study, we predicted the capsaicinoid contents of pepper breeding lines using several GS models trained with genotypic and phenotypic data from a training population. We used a core collection of 351 accessions and 96 breeding lines as training and testing populations, respectively. To obtain the optimal number of single nucleotide polymorphism (SNP) markers for GS, we tested various numbers of genome-wide SNP markers based on linkage disequilibrium. We obtained the highest mean prediction accuracy (0.550) for different models using 3294 SNP markers. Using this marker set, we conducted GWAS and selected 25 markers that were associated with capsaicinoid biosynthesis genes and quantitative trait loci for capsaicinoid contents. Finally, to develop more accurate prediction models, we obtained SNP markers from GWAS as fixed-effect markers for GS, where 3294 genome-wide SNPs were employed. When four to five fixed-effect markers from GWAS were used as fixed effects, the RKHS and RR-BLUP models showed accuracies of 0.696 and 0.689, respectively. Our results lay the foundation for developing pepper cultivars with various capsaicinoid levels using GS for capsaicinoid contents.
辣椒素类物质赋予辣椒(辣椒属)其特有的辛辣味。已知有几个结构基因和转录因子基因控制辣椒中的辣椒素类物质含量。然而,许多其他调控辣椒素类物质含量的基因仍不为人知,这使得培育具有不同辣椒素类物质含量水平的辣椒品种变得困难。基因组选择(GS)使用全基因组随机标记(包括许多未知基因中的标记)来针对某一性状提高选择效率。在本研究中,我们使用几个基于训练群体的基因型和表型数据训练的GS模型,预测了辣椒育种系的辣椒素类物质含量。我们分别使用351份种质的核心收集品系和96个育种系作为训练群体和测试群体。为了获得用于GS的单核苷酸多态性(SNP)标记的最佳数量,我们基于连锁不平衡测试了不同数量的全基因组SNP标记。使用3294个SNP标记,我们在不同模型中获得了最高的平均预测准确性(0.550)。使用这个标记集,我们进行了全基因组关联研究(GWAS),并选择了25个与辣椒素生物合成基因和辣椒素类物质含量的数量性状位点相关的标记。最后,为了开发更准确的预测模型,我们从GWAS中获得SNP标记作为GS的固定效应标记,其中使用了3294个全基因组SNP。当使用来自GWAS的四到五个固定效应标记作为固定效应时,RKHS和RR - BLUP模型的准确性分别为0.696和0.689。我们的结果为利用GS针对辣椒素类物质含量培育具有不同辣椒素类物质水平的辣椒品种奠定了基础。