Sleper Joshua, Tapia Ronald, Lee Seonghee, Whitaker Vance
Plant Breeding Graduate Program, Horticultural Sciences Department, University of Florida, IFAS Gulf Coast Research and Education Center, Wimauma, Florida, USA.
Horticultural Sciences Department, IFAS Citrus Research and Education Center, University of Florida, Lake Alfred, Florida, USA.
Plant Genome. 2025 Mar;18(1):e20550. doi: 10.1002/tpg2.20550.
Genomic selection is a widely used quantitative method of determining the genetic value of an individual from genomic information and phenotypic data. In this study, we used a large, multi-year training population of 3248 individuals from the University of Florida strawberry (Fragaria × ananassa Duchesne) breeding program. We coupled this training population with a test population of 1460 individuals derived from 20 biparental families. Using these two populations, we tested different genomic selection methods of predicting each family separately for the purpose of within-family selection of seedlings for multiple yield-related traits in strawberry. The methodology we considered were comprised of 11 different marker densities, 10 different training set sizes, four different training set composition techniques, and one to five clonal replications for each individual in the training population. We demonstrated that prediction accuracy varied among the 20 biparental families from 0.05 to 0.63 for the three traits investigated. We also showed that a medium-density genotyping strategy (1500-1650 single nucleotide polymorphisms) could be 95%-97% as effective as a high-density genotyping platform and that imputation to the more dense platform always improved accuracy. Training set composition techniques had no discernible effect on prediction accuracy. However, increasing training set size improved prediction accuracy, and accuracy did not plateau even when training sets exceeded 3000 individuals. Finally, we showed that the number of clonal replicates in field trials could be reduced by 80% without any negative effects on genomic selection accuracy.
基因组选择是一种广泛应用的定量方法,用于根据基因组信息和表型数据确定个体的遗传价值。在本研究中,我们使用了来自佛罗里达大学草莓(Fragaria × ananassa Duchesne)育种项目的3248个个体组成的大型多年训练群体。我们将这个训练群体与来自20个双亲家庭的1460个个体组成的测试群体相结合。利用这两个群体,我们测试了不同的基因组选择方法,以便分别预测每个家庭,目的是在草莓多个产量相关性状的幼苗家庭内选择。我们考虑的方法包括11种不同的标记密度、10种不同的训练集大小、4种不同的训练集组成技术,以及训练群体中每个个体1至5次克隆重复。我们证明,在所研究的三个性状上,20个双亲家庭的预测准确性在0.05至0.63之间变化。我们还表明,中等密度基因分型策略(1500 - 1650个单核苷酸多态性)的有效性可达高密度基因分型平台的95% - 97%,并且向更密集平台的插补总是能提高准确性。训练集组成技术对预测准确性没有明显影响。然而,增加训练集大小可提高预测准确性,即使训练集超过3000个个体,准确性也没有达到平稳状态。最后,我们表明田间试验中的克隆重复数量可以减少80%,而不会对基因组选择准确性产生任何负面影响。