Blueberry Breeding and Genomics Lab, Horticultural Sciences Department, University of Florida, Gainesville, FL, 32611, USA.
Heredity (Edinb). 2020 Dec;125(6):437-448. doi: 10.1038/s41437-020-00357-x. Epub 2020 Oct 19.
Blueberry (Vaccinium spp.) is an important autopolyploid crop with significant benefits for human health. Apart from its genetic complexity, the feasibility of genomic prediction has been proven for blueberry, enabling a reduction in the breeding cycle time and increasing genetic gain. However, as for other polyploid crops, sequencing costs still hinder the implementation of genome-based breeding methods for blueberry. This motivated us to evaluate the effect of training population sizes and composition, as well as the impact of marker density and sequencing depth on phenotype prediction for the species. For this, data from a large real breeding population of 1804 individuals were used. Genotypic data from 86,930 markers and three traits with different genetic architecture (fruit firmness, fruit weight, and total yield) were evaluated. Herein, we suggested that marker density, sequencing depth, and training population size can be substantially reduced with no significant impact on model accuracy. Our results can help guide decisions toward resource allocation (e.g., genotyping and phenotyping) in order to maximize prediction accuracy. These findings have the potential to allow for a faster and more accurate release of varieties with a substantial reduction of resources for the application of genomic prediction in blueberry. We anticipate that the benefits and pipeline described in our study can be applied to optimize genomic prediction for other diploid and polyploid species.
蓝莓(Vaccinium spp.)是一种重要的同源多倍体作物,对人类健康有重要意义。除了遗传复杂性之外,蓝莓的基因组预测可行性已经得到证明,这可以减少育种周期时间并增加遗传增益。然而,与其他多倍体作物一样,测序成本仍然阻碍了蓝莓基于基因组的育种方法的实施。这促使我们评估训练群体大小和组成、标记密度和测序深度对该物种表型预测的影响。为此,我们使用了一个包含 1804 个个体的大型真实育种群体的数据。评估了来自 86930 个标记和三个具有不同遗传结构(果实硬度、果实重量和总产量)的性状的基因型数据。在此,我们建议可以大大减少标记密度、测序深度和训练群体大小,而不会对模型准确性产生重大影响。我们的研究结果可以帮助指导资源分配(例如,基因型和表型)决策,以最大程度地提高预测准确性。这些发现有可能允许更快、更准确地发布品种,同时大大减少基因组预测在蓝莓中的应用所需的资源。我们预计,我们研究中描述的优势和流程可以应用于优化其他二倍体和多倍体物种的基因组预测。