Department of Statistics, Facultad de Agronomía, Universidad de la República, Garzón 780, 12900, Montevideo, Uruguay.
Instituto Nacional de Investigación Agropecuaria, Est. Exp. La Estanzuela, Ruta 50 km 11.5, 70006, Colonia, Uruguay.
Theor Appl Genet. 2018 Dec;131(12):2719-2731. doi: 10.1007/s00122-018-3186-3. Epub 2018 Sep 19.
Multi-trait genomic prediction models are useful to allocate available resources in breeding programs by targeted phenotyping of correlated traits when predicting expensive and labor-intensive quality parameters. Multi-trait genomic prediction models can be used to predict labor-intensive or expensive correlated traits where phenotyping depth of correlated traits could be larger than phenotyping depth of targeted traits, reducing resources and improving prediction accuracy. This is particularly important in the context of allocating phenotyping resource in plant breeding programs. The objective of this work was to evaluate multi-trait models predictive ability with different depth of phenotypic information from correlated traits. We evaluated 495 wheat advanced breeding lines for eight baking quality traits which were genotyped with genotyping-by-sequencing. Through different approaches for cross-validation, we evaluated the predictive ability of a single-trait model and a multi-trait model. Moreover, we evaluated different sizes of the training population (from 50 to 396 individuals) for the trait of interest, different depth of phenotypic information for correlated traits (50 and 100%) and the number of correlated traits to be used (one to three). There was no loss in the predictive ability by reducing the training population up to a 30% (149 individuals) when using correlated traits. A multi-trait model with one highly correlated trait phenotyped for both the training and testing sets was the best model considering phenotyping resources and the gain in predictive ability. The inclusion of correlated traits in the training and testing lines is a strategic approach to replace phenotyping of labor-intensive and high cost traits in a breeding program.
多性状基因组预测模型通过对相关性状进行目标表型分析,在预测昂贵且劳动密集型的质量参数时,有助于在育种计划中分配可用资源。多性状基因组预测模型可用于预测劳动密集型或昂贵的相关性状,相关性状的表型深度可能大于目标性状的表型深度,从而减少资源并提高预测准确性。这在植物育种计划中分配表型资源方面尤为重要。本研究的目的是评估具有不同相关性状表型信息深度的多性状模型的预测能力。我们对 495 条小麦高级育种系进行了 8 个烘焙品质性状的表型分析,这些性状是通过测序进行基因分型的。通过不同的交叉验证方法,我们评估了单性状模型和多性状模型的预测能力。此外,我们还评估了不同大小的感兴趣性状的训练群体(50 至 396 个个体)、相关性状的不同表型信息深度(50%和 100%)以及要使用的相关性状数量(1 至 3 个)。当使用相关性状时,将训练群体减少到 30%(149 个个体)不会降低预测能力。考虑到表型资源和预测能力的提高,具有一个高度相关性状的多性状模型是最佳模型,该性状在训练和测试集中均进行了表型分析。在训练和测试系中包含相关性状是替代育种计划中劳动密集型和高成本性状表型分析的一种策略。