Dept. of Agricultural, Food and Nutritional Science, University of Alberta, Edmonton, AB T6G 2P5, Canada.
J Anim Sci. 2013 Oct;91(10):4669-78. doi: 10.2527/jas.2013-5715.
In beef cattle, phenotypic data that are difficult and/or costly to measure, such as feed efficiency, and DNA marker genotypes are usually available on a small number of animals of different breeds or populations. To achieve a maximal accuracy of genomic prediction using the phenotype and genotype data, strategies for forming a training population to predict genomic breeding values (GEBV) of the selection candidates need to be evaluated. In this study, we examined the accuracy of predicting GEBV for residual feed intake (RFI) based on 522 Angus and 395 Charolais steers genotyped on SNP with the Illumina Bovine SNP50 Beadchip for 3 training population forming strategies: within breed, across breed, and by pooling data from the 2 breeds (i.e., combined). Two other scenarios with the training and validation data split by birth year and by sire family within a breed were also investigated to assess the impact of genetic relationships on the accuracy of genomic prediction. Three statistical methods including the best linear unbiased prediction with the relationship matrix defined based on the pedigree (PBLUP), based on the SNP genotypes (GBLUP), and a Bayesian method (BayesB) were used to predict the GEBV. The results showed that the accuracy of the GEBV prediction was the highest when the prediction was within breed and when the validation population had greater genetic relationships with the training population, with a maximum of 0.58 for Angus and 0.64 for Charolais. The within-breed prediction accuracies dropped to 0.29 and 0.38, respectively, when the validation populations had a minimal pedigree link with the training population. When the training population of a different breed was used to predict the GEBV of the validation population, that is, across-breed genomic prediction, the accuracies were further reduced to 0.10 to 0.22, depending on the prediction method used. Pooling data from the 2 breeds to form the training population resulted in accuracies increased to 0.31 and 0.43, respectively, for the Angus and Charolais validation populations. The results suggested that the genetic relationship of selection candidates with the training population has a greater impact on the accuracy of GEBV using the Illumina Bovine SNP50 Beadchip. Pooling data from different breeds to form the training population will improve the accuracy of across breed genomic prediction for RFI in beef cattle.
在肉牛中,通常可以获得一些不同品种或群体的动物的表型数据,例如饲料效率,以及 DNA 标记基因型,但这些数据很难且/或成本高昂。为了使用表型和基因型数据实现基因组预测的最大准确性,需要评估形成训练群体以预测选择候选者基因组育种值 (GEBV) 的策略。在这项研究中,我们检查了基于 Illumina Bovine SNP50 Beadchip 对 522 头安格斯牛和 395 头夏洛莱牛进行 SNP 基因分型的剩余饲料摄入量 (RFI) 的 GEBV 预测准确性,使用了 3 种训练群体形成策略:品种内、品种间和品种间数据混合(即组合)。还研究了另外两种训练和验证数据按出生年份和品种内的父系家族划分的情况,以评估遗传关系对基因组预测准确性的影响。使用三种统计方法,包括基于系谱定义的关系矩阵的最佳线性无偏预测(PBLUP)、基于 SNP 基因型的最佳线性无偏预测(GBLUP)和贝叶斯方法(BayesB)来预测 GEBV。结果表明,当预测在品种内进行并且验证群体与训练群体具有更大的遗传关系时,GEBV 预测的准确性最高,安格斯牛的最大预测值为 0.58,夏洛莱牛的最大预测值为 0.64。当验证群体与训练群体的系谱联系最小化时,品种内预测准确性分别下降到 0.29 和 0.38。当使用不同品种的训练群体来预测验证群体的 GEBV 时,即跨品种基因组预测,准确性进一步降低到 0.10 到 0.22,具体取决于所使用的预测方法。将两个品种的数据混合形成训练群体,分别使安格斯牛和夏洛莱牛验证群体的准确性提高到 0.31 和 0.43。结果表明,候选者与训练群体的遗传关系对使用 Illumina Bovine SNP50 Beadchip 的 GEBV 有更大的影响。混合来自不同品种的数据以形成训练群体将提高肉牛中 RFI 的跨品种基因组预测的准确性。