Department of Genetics and Biotechnology, Faculty of Agricultural Sciences, Aarhus University, Tjele, Denmark.
J Dairy Sci. 2010 Mar;93(3):1175-83. doi: 10.3168/jds.2009-2192.
This study investigated the reliability of genomic estimated breeding values (GEBV) in the Danish Holstein population. The data in the analysis included 3,330 bulls with both published conventional EBV and single nucleotide polymorphism (SNP) markers. After data editing, 38,134 SNP markers were available. In the analysis, all SNP were fitted simultaneously as random effects in a Bayesian variable selection model, which allows heterogeneous variances for different SNP markers. The response variables were the official EBV. Direct GEBV were calculated as the sum of individual SNP effects. Initial analyses of 4 index traits were carried out to compare models with different intensities of shrinkage for SNP effects; that is, mixture prior distributions of scaling factors (standard deviation of SNP effects) assuming 5, 10, 20, or 50% of SNP having large effects and the others having very small or no effects, and a single prior distribution common for all SNP. It was found that, in general, the model with a common prior distribution of scaling factors had better predictive ability than any mixture prior models. Therefore, a common prior model was used to estimate SNP effects and breeding values for all 18 index traits. Reliability of GEBV was assessed by squared correlation between GEBV and conventional EBV (r(2)(GEBV, EBV)), and expected reliability was obtained from prediction error variance using a 5-fold cross validation. Squared correlations between GEBV and published EBV (without any adjustment) ranged from 0.252 to 0.700, with an average of 0.418. Expected reliabilities ranged from 0.494 to 0.733, with an average of 0.546. Averaged over 18 traits, r(2)(GEBV, EBV) was 0.13 higher and expected reliability was 0.26 higher than reliability of conventional parent average. The results indicate that genomic selection can greatly improve the accuracy of preselection for young bulls compared with traditional selection based on parent average information.
本研究旨在探究丹麦荷斯坦牛群体中基因组估计育种值(GEBV)的可靠性。分析中使用的数据包括 3330 头公牛,这些公牛的表型数据既包含常规 EBV,也包含单核苷酸多态性(SNP)标记。经过数据编辑,共获得 38134 个 SNP 标记。在分析中,所有 SNP 作为随机效应同时拟合在贝叶斯变量选择模型中,该模型允许不同 SNP 标记的方差存在异质性。响应变量是官方 EBV。直接 GEBV 被计算为个体 SNP 效应的总和。初始分析了 4 个指标性状,以比较 SNP 效应不同程度收缩的模型;也就是说,假设 SNP 中 5%、10%、20%或 50%具有较大效应,其余 SNP 具有非常小或无效应,使用 SNP 效应的缩放因子(标准偏差)的混合先验分布(5、10、20 或 50%),或者 SNP 效应的单一先验分布,用于所有 SNP。结果表明,一般来说,具有缩放因子共同先验模型的预测能力优于任何混合先验模型。因此,使用共同先验模型估计所有 18 个指标性状的 SNP 效应和育种值。通过 GEBV 与常规 EBV(r(2)(GEBV, EBV))之间的平方相关来评估 GEBV 的可靠性,通过 5 倍交叉验证获得预测误差方差的期望可靠性。GEBV 与已发表 EBV(未经任何调整)之间的平方相关性范围从 0.252 到 0.700,平均为 0.418。期望可靠性范围从 0.494 到 0.733,平均为 0.546。在 18 个性状的平均值中,r(2)(GEBV, EBV) 比常规亲本平均值高 0.13,期望可靠性高 0.26。结果表明,与基于亲本平均值信息的传统选择相比,基因组选择可以大大提高年轻公牛预选择的准确性。