Dipartimento di Agraria, Università di Sassari, Viale Italia 39, 07100, Sassari, Italy.
J Anim Sci. 2013 Jan;91(1):29-37. doi: 10.2527/jas.2011-5061. Epub 2012 Oct 16.
In the current study, principal component (PC) analysis was used to reduce the number of predictors in the estimation of direct genomic breeding values (DGV) for meat traits in a sample of 479 Italian Simmental bulls. Single nucleotide polymorphism marker genotypes were determined with the 54K Illumina beadchip. After edits, 457 bulls and 40,179 SNP were retained. Principal component extraction was performed separately for each chromosome and 2466 new variables able to explain 70% of total variance were obtained. Bulls were divided into reference and validation population. Three scenarios of the ratio reference:validation were tested: 70:30, 80:20, 90:10. Effect of PC scores on polygenic EBV was estimated in the reference population using different models and methods. Traits analyzed were 7 beef traits: daily BW gain, size score, muscularity score, feet and legs score, beef index (economic index), calving ease direct effect, and cow muscularity. Accuracy was calculated as correlation between DGV and polygenic EBV in the validation bulls. Muscularity, feet and legs, and the beef index showed the greatest accuracies; calving ease, the least. In general, accuracies were slightly greater when reference animals were selected at random and the best scenario was 90:10 and no substantial differences in accuracy were found among different methods. Principal component analysis is entirely based on the factorization of the SNP (co)variance matrix and produced a reduced set of variables (6% of the original variables) which may be used for different phenotypic traits. In spite of this huge reduction in the number of independent variables, DGV accuracies resulted similar to those obtained by using the whole set of SNP markers. Accuracies of direct genomic values found in the present work were always greater than those of traditional parental average (PA). Thus, results of the present study may suggest a possible advantage of use of genomic indexes in the preselection of performance test candidates for beef traits. Moreover, the relevant reduction of variable space might allow genomic selection implementation also in small populations.
在当前的研究中,主成分(PC)分析被用于减少 479 头意大利西门塔尔公牛样本中用于估计肉质性状直接基因组育种值(DGV)的预测因子数量。单核苷酸多态性标记基因型使用 54K Illumina 珠芯片确定。经过编辑,保留了 457 头公牛和 40179 个 SNP。针对每条染色体分别进行主成分提取,获得了 2466 个新变量,能够解释总方差的 70%。公牛被分为参考和验证群体。测试了三种参考:验证比例的情景:70:30、80:20、90:10。在参考群体中使用不同的模型和方法估计 PC 得分对多基因 EBV 的影响。分析的性状有 7 个牛肉性状:日增重、体型评分、肌肉评分、脚部和腿部评分、牛肉指数(经济指数)、产犊容易度直接效应和母牛肌肉度。在验证公牛中,根据 DGV 和多基因 EBV 之间的相关性计算准确性。肌肉度、脚部和腿部以及牛肉指数的准确性最高;产犊容易度最低。通常,当随机选择参考动物时,准确性略高,最佳情景是 90:10,并且不同方法之间的准确性没有显著差异。主成分分析完全基于 SNP(协)方差矩阵的分解,产生了一组减少的变量(原始变量的 6%),可用于不同的表型性状。尽管独立变量的数量大幅减少,但 DGV 的准确性与使用整个 SNP 标记集获得的准确性相似。本研究中发现的直接基因组值的准确性始终高于传统的亲本平均值(PA)。因此,本研究的结果可能表明在牛肉性状的性能测试候选物的预筛选中使用基因组指数可能具有优势。此外,变量空间的相关减少可能允许在小群体中也实施基因组选择。