Genus Plc, Hendersonville, TN, USA.
Genet Sel Evol. 2011 Jan 5;43(1):1. doi: 10.1186/1297-9686-43-1.
The incorporation of genomic coefficients into the numerator relationship matrix allows estimation of breeding values using all phenotypic, pedigree and genomic information simultaneously. In such a single-step procedure, genomic and pedigree-based relationships have to be compatible. As there are many options to create genomic relationships, there is a question of which is optimal and what the effects of deviations from optimality are.
Data of litter size (total number born per litter) for 338,346 sows were analyzed. Illumina PorcineSNP60 BeadChip genotypes were available for 1,989. Analyses were carried out with the complete data set and with a subset of genotyped animals and three generations pedigree (5,090 animals). A single-trait animal model was used to estimate variance components and breeding values. Genomic relationship matrices were constructed using allele frequencies equal to 0.5 (G05), equal to the average minor allele frequency (GMF), or equal to observed frequencies (GOF). A genomic matrix considering random ascertainment of allele frequencies was also used (GOF*). A normalized matrix (GN) was obtained to have average diagonal coefficients equal to 1. The genomic matrices were combined with the numerator relationship matrix creating H matrices.
In G05 and GMF, both diagonal and off-diagonal elements were on average greater than the pedigree-based coefficients. In GOF and GOF*, the average diagonal elements were smaller than pedigree-based coefficients. The mean of off-diagonal coefficients was zero in GOF and GOF*. Choices of G with average diagonal coefficients different from 1 led to greater estimates of additive variance in the smaller data set. The correlation between EBV and genomic EBV (n = 1,989) were: 0.79 using G05, 0.79 using GMF, 0.78 using GOF, 0.79 using GOF*, and 0.78 using GN. Accuracies calculated by inversion increased with all genomic matrices. The accuracies of genomic-assisted EBV were inflated in all cases except when GN was used.
Parameter estimates may be biased if the genomic relationship coefficients are in a different scale than pedigree-based coefficients. A reasonable scaling may be obtained by using observed allele frequencies and re-scaling the genomic relationship matrix to obtain average diagonal elements of 1.
将基因组系数纳入分子关系矩阵中,可以同时利用所有表型、系谱和基因组信息来估计育种值。在这种单步过程中,基因组和系谱关系必须兼容。由于有许多创建基因组关系的选择,因此存在一个问题,即哪种选择是最佳的,以及偏离最优性的影响是什么。
分析了 338346 头母猪的窝产仔数(每窝出生的总头数)数据。1989 头猪可用 Illumina PorcineSNP60 BeadChip 基因型。使用完整数据集和一组已基因分型动物和三代系谱(5090 头动物)进行分析。使用单性状动物模型估计方差分量和育种值。使用等位基因频率等于 0.5(G05)、等于平均小等位基因频率(GMF)或等于观察到的频率(GOF)构建基因组关系矩阵。还使用考虑等位基因频率随机确定的基因组矩阵(GOF*)。获得一个归一化矩阵(GN),使对角线系数的平均值等于 1。将基因组矩阵与分子关系矩阵组合创建 H 矩阵。
在 G05 和 GMF 中,对角线和非对角线元素的平均值均大于系谱系数。在 GOF 和 GOF中,平均对角线元素小于系谱系数。在 GOF 和 GOF中,非对角线系数的平均值为零。选择具有不同于 1 的平均对角线系数的 G 会导致较小数据集的加性方差估计值更大。使用 G05 时 EBV 和基因组 EBV(n=1989)之间的相关性为 0.79,使用 GMF 时为 0.79,使用 GOF 时为 0.78,使用 GOF*时为 0.79,使用 GN 时为 0.78。通过反转计算的准确性随着所有基因组矩阵的增加而增加。除使用 GN 外,在所有情况下,基因组辅助 EBV 的准确性都被夸大了。
如果基因组关系系数与系谱系数不在同一范围内,则参数估计可能会有偏差。通过使用观察到的等位基因频率并重新缩放基因组关系矩阵以获得平均对角线元素为 1,可以获得合理的缩放。