Hozé C, Fritz S, Phocas F, Boichard D, Ducrocq V, Croiseau P
Institut National de la Recherche Agronomique (INRA), UMR 1313, Génétique Animale et Biologie Intégrative (GABI), 78350 Jouy-en-Josas, France; Union Nationales des Coopératives d'Élevages et d'Insémination Animales (UNCEIA), 149 rue de Bercy, 75012 Paris, France.
Union Nationales des Coopératives d'Élevages et d'Insémination Animales (UNCEIA), 149 rue de Bercy, 75012 Paris, France.
J Dairy Sci. 2014;97(6):3918-29. doi: 10.3168/jds.2013-7761. Epub 2014 Apr 3.
Single-breed genomic selection (GS) based on medium single nucleotide polymorphism (SNP) density (~50,000; 50K) is now routinely implemented in several large cattle breeds. However, building large enough reference populations remains a challenge for many medium or small breeds. The high-density BovineHD BeadChip (HD chip; Illumina Inc., San Diego, CA) containing 777,609 SNP developed in 2010 is characterized by short-distance linkage disequilibrium expected to be maintained across breeds. Therefore, combining reference populations can be envisioned. A population of 1,869 influential ancestors from 3 dairy breeds (Holstein, Montbéliarde, and Normande) was genotyped with the HD chip. Using this sample, 50K genotypes were imputed within breed to high-density genotypes, leading to a large HD reference population. This population was used to develop a multi-breed genomic evaluation. The goal of this paper was to investigate the gain of multi-breed genomic evaluation for a small breed. The advantage of using a large breed (Normande in the present study) to mimic a small breed is the large potential validation population to compare alternative genomic selection approaches more reliably. In the Normande breed, 3 training sets were defined with 1,597, 404, and 198 bulls, and a unique validation set included the 394 youngest bulls. For each training set, estimated breeding values (EBV) were computed using pedigree-based BLUP, single-breed BayesC, or multi-breed BayesC for which the reference population was formed by any of the Normande training data sets and 4,989 Holstein and 1,788 Montbéliarde bulls. Phenotypes were standardized by within-breed genetic standard deviation, the proportion of polygenic variance was set to 30%, and the estimated number of SNP with a nonzero effect was about 7,000. The 2 genomic selection (GS) approaches were performed using either the 50K or HD genotypes. The correlations between EBV and observed daughter yield deviations (DYD) were computed for 6 traits and using the different prediction approaches. Compared with pedigree-based BLUP, the average gain in accuracy with GS in small populations was 0.057 for the single-breed and 0.086 for multi-breed approach. This gain was up to 0.193 and 0.209, respectively, with the large reference population. Improvement of EBV prediction due to the multi-breed evaluation was higher for animals not closely related to the reference population. In the case of a breed with a small reference population size, the increase in correlation due to multi-breed GS was 0.141 for bulls without their sire in reference population compared with 0.016 for bulls with their sire in reference population. These results demonstrate that multi-breed GS can contribute to increase genomic evaluation accuracy in small breeds.
基于中等单核苷酸多态性(SNP)密度(约50,000个;50K)的单品种基因组选择(GS)目前已在几个大型牛品种中常规实施。然而,对于许多中小型品种而言,构建足够大的参考群体仍然是一项挑战。2010年开发的包含777,609个SNP的高密度牛HD基因分型芯片(HD芯片;Illumina公司,加利福尼亚州圣地亚哥)的特点是预期在各品种间能保持短距离连锁不平衡。因此,可以设想合并参考群体。对来自3个奶牛品种(荷斯坦、蒙贝利亚尔和诺曼底)的1,869头有影响力的祖先进行了HD芯片基因分型。利用该样本,在品种内将50K基因型推算为高密度基因型,从而形成了一个大型HD参考群体。该群体被用于开展多品种基因组评估。本文的目的是研究多品种基因组评估对一个小型品种的增益情况。利用一个大型品种(本研究中的诺曼底品种)来模拟小型品种的优势在于有大量潜在的验证群体,能更可靠地比较不同的基因组选择方法。在诺曼底品种中,定义了3个训练集,分别包含1,597头、404头和198头公牛,还有一个单独的验证集,包含394头最年轻的公牛。对于每个训练集,使用基于系谱的最佳线性无偏预测(BLUP)、单品种贝叶斯C法或多品种贝叶斯C法计算估计育种值(EBV),其中多品种贝叶斯C法的参考群体由诺曼底训练数据集之一以及4,989头荷斯坦公牛和1,788头蒙贝利亚尔公牛组成。表型通过品种内遗传标准差进行标准化,多基因方差比例设定为30%,估计有非零效应的SNP数量约为七千个。两种基因组选择(GS)方法分别使用50K或HD基因型进行。针对6个性状并使用不同的预测方法,计算了EBV与观察到的女儿产奶偏差(DYD)之间的相关性。与基于系谱的BLUP相比,在小群体中,单品种GS的准确性平均增益为0.057,多品种方法为0.086。在有大型参考群体的情况下,这一增益分别高达0.193和0.209。对于与参考群体关系不密切的动物,多品种评估对EBV预测的改进更高。在参考群体规模较小的品种中,与参考群体中有父亲的公牛相比,多品种GS导致的相关性增加在没有父亲在参考群体中的公牛中为0.141,而在有父亲在参考群体中的公牛中为0.016。这些结果表明,多品种GS有助于提高小型品种的基因组评估准确性。