Department of Animal and Dairy Science, University of Georgia, Athens 30602.
Department of Animal and Dairy Science, University of Georgia, Athens 30602.
J Dairy Sci. 2022 Nov;105(12):9810-9821. doi: 10.3168/jds.2022-22143. Epub 2022 Oct 12.
High relatedness in the US Holstein breed can be attributed to the increased rate of inbreeding that resulted from strong selection and the extensive use of a few bulls via reproductive biotechnology. The objectives of this study were to determine whether clustering could separate selected candidates into genetically different groups and whether such clustering could reduce the expected inbreeding of the next generation. A genomic relationship matrix composed of 1,145 sires with the most registered progeny in the breed born after 1985 was used for principal component analysis and k-means clustering. The 5 clusters reduced the variance by 25% and contained 171 (C1), 252 (C2), 200 (C3), 244 (C4), and 278 (C5) animals, respectively. The 2 most predominant families were C1 and C2, while C4 contained the most international animals. On average, C1 and C5 contained older animals; the average birth year per cluster was 1988 (C1), 1996 (C2 and C3), 1999 (C4), and 1990 (C5). Increasing to 10 clusters allowed the separation of the predominant sons. Statistically significant differences were observed for indices (net merit index, cheese merit index, and fluid merit index), daughter pregnancy rate, and production traits (milk, fat, and protein), with older clusters having lower merit for production but higher for reproduction. K-means clustering was also used for 20,099 animals considered as selection candidates. Based on the reduction in variance achieved by clustering, 5 to 7 clusters were appropriate. The number of animals in each cluster was 3,577 (C1), 3,073 (C2), 3,302 (C3), 5,931 (C4), and 4,216 (C5). The expected inbreeding from within or across cluster mating was calculated using the complete pedigree, assuming the mean inbreeding of animals born in the same year when parents are unknown. Generally, inbreeding was highest within cluster mating and lowest across cluster mating. Even when 10 clusters were used, one cluster always gave low inbreeding in all scenarios. This suggests that this cluster contains animals that differ from all other groups but still contains enough diversity within itself. Based on lower across cluster inbreeding, up to 7 clusters were appropriate. Statistically significant differences in genomic estimated breeding values were found between clusters. The rankings of clusters for different traits were mostly the same except for reproduction and fat. Results show that diversity within the population exists and clustering of selection candidates can reduce the expected inbreeding of the next generations.
美国荷斯坦牛的高度近交可归因于通过生殖生物技术选择和广泛使用少数公牛导致的近交率增加。本研究的目的是确定聚类是否可以将选定的候选者分为遗传上不同的群体,以及这种聚类是否可以降低下一代的预期近交程度。使用由 1145 头具有最多注册后代的公牛组成的基因组关系矩阵进行主成分分析和 K-均值聚类。5 个聚类降低了 25%的方差,分别包含 171 头(C1)、252 头(C2)、200 头(C3)、244 头(C4)和 278 头(C5)动物。两个最主要的家族是 C1 和 C2,而 C4 则包含最多的国际动物。平均而言,C1 和 C5 包含较老的动物;每个聚类的平均出生年份分别为 1988 年(C1)、1996 年(C2 和 C3)、1999 年(C4)和 1990 年(C5)。增加到 10 个聚类可以分离主要的后代。在净效益指数、奶酪效益指数和液体效益指数、女儿妊娠率和生产性状(牛奶、脂肪和蛋白质)方面观察到指数有统计学意义的差异,较老的聚类在生产方面的效益较低,但在繁殖方面的效益较高。K-均值聚类也用于 20099 头被认为是选择候选者的动物。基于聚类实现的方差降低,5 到 7 个聚类是合适的。每个聚类的动物数量为 3577 头(C1)、3073 头(C2)、3302 头(C3)、5931 头(C4)和 4216 头(C5)。根据完整的系谱,假设当父母未知时,同一年出生的动物的平均近交率,计算了来自簇内或簇间交配的预期近交程度。通常,簇内交配的近交程度最高,而簇间交配的近交程度最低。即使使用 10 个聚类,在所有情况下,一个聚类始终会导致较低的近交。这表明该聚类包含与所有其他群体不同的动物,但仍在自身内部包含足够的多样性。基于较低的跨簇近交程度,最多 7 个聚类是合适的。在聚类之间发现了基因组估计育种值的统计学显著差异。不同性状的聚类排名大多相同,除了繁殖和脂肪。结果表明,群体内部存在多样性,选择候选者的聚类可以降低下一代的预期近交程度。