Brito Luiz F, McEwan John C, Miller Stephen P, Pickering Natalie K, Bain Wendy E, Dodds Ken G, Schenkel Flávio S, Clarke Shannon M
Centre for Genetic Improvement of Livestock, University of Guelph, Guelph, N1G 2W1, Canada.
AgResearch, Invermay Agricultural Centre, Private Bag 50034, Mosgiel, 9053, New Zealand.
BMC Genet. 2017 Mar 14;18(1):25. doi: 10.1186/s12863-017-0492-8.
Knowledge about the genetic diversity of a population is a crucial parameter for the implementation of successful genomic selection and conservation of genetic resources. The aim of this research was to establish the scientific basis for the implementation of genomic selection in a composite Terminal sheep breeding scheme by providing consolidated linkage disequilibrium (LD) measures across SNP markers, estimating consistency of gametic phase between breed-groups, and assessing genetic diversity measures, such as effective population size (N), and population structure parameters, using a large number of animals (n = 14,845) genotyped with a high density SNP chip (606,006 markers). Information generated in this research will be useful for optimizing molecular breeding values predictions and managing the available genetic resources.
Overall, as expected, levels of pairwise LD decreased with increasing distance between SNP pairs. The mean LD r between adjacent SNP was 0.26 ± 0.10. The most recent effective population size for all animals (687) and separately per breed-groups: Primera (974), Lamb Supreme (380), Texel (227) and Dual-Purpose (125) was quite variable. The genotyped animals were outbred or had an average low level of inbreeding. Consistency of gametic phase was higher than 0.94 for all breed pairs at the average distance between SNP on the chip (~4.74 kb). Moreover, there was not a clear separation between the breed-groups based on principal component analysis, suggesting that a mixed-breed training population for calculation of molecular breeding values would be beneficial.
This study reports, for the first time, estimates of linkage disequilibrium, genetic diversity and population structure parameters from a genome-wide perspective in New Zealand Terminal Sire composite sheep breeds. The levels of linkage disequilibrium indicate that genomic selection could be implemented with the high density SNP panel. The moderate to high consistency of gametic phase between breed-groups and overlapping population structure support the pooling of the animals in a mixed training population for genomic predictions. In addition, the moderate to high N highlights the need to genotype and phenotype a large training population in order to capture most of the haplotype diversity and increase accuracies of genomic predictions. The results reported herein are a first step toward understanding the genomic architecture of a Terminal Sire composite sheep population and for the optimal implementation of genomic selection and genome-wide association studies in this sheep population.
了解种群的遗传多样性是成功实施基因组选择和保护遗传资源的关键参数。本研究的目的是通过提供全基因组范围内单核苷酸多态性(SNP)标记的连锁不平衡(LD)综合测量值、估计品种组间配子相位的一致性以及评估遗传多样性指标(如有效种群大小(N))和种群结构参数,为在复合终端绵羊育种方案中实施基因组选择奠定科学基础。本研究使用大量动物(n = 14,845)进行高密度SNP芯片(606,006个标记)基因分型。本研究产生的信息将有助于优化分子育种值预测和管理可用的遗传资源。
总体而言,正如预期的那样,成对LD水平随着SNP对之间距离的增加而降低。相邻SNP之间的平均LD r为0.26±0.10。所有动物(687)以及各品种组分别的最近有效种群大小:Primera(974)、Lamb Supreme(380)、特克塞尔(227)和两用型(125)差异很大。基因分型的动物为远交种或平均近亲繁殖水平较低。在芯片上SNP的平均距离(约4.74 kb)处,所有品种对的配子相位一致性均高于0.94。此外,基于主成分分析,品种组之间没有明显的分离,这表明用于计算分子育种值的混合品种训练群体将是有益的。
本研究首次从全基因组角度报道了新西兰终端父本复合绵羊品种的连锁不平衡、遗传多样性和种群结构参数估计值。连锁不平衡水平表明可以使用高密度SNP面板实施基因组选择。品种组之间配子相位的中等到高度一致性以及重叠的种群结构支持将动物合并到混合训练群体中进行基因组预测。此外,中等到高的N值突出了对大量训练群体进行基因分型和表型分型的必要性,以便捕获大部分单倍型多样性并提高基因组预测的准确性。本文报道的结果是朝着理解终端父本复合绵羊群体的基因组结构以及在该绵羊群体中最佳实施基因组选择和全基因组关联研究迈出的第一步。