National Fund for Scientific Research (F.R.S.-FNRS), Brussels, Belgium.
TERRA Teaching and Research Centre, Gembloux Agro-Bio Tech, University of Liège, Gembloux, Belgium.
J Anim Breed Genet. 2022 Jan;139(1):40-61. doi: 10.1111/jbg.12643. Epub 2021 Aug 24.
Assignment of individual cattle to a specific breed can often not rely on pedigree information. This is especially the case for local breeds for which the development of genomic assignment tools is required to allow individuals of unknown origin to be included to their herd books. A breed assignment model can be based on two specific stages: (a) the selection of breed-informative markers and (b) the assignment of individuals to a breed with a classification method. However, the performance of combination of methods used in these two stages has been rarely studied until now. In this study, the combination of 16 different SNP panels with four classification methods was developed on 562 reference genotypes from 12 cattle breeds. Based on their performances, best models were validated on three local breeds of interest. In cross-validation, 14 models had a global cross-validation accuracy higher than 90%, with a maximum of 98.22%. In validation, best models used 7,153 or 2,005 SNPs, based on a partial least squares-discriminant analysis (PLS-DA) and assigned individuals to breeds based on nearest shrunken centroids. The average validation sensitivity of the first two best models for the three local breeds of interest were 98.33% and 97.5%. Moreover, results reported in this study suggest that further studies should consider the PLS-DA method when selecting breed-informative SNPs.
对个体牛的品种归属通常不能依赖系谱信息。对于本地品种来说尤其如此,因为需要开发基因组归属工具,以便将来源不明的个体纳入其牛群登记簿。品种归属模型可以基于两个特定阶段:(a)选择品种信息标记和(b)使用分类方法将个体分配到品种。然而,直到现在,这些两个阶段中使用的方法组合的性能才很少被研究。在这项研究中,针对来自 12 个牛品种的 562 个参考基因型,开发了 16 种不同 SNP 面板与 4 种分类方法的组合。基于它们的性能,在三个感兴趣的本地品种上对最佳模型进行了验证。在交叉验证中,14 种模型的全局交叉验证准确性高于 90%,最高可达 98.22%。在验证中,最佳模型使用了 7153 或 2005 个 SNP,基于偏最小二乘判别分析(PLS-DA)并根据最近的收缩质心将个体分配到品种。对于三个感兴趣的本地品种,前两个最佳模型的平均验证灵敏度分别为 98.33%和 97.5%。此外,本研究报告的结果表明,在选择品种信息 SNP 时,应进一步考虑 PLS-DA 方法。