比较用于估计十个牛品种个体动物基因组品种组成的单核苷酸多态性（SNP）面板和统计方法。

Comparing SNP panels and statistical methods for estimating genomic breed composition of individual animals in ten cattle breeds.

作者信息

He Jun, Guo Yage, Xu Jiaqi, Li Hao, Fuller Anna, Tait Richard G, Wu Xiao-Lin, Bauck Stewart

机构信息

Biostatistics and Bioinformatics, Neogen GeneSeek Operations, Lincoln, NE, USA.

College of Animal Science and Technology, Hunan Agricultural University, Changsha, China.

出版信息

BMC Genet. 2018 Aug 9;19(1):56. doi: 10.1186/s12863-018-0654-3.

DOI:10.1186/s12863-018-0654-3

PMID:30092776

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC6085684/

Abstract

BACKGROUND

SNPs are informative to estimate genomic breed composition (GBC) of individual animals, but selected SNPs for this purpose were not made available in the commercial bovine SNP chips prior to the present study. The primary objective of the present study was to select five common SNP panels for estimating GBC of individual animals initially involving 10 cattle breeds (two dairy breeds and eight beef breeds). The performance of the five common SNP panels was evaluated based on admixture model and linear regression model, respectively. Finally, the downstream implication of GBC on genomic prediction accuracies was investigated and discussed in a Santa Gertrudis cattle population.

RESULTS

There were 15,708 common SNPs across five currently-available commercial bovine SNP chips. From this set, four subsets (1,000, 3,000, 5,000, and 10,000 SNPs) were selected by maximizing average Euclidean distance (AED) of SNP allelic frequencies among the ten cattle breeds. For 198 animals presented as Akaushi, estimated GBC of the Akaushi breed (GBCA) based on the admixture model agreed very well among the five SNP panels, identifying 166 animals with GBCA = 1. Using the same SNP panels, the linear regression approach reported fewer animals with GBCA = 1. Nevertheless, estimated GBCA using both models were highly correlated (r = 0.953 to 0.992). In the genomic prediction of a Santa Gertrudis population (and crosses), the results showed that the predictability of molecular breeding values using SNP effects obtained from 1,225 animals with no less than 0.90 GBC of Santa Gertrudis (GBCSG) decreased on crossbred animals with lower GBCSG.

CONCLUSIONS

Of the two statistical models used to compute GBC, the admixture model gave more consistent results among the five selected SNP panels than the linear regression model. The availability of these common SNP panels facilitates identification and estimation of breed compositions using currently-available bovine SNP chips. In view of utility, the 1 K panel is the most cost effective and it is convenient to be included as add-on content in future development of bovine SNP chips, whereas the 10 K and 16 K SNP panels can be more resourceful if used independently for imputation to intermediate or high-density genotypes.

摘要

背景

单核苷酸多态性（SNPs）有助于估计个体动物的基因组品种组成（GBC），但在本研究之前，用于此目的的选定SNPs在商业牛SNP芯片中并未提供。本研究的主要目的是选择五个常见的SNP面板，用于估计最初涉及10个牛品种（两个奶牛品种和八个肉牛品种）的个体动物的GBC。分别基于混合模型和线性回归模型评估了这五个常见SNP面板的性能。最后，在圣格特鲁迪斯牛群体中研究并讨论了GBC对基因组预测准确性的下游影响。

结果

在五个当前可用的商业牛SNP芯片中共有15,708个常见SNP。从这个集合中，通过最大化十个牛品种之间SNP等位基因频率的平均欧几里得距离（AED），选择了四个子集（1000、3000、5000和10000个SNP）。对于198头呈现为赤牛的动物，基于混合模型估计的赤牛品种GBC（GBCA）在五个SNP面板之间非常一致，识别出166头GBCA = 1的动物。使用相同的SNP面板，线性回归方法报告的GBCA = 1的动物较少。然而，使用这两种模型估计的GBCA高度相关（r = 0.953至）。在圣格特鲁迪斯牛群体（及其杂交后代）的基因组预测中，结果表明使用来自1225头圣格特鲁迪斯牛GBC不少于0.90（GBCSG）的动物获得的SNP效应进行分子育种值预测时，对于GBCSG较低的杂交动物，预测能力会下降。

结论

在用于计算GBC的两种统计模型中，混合模型在五个选定的SNP面板之间给出的结果比线性回归模型更一致。这些常见SNP面板的可用性有助于使用当前可用的牛SNP芯片识别和估计品种组成。从实用性来看，1K面板最具成本效益，并且便于在未来牛SNP芯片的开发中作为附加内容包含在内，而10K和16K SNP面板如果独立用于推算到中等或高密度基因型，则可能更具资源优势。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3227/6085684/23004ddabae3/12863_2018_654_Fig1_HTML.jpg

相似文献

Comparing SNP panels and statistical methods for estimating genomic breed composition of individual animals in ten cattle breeds.

BMC Genet. 2018 Aug 9;19(1):56. doi: 10.1186/s12863-018-0654-3.

Estimation of genomic breed composition of individual animals in composite beef cattle.

Anim Genet. 2020 Jun;51(3):457-460. doi: 10.1111/age.12928. Epub 2020 Apr 2.

[Estimating genomic breed composition of individual animals using selected SNPs].

Yi Chuan. 2018 Apr 20;40(4):305-314. doi: 10.16288/j.yczz.17-394.

Genomic breed composition of Ningxiang pig via different SNP panels.

J Anim Physiol Anim Nutr (Berl). 2022 Jul;106(4):783-791. doi: 10.1111/jpn.13603. Epub 2021 Jul 14.

SNP panels for the estimation of dairy breed proportion and parentage assignment in African crossbred dairy cattle.

Genet Sel Evol. 2021 Mar 2;53(1):21. doi: 10.1186/s12711-021-00615-4.

Genetic tests for estimating dairy breed proportion and parentage assignment in East African crossbred cattle.

Genet Sel Evol. 2017 Sep 12;49(1):67. doi: 10.1186/s12711-017-0342-1.

Evaluating the use of statistical and machine learning methods for estimating breed composition of purebred and crossbred animals in thirteen cattle breeds using genomic information.

Front Genet. 2023 May 15;14:1120312. doi: 10.3389/fgene.2023.1120312. eCollection 2023.

A Causality Perspective of Genomic Breed Composition for Composite Animals.

Front Genet. 2020 Oct 30;11:546052. doi: 10.3389/fgene.2020.546052. eCollection 2020.

A low-density SNP genotyping panel for the accurate prediction of cattle breeds.

J Anim Sci. 2020 Nov 1;98(11). doi: 10.1093/jas/skaa337.

Estimation of Genomic Breed Composition for Purebred and Crossbred Animals Using Sparsely Regularized Admixture Models.

Front Genet. 2020 Jun 11;11:576. doi: 10.3389/fgene.2020.00576. eCollection 2020.

引用本文的文献

A deep learning strategy for accurate identification of purebred and hybrid pigs across SNP chips.

J Anim Sci Biotechnol. 2025 Aug 14;16(1):116. doi: 10.1186/s40104-025-01249-y.

Comprehensive duck DNA fingerprinting based on machine learning for breed identification.

Poult Sci. 2025 May 29;104(8):105359. doi: 10.1016/j.psj.2025.105359.

Population structure and breed identification of Chinese indigenous sheep breeds using whole genome SNPs and InDels.

Genet Sel Evol. 2024 Sep 3;56(1):60. doi: 10.1186/s12711-024-00927-1.

Definition of metafounders based on population structure analysis.

Genet Sel Evol. 2024 Jun 6;56(1):43. doi: 10.1186/s12711-024-00913-7.

A Comprehensive Genomic Analysis of Chinese Indigenous Ningxiang Pigs: Genomic Breed Compositions, Runs of Homozygosity, and Beyond.

Int J Mol Sci. 2023 Sep 26;24(19):14550. doi: 10.3390/ijms241914550.

Evaluating the use of statistical and machine learning methods for estimating breed composition of purebred and crossbred animals in thirteen cattle breeds using genomic information.

Front Genet. 2023 May 15;14:1120312. doi: 10.3389/fgene.2023.1120312. eCollection 2023.

Breed identification using breed-informative SNPs and machine learning based on whole genome sequence data and SNP chip data.

J Anim Sci Biotechnol. 2023 Jun 1;14(1):85. doi: 10.1186/s40104-023-00880-x.

The use of a genomic relationship matrix for breed assignment of cattle breeds: comparison and combination with a machine learning method.

J Anim Sci. 2023 Jan 3;101. doi: 10.1093/jas/skad172.

A look under the hood of genomic-estimated breed compositions for brangus cattle: What have we learned?

Front Genet. 2023 Mar 28;14:1080279. doi: 10.3389/fgene.2023.1080279. eCollection 2023.

A web tool for the global identification of pig breeds.

Genet Sel Evol. 2023 Mar 21;55(1):18. doi: 10.1186/s12711-023-00788-0.

本文引用的文献

Estimation of genome-wide and locus-specific breed composition in pigs.

Transl Anim Sci. 2017 Feb 1;1(1):36-44. doi: 10.2527/tas2016.0003. eCollection 2017 Feb.

Comparing strategies for selection of low-density SNPs for imputation-mediated genomic prediction in U. S. Holsteins.

Genetica. 2018 Apr;146(2):137-149. doi: 10.1007/s10709-017-0004-9. Epub 2017 Dec 14.

Moving Beyond Managing Realized Genomic Relationship in Long-Term Genomic Selection.

Genetics. 2017 Jun;206(2):1127-1138. doi: 10.1534/genetics.116.194449. Epub 2017 Apr 4.

LASER server: ancestry tracing with genotypes or sequence reads.

Bioinformatics. 2017 Jul 1;33(13):2056-2058. doi: 10.1093/bioinformatics/btx075.

Genomic Selection in Dairy Cattle: The USDA Experience.

Annu Rev Anim Biosci. 2017 Feb 8;5:309-327. doi: 10.1146/annurev-animal-021815-111422. Epub 2016 Nov 16.

Fast individual ancestry inference from DNA sequence data leveraging allele frequencies for multiple populations.

BMC Bioinformatics. 2015 Jan 16;16:4. doi: 10.1186/s12859-014-0418-7.

Within- and across-breed imputation of high-density genotypes in dairy and beef cattle from medium- and low-density genotypes.

J Anim Breed Genet. 2014 Jun;131(3):165-72. doi: 10.1111/jbg.12067. Epub 2013 Dec 5.

Within- and across-breed genomic predictions and genomic relationships for Western Pyrenees dairy sheep breeds Latxa, Manech, and Basco-Béarnaise.

J Dairy Sci. 2014 May;97(5):3200-12. doi: 10.3168/jds.2013-7745. Epub 2014 Mar 13.

Selection of SNP from 50K and 777K arrays to predict breed of origin in cattle.

J Anim Sci. 2013 Nov;91(11):5128-34. doi: 10.2527/jas.2013-6678. Epub 2013 Sep 17.

Inference of population splits and mixtures from genome-wide allele frequency data.

PLoS Genet. 2012;8(11):e1002967. doi: 10.1371/journal.pgen.1002967. Epub 2012 Nov 15.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

比较用于估计十个牛品种个体动物基因组品种组成的单核苷酸多态性（SNP）面板和统计方法。

Comparing SNP panels and statistical methods for estimating genomic breed composition of individual animals in ten cattle breeds.

作者信息

机构信息

出版信息

BACKGROUND

RESULTS

CONCLUSIONS

背景

结果

结论

相似文献

引用本文的文献

本文引用的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

相似文献

引用本文的文献

本文引用的文献