Suppr超能文献

安格斯牛的基因组预测:交叉验证的样本量、响应变量和聚类方法比较

Genomic predictions in Angus cattle: comparisons of sample size, response variables, and clustering methods for cross-validation.

作者信息

Boddhireddy P, Kelly M J, Northcutt S, Prayaga K C, Rumph J, DeNise S

机构信息

Zoetis Inc., Kalamazoo, MI 49007.

出版信息

J Anim Sci. 2014 Feb;92(2):485-97. doi: 10.2527/jas.2013-6757. Epub 2014 Jan 15.

Abstract

Advances in genomics, molecular biology, and statistical genetics have created a paradigm shift in the way livestock producers pursue genetic improvement in their herds. The nexus of these technologies has resulted in combining genotypic and phenotypic information to compute genomically enhanced measures of genetic merit of individual animals. However, large numbers of genotyped and phenotyped animals are required to produce robust estimates of the effects of SNP that are summed together to generate direct genomic breeding values (DGV). Data on 11,756 Angus animals genotyped with the Illumina BovineSNP50 Beadchip were used to develop genomic predictions for 17 traits reported by the American Angus Association through Angus Genetics Inc. in their National Cattle Evaluation program. Marker effects were computed using a 5-fold cross-validation approach and a Bayesian model averaging algorithm. The accuracies were examined with EBV and deregressed EBV (DEBV) response variables and with K-means and identical by state (IBS)-based cross-validation methodologies. The cross-validation accuracies obtained using EBV response variables were consistently greater than those obtained using DEBV (average correlations were 0.64 vs. 0.57). The accuracies obtained using K-means cross-validation were consistently smaller than accuracies obtained with the IBS-based cross-validation approach (average correlations were 0.58 vs. 0.64 with EBV used as a response variable). Comparing the results from the current study with the results from a similar study consisting of only 2,253 records indicated that larger training population size resulted in higher accuracies in validation animals and explained on average 18% (69% improvement) additional genetic variance across all traits.

摘要

基因组学、分子生物学和统计遗传学的进展,使家畜养殖者追求畜群遗传改良的方式发生了范式转变。这些技术的结合,使得基因型和表型信息得以整合,用于计算个体动物基因组增强的遗传价值指标。然而,要对单核苷酸多态性(SNP)的效应做出可靠估计,进而汇总得出直接基因组育种值(DGV),就需要大量进行基因分型和表型分型的动物。本研究使用了11,756头经Illumina BovineSNP50基因芯片进行基因分型的安格斯牛的数据,对美国安格斯协会通过安格斯遗传学公司在其全国奶牛评估计划中报告的17个性状进行基因组预测。使用5倍交叉验证方法和贝叶斯模型平均算法计算标记效应。通过估计育种值(EBV)和去回归估计育种值(DEBV)响应变量,以及基于K均值和状态相同(IBS)的交叉验证方法来检验准确性。使用EBV响应变量获得的交叉验证准确性始终高于使用DEBV获得的准确性(平均相关性分别为0.64和0.57)。使用K均值交叉验证获得的准确性始终低于基于IBS的交叉验证方法获得的准确性(以EBV作为响应变量时,平均相关性分别为0.58和0.64)。将本研究结果与另一项仅包含2,253条记录的类似研究结果进行比较,结果表明,更大的训练群体规模会使验证动物的准确性更高,并且在所有性状上平均解释了额外18%(提高了69%)的遗传方差。

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验