Suppr超能文献

利用不同的统计模型预测意大利荷斯坦奶牛生产和功能性状的直接基因组值。

Use of different statistical models to predict direct genomic values for productive and functional traits in Italian Holsteins.

机构信息

Dipartimento di Agraria-Sezione Scienze Zootecniche, Università di Sassari, Sassari, Italy.

出版信息

J Anim Breed Genet. 2013 Feb;130(1):32-40. doi: 10.1111/j.1439-0388.2012.01019.x. Epub 2012 Jul 24.

Abstract

One of the main issues in genomic selection was the huge unbalance between number of markers and phenotypes available. In this work, principal component analysis is used to reduce the number of predictors for calculating direct genomic breeding values (DGV) for production and functional traits. 2093 Italian Holstein bulls were genotyped with the 54 K Illumina beadchip, and 39,555 SNP markers were retained after data editing. Principal Components (PC) were extracted from SNP matrix, and 15,207 PC explaining 99% of the original variance were retained and used as predictors. Bulls born before 2001 were included in the reference population, younger animals in the test population. A BLUP model was used to estimate the effect of principal component on deregressed proof (DRPF) for 35 traits and results were compared to those obtained by using SNP genotypes as predictors either with BLUP or with Bayes_A models. Correlations between DGV and DRPF did not substantially differ among the three methods except for milk fat content. The lowest prediction bias was obtained for the method based on the use of principal component. Regression coefficients of DRPF on DGV were lower than one for the approach based on the use of PC and higher than one for the other two methods. The use of PC as predictors resulted in a large reduction of number of predictors (approximately 38%) and of computational time that was approximately 2% of the time needed to estimate SNP effects with the other two methods. Accuracies of genomic predictions were in most of cases only slightly higher than those of the traditional pedigree index, probably due to the limited size of the considered population.

摘要

基因组选择的主要问题之一是可用标记数量与表型数量之间存在巨大的不平衡。在这项工作中,主成分分析用于减少预测因子的数量,以便计算生产和功能性状的直接基因组育种值(DGV)。2093 头意大利荷斯坦公牛使用 Illumina 54 K 珠芯片进行了基因分型,数据编辑后保留了 39555 个 SNP 标记。从 SNP 矩阵中提取主成分(PC),保留了 15207 个 PC,它们解释了原始方差的 99%,并用作预测因子。2001 年前出生的公牛被纳入参考群体,较年轻的动物被纳入测试群体。使用 BLUP 模型估计主成分对回归证明(DRPF)的影响,对 35 个性状进行了 35 次估计,并将结果与使用 SNP 基因型作为预测因子的 BLUP 或 Bayes_A 模型的结果进行了比较。除了乳脂含量外,DGV 和 DRPF 之间的相关性在三种方法之间没有实质性差异。基于主成分使用的方法获得了最低的预测偏差。基于使用 PC 的方法,DRPF 对 DGV 的回归系数低于 1,而其他两种方法的回归系数高于 1。使用 PC 作为预测因子会导致预测因子数量(约 38%)和计算时间的大量减少,计算时间约为使用其他两种方法估计 SNP 效应所需时间的 2%。在大多数情况下,基因组预测的准确性仅略高于传统系谱指数,这可能是由于所考虑的群体规模有限。

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验