Suppr超能文献

绵羊品种基因型推断的准确性。

Accuracy of genotype imputation in sheep breeds.

机构信息

Biosciences Research Division, Department of Primary Industries, 1 Park Drive, Bundoora, Victoria 3083, Australia.

出版信息

Anim Genet. 2012 Feb;43(1):72-80. doi: 10.1111/j.1365-2052.2011.02208.x. Epub 2011 May 27.

Abstract

Although genomic selection offers the prospect of improving the rate of genetic gain in meat, wool and dairy sheep breeding programs, the key constraint is likely to be the cost of genotyping. Potentially, this constraint can be overcome by genotyping selection candidates for a low density (low cost) panel of SNPs with sparse genotype coverage, imputing a much higher density of SNP genotypes using a densely genotyped reference population. These imputed genotypes would then be used with a prediction equation to produce genomic estimated breeding values. In the future, it may also be desirable to impute very dense marker genotypes or even whole genome re-sequence data from moderate density SNP panels. Such a strategy could lead to an accurate prediction of genomic estimated breeding values across breeds, for example. We used genotypes from 48 640 (50K) SNPs genotyped in four sheep breeds to investigate both the accuracy of imputation of the 50K SNPs from low density SNP panels, as well as prospects for imputing very dense or whole genome re-sequence data from the 50K SNPs (by leaving out a small number of the 50K SNPs at random). Accuracy of imputation was low if the sparse panel had less than 5000 (5K) markers. Across breeds, it was clear that the accuracy of imputing from sparse marker panels to 50K was higher if the genetic diversity within a breed was lower, such that relationships among animals in that breed were higher. The accuracy of imputation from sparse genotypes to 50K genotypes was higher when the imputation was performed within breed rather than when pooling all the data, despite the fact that the pooled reference set was much larger. For Border Leicesters, Poll Dorsets and White Suffolks, 5K sparse genotypes were sufficient to impute 50K with 80% accuracy. For Merinos, the accuracy of imputing 50K from 5K was lower at 71%, despite a large number of animals with full genotypes (2215) being used as a reference. For all breeds, the relationship of individuals to the reference explained up to 64% of the variation in accuracy of imputation, demonstrating that accuracy of imputation can be increased if sires and other ancestors of the individuals to be imputed are included in the reference population. The accuracy of imputation could also be increased if pedigree information was available and was used in tracking inheritance of large chromosome segments within families. In our study, we only considered methods of imputation based on population-wide linkage disequilibrium (largely because the pedigree for some of the populations was incomplete). Finally, in the scenarios designed to mimic imputation of high density or whole genome re-sequence data from the 50K panel, the accuracy of imputation was much higher (86-96%). This is promising, suggesting that in silico genome re-sequencing is possible in sheep if a suitable pool of key ancestors is sequenced for each breed.

摘要

虽然基因组选择为提高肉类、羊毛和奶制品羊的遗传增益速度提供了前景,但关键的限制因素可能是基因分型的成本。通过对低密度(低成本)SNP 面板的选择候选者进行基因分型,以稀疏基因型覆盖,使用高密度基因分型参考群体来推断更高密度的 SNP 基因型,从而有可能克服这种限制。然后,将这些推断的基因型用于预测方程,以生成基因组估计育种值。将来,可能还需要从中等密度 SNP 面板中推断出非常密集的标记基因型甚至整个基因组重测序数据。例如,这种策略可以实现跨品种的基因组估计育种值的准确预测。我们使用来自四个绵羊品种的 48640(50K)个 SNP 的基因型来研究从低密度 SNP 面板推断 50K SNP 的准确性,以及从 50K SNP 推断非常密集或整个基因组重测序数据的前景(随机排除少量 50K SNP)。如果稀疏面板的标记少于 5000(5K)个,则推断的准确性较低。在所有品种中,如果品种内的遗传多样性较低,即该品种内动物之间的关系较高,则从稀疏标记面板推断到 50K 的准确性更高。尽管汇总参考集大得多,但在品种内进行推断而不是汇总所有数据时,从稀疏基因型推断到 50K 基因型的准确性更高。对于 Border Leicesters、Poll Dorsets 和 White Suffolks,5K 稀疏基因型足以以 80%的准确率推断 50K。对于 Merinos,尽管使用了大量具有完整基因型(2215 个)的动物作为参考,但从 5K 推断 50K 的准确性较低,为 71%。对于所有品种,个体与参考的关系解释了推断准确性变化的高达 64%,表明如果在参考群体中包含要推断的个体的父亲和其他祖先,则可以提高推断的准确性。如果可用并且可用于跟踪家族内大染色体片段的遗传,则可以提高推断的准确性。在我们的研究中,我们仅考虑基于全基因组连锁不平衡的推断方法(主要是因为某些群体的系谱不完整)。最后,在模拟从 50K 面板推断高密度或全基因组重测序数据的场景中,推断的准确性要高得多(86-96%)。这很有希望,表明如果为每个品种对合适的关键祖先进行测序,则绵羊的虚拟基因组重测序是可行的。

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验