选择群体下单步基因组预测的准确性和偏差

The Accuracy and Bias of Single-Step Genomic Prediction for Populations Under Selection.

作者信息

Hsu Wan-Ling, Garrick Dorian J, Fernando Rohan L

机构信息

Department of Animal Science, Iowa State University, Ames, Iowa 50011.

Institute of Veterinary, Animal and Biomedical Sciences, Massey University, Palmerston North 4442, New Zealand.

出版信息

G3 (Bethesda). 2017 Aug 7;7(8):2685-2694. doi: 10.1534/g3.117.043596.

DOI:10.1534/g3.117.043596

PMID:28642364

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC5555473/

Abstract

In single-step analyses, missing genotypes are explicitly or implicitly imputed, and this requires centering the observed genotypes using the means of the unselected founders. If genotypes are only available for selected individuals, centering on the unselected founder mean is not straightforward. Here, computer simulation is used to study an alternative analysis that does not require centering genotypes but fits the mean [Formula: see text] of unselected individuals as a fixed effect. Starting with observed diplotypes from 721 cattle, a five-generation population was simulated with sire selection to produce 40,000 individuals with phenotypes, of which the 1000 sires had genotypes. The next generation of 8000 genotyped individuals was used for validation. Evaluations were undertaken with (J) or without (N) [Formula: see text] when marker covariates were not centered; and with (JC) or without (C) [Formula: see text] when all observed and imputed marker covariates were centered. Centering did not influence accuracy of genomic prediction, but fitting [Formula: see text] did. Accuracies were improved when the panel comprised only quantitative trait loci (QTL); models JC and J had accuracies of 99.4%, whereas models C and N had accuracies of 90.2%. When only markers were in the panel, the 4 models had accuracies of 80.4%. In panels that included QTL, fitting [Formula: see text] in the model improved accuracy, but had little impact when the panel contained only markers. In populations undergoing selection, fitting [Formula: see text] in the model is recommended to avoid bias and reduction in prediction accuracy due to selection.

摘要

在单步分析中，缺失基因型会被显式或隐式地估算，这需要使用未被选择的奠基者的均值对观察到的基因型进行中心化处理。如果基因型仅适用于选定的个体，以未被选择的奠基者均值进行中心化处理并非易事。在此，通过计算机模拟来研究一种无需对基因型进行中心化处理的替代分析方法，而是将未被选择个体的均值[公式：见正文]作为固定效应进行拟合。从721头牛的观察到的双倍型开始，模拟了一个五代群体，通过父系选择产生了40,000个具有表型的个体，其中1000个父系具有基因型。下一代的8000个基因型个体用于验证。当标记协变量未进行中心化处理时，分别在有（J）或无（N）[公式：见正文]的情况下进行评估；当所有观察到的和估算的标记协变量都进行了中心化处理时，分别在有（JC）或无（C）[公式：见正文]的情况下进行评估。中心化处理不会影响基因组预测的准确性，但拟合[公式：见正文]会产生影响。当面板仅包含数量性状位点（QTL）时，准确性会提高；模型JC和J的准确率为99.4%，而模型C和N的准确率为90.2%。当面板中仅包含标记时，这4个模型的准确率为80.4%。在包含QTL的面板中，在模型中拟合[公式：见正文]可提高准确性，但当面板仅包含标记时影响较小。在正在进行选择的群体中，建议在模型中拟合[公式：见正文]以避免由于选择导致的偏差和预测准确性降低。

相似文献

The Accuracy and Bias of Single-Step Genomic Prediction for Populations Under Selection.选择群体下单步基因组预测的准确性和偏差

G3 (Bethesda). 2017 Aug 7;7(8):2685-2694. doi: 10.1534/g3.117.043596.

Accuracy of prediction of simulated polygenic phenotypes and their underlying quantitative trait loci genotypes using real or imputed whole-genome markers in cattle.利用真实或推算的全基因组标记预测牛模拟多基因表型及其潜在数量性状位点基因型的准确性。

Genet Sel Evol. 2015 Dec 23;47:99. doi: 10.1186/s12711-015-0179-4.

Empirical and deterministic accuracies of across-population genomic prediction.跨群体基因组预测的经验性和确定性准确性。

Genet Sel Evol. 2015 Feb 6;47(1):5. doi: 10.1186/s12711-014-0086-0.

Accuracy of Genomic Prediction in Synthetic Populations Depending on the Number of Parents, Relatedness, and Ancestral Linkage Disequilibrium.取决于亲本数量、亲缘关系和祖先连锁不平衡的合成群体中基因组预测的准确性。

Genetics. 2017 Jan;205(1):441-454. doi: 10.1534/genetics.116.193243. Epub 2016 Nov 9.

Persistency of Prediction Accuracy and Genetic Gain in Synthetic Populations Under Recurrent Genomic Selection.轮回基因组选择下合成群体中预测准确性和遗传增益的持续性

G3 (Bethesda). 2017 Mar 10;7(3):801-811. doi: 10.1534/g3.116.036582.

The impact of clustering methods for cross-validation, choice of phenotypes, and genotyping strategies on the accuracy of genomic predictions.聚类方法对交叉验证、表型选择和基因分型策略对基因组预测准确性的影响。

J Anim Sci. 2019 Apr 3;97(4):1534-1549. doi: 10.1093/jas/skz055.

Using markers with large effect in genetic and genomic predictions.在遗传和基因组预测中使用具有大效应的标记。

J Anim Sci. 2017 Jan;95(1):59-71. doi: 10.2527/jas.2016.0754.

Prediction accuracy for a simulated maternally affected trait of beef cattle using different genomic evaluation models.利用不同基因组评估模型预测肉牛受母性影响的性状的准确性。

J Anim Sci. 2013 Sep;91(9):4090-8. doi: 10.2527/jas.2012-5826. Epub 2013 Jul 26.

Genomic prediction of simulated multibreed and purebred performance using observed fifty thousand single nucleotide polymorphism genotypes.利用观测到的五万个性状 SNP 基因型对模拟多品种和纯种表现进行基因组预测。

J Anim Sci. 2010 Feb;88(2):544-51. doi: 10.2527/jas.2009-2064. Epub 2009 Oct 9.

Using selection index theory to estimate consistency of multi-locus linkage disequilibrium across populations.利用选择指数理论估计多基因座连锁不平衡在不同群体间的一致性。

BMC Genet. 2015 Jul 19;16:87. doi: 10.1186/s12863-015-0252-6.

引用本文的文献

An effective hyper-parameter can increase the prediction accuracy in a single-step genetic evaluation.一个有效的超参数可以在单步遗传评估中提高预测准确性。

Front Genet. 2023 Jun 8;14:1104906. doi: 10.3389/fgene.2023.1104906. eCollection 2023.

Integration of beef cattle international pedigree and genomic estimated breeding values into national evaluations, with an application to the Italian Limousin population.将肉牛国际谱系和基因组估计育种值整合到国家评估中，并应用于意大利利木赞牛群体。

Genet Sel Evol. 2023 Jun 12;55(1):41. doi: 10.1186/s12711-023-00813-2.

Efficient large-scale single-step evaluations and indirect genomic prediction of genotyped selection candidates.高效的大规模单步评估和间接基因组预测的基因分型选择候选者。

Genet Sel Evol. 2023 Jun 8;55(1):37. doi: 10.1186/s12711-023-00808-z.

Validation with single-step SNPBLUP shows that evaluations can continue using a single mean of genotyped individuals, even with multiple breeds.单步 SNPBLUP 的验证表明，即使有多个品种，也可以继续使用纯合个体的单一平均值进行评估。

Genet Sel Evol. 2023 Mar 22;55(1):19. doi: 10.1186/s12711-023-00787-1.

New insights into the genetic resistance to paratuberculosis in Holstein cattle via single-step genomic evaluation.通过一步法基因组评估揭示荷斯坦奶牛对副结核病的遗传抗性的新见解。

Genet Sel Evol. 2022 Oct 15;54(1):67. doi: 10.1186/s12711-022-00757-z.

International single-step SNPBLUP beef cattle evaluations for Limousin weaning weight.国际单步 SNPBLUP 肉牛评估利木赞断奶体重。

Genet Sel Evol. 2022 Sep 4;54(1):57. doi: 10.1186/s12711-022-00748-0.

Impact of genomic preselection on subsequent genetic evaluations with ssGBLUP using real data from pigs.基因组预选择对使用猪真实数据的 ssGBLUP 后续遗传评估的影响。

Genet Sel Evol. 2022 Jun 28;54(1):48. doi: 10.1186/s12711-022-00727-5.

Correcting for base-population differences and unknown parent groups in single-step genomic predictions of Norwegian Red cattle.校正挪威红牛单步基因组预测中基础群体差异和未知父群。

J Anim Sci. 2022 Sep 1;100(9). doi: 10.1093/jas/skac227.

Single-step genomic BLUP with genetic groups and automatic adjustment for allele coding.单步基因组最佳线性无偏预测，具有遗传群组和等位基因编码的自动调整。

Genet Sel Evol. 2022 Jun 2;54(1):38. doi: 10.1186/s12711-022-00721-x.

Application of Bayesian genomic prediction methods to genome-wide association analyses.贝叶斯基因组预测方法在全基因组关联分析中的应用。

Genet Sel Evol. 2022 May 13;54(1):31. doi: 10.1186/s12711-022-00724-8.

本文引用的文献

A fast and efficient Gibbs sampler for BayesB in whole-genome analyses.全基因组分析中用于BayesB的一种快速高效的吉布斯采样器。

Genet Sel Evol. 2015 Oct 14;47:80. doi: 10.1186/s12711-015-0157-x.

A class of Bayesian methods to combine large numbers of genotyped and non-genotyped animals for whole-genome analyses.一类用于全基因组分析的贝叶斯方法，可结合大量基因分型和未基因分型的动物。

Genet Sel Evol. 2014 Sep 22;46(1):50. doi: 10.1186/1297-9686-46-50.

Selection on selected records.对选定记录进行选择。

Genet Sel Evol (1983). 1983;15(1):91-8. doi: 10.1186/1297-9686-15-1-91.

Single-step methods for genomic evaluation in pigs.猪基因组评估的单步法。

Animal. 2012 Oct;6(10):1565-71. doi: 10.1017/S1751731112000742. Epub 2012 Apr 5.

Bias in genomic predictions for populations under selection.选择作用下群体基因组预测中的偏差。

Genet Res (Camb). 2011 Oct;93(5):357-66. doi: 10.1017/S001667231100022X. Epub 2011 Jul 18.

Allele coding in genomic evaluation.基因组评估中的等位基因编码

Genet Sel Evol. 2011 Jun 26;43(1):25. doi: 10.1186/1297-9686-43-25.

Hot topic: a unified approach to utilize phenotypic, full pedigree, and genomic information for genetic evaluation of Holstein final score.热门话题：利用表型、全谱系和基因组信息统一方法对荷斯坦综合评分进行遗传评估。

J Dairy Sci. 2010 Feb;93(2):743-52. doi: 10.3168/jds.2009-2730.

Genomic prediction when some animals are not genotyped.当有些动物未进行基因型检测时的基因组预测。

Genet Sel Evol. 2010 Jan 27;42(1):2. doi: 10.1186/1297-9686-42-2.

A relationship matrix including full pedigree and genomic information.一个包含完整谱系和基因组信息的关系矩阵。

J Dairy Sci. 2009 Sep;92(9):4656-63. doi: 10.3168/jds.2009-2061.

Efficient methods to compute genomic predictions.计算基因组预测的有效方法。

J Dairy Sci. 2008 Nov;91(11):4414-23. doi: 10.3168/jds.2007-0980.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验