基因组评估中的等位基因编码

Allele coding in genomic evaluation.

机构信息

Biotechnology and Food Research, MTT Agrifood Research Finland, FI-31600 Jokioinen, Finland.

出版信息

Genet Sel Evol. 2011 Jun 26;43(1):25. doi: 10.1186/1297-9686-43-25.

DOI:10.1186/1297-9686-43-25

PMID:21703021

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC3154140/

Abstract

BACKGROUND

Genomic data are used in animal breeding to assist genetic evaluation. Several models to estimate genomic breeding values have been studied. In general, two approaches have been used. One approach estimates the marker effects first and then, genomic breeding values are obtained by summing marker effects. In the second approach, genomic breeding values are estimated directly using an equivalent model with a genomic relationship matrix. Allele coding is the method chosen to assign values to the regression coefficients in the statistical model. A common allele coding is zero for the homozygous genotype of the first allele, one for the heterozygote, and two for the homozygous genotype for the other allele. Another common allele coding changes these regression coefficients by subtracting a value from each marker such that the mean of regression coefficients is zero within each marker. We call this centered allele coding. This study considered effects of different allele coding methods on inference. Both marker-based and equivalent models were considered, and restricted maximum likelihood and Bayesian methods were used in inference.

RESULTS

Theoretical derivations showed that parameter estimates and estimated marker effects in marker-based models are the same irrespective of the allele coding, provided that the model has a fixed general mean. For the equivalent models, the same results hold, even though different allele coding methods lead to different genomic relationship matrices. Calculated genomic breeding values are independent of allele coding when the estimate of the general mean is included into the values. Reliabilities of estimated genomic breeding values calculated using elements of the inverse of the coefficient matrix depend on the allele coding because different allele coding methods imply different models. Finally, allele coding affects the mixing of Markov chain Monte Carlo algorithms, with the centered coding being the best.

CONCLUSIONS

Different allele coding methods lead to the same inference in the marker-based and equivalent models when a fixed general mean is included in the model. However, reliabilities of genomic breeding values are affected by the allele coding method used. The centered coding has some numerical advantages when Markov chain Monte Carlo methods are used.

摘要

背景

基因组数据用于动物育种以辅助遗传评估。已经研究了几种估计基因组育种值的模型。一般来说，使用了两种方法。一种方法是首先估计标记效应，然后通过对标记效应求和获得基因组育种值。在第二种方法中，使用具有基因组关系矩阵的等效模型直接估计基因组育种值。等位基因编码是为统计模型中的回归系数赋值所选择的方法。一种常见的等位基因编码是，第一个等位基因的纯合基因型为零，杂合子为一，另一个等位基因的纯合基因型为二。另一种常见的等位基因编码是通过从每个标记中减去一个值来改变这些回归系数，使得每个标记内回归系数的均值为零。我们称这种为中心化等位基因编码。本研究考虑了不同等位基因编码方法对推断的影响。同时考虑了基于标记的模型和等效模型，并在推断中使用了限制最大似然法和贝叶斯方法。

结果

理论推导表明，只要模型具有固定的总体均值，基于标记的模型中的参数估计和估计的标记效应与等位基因编码无关。对于等效模型，即使不同的等位基因编码方法导致不同的基因组关系矩阵，结果也是相同的。当将总体均值的估计值包含在基因组育种值中时，计算出的基因组育种值与等位基因编码无关。使用系数矩阵逆矩阵的元素计算的估计基因组育种值的可靠性取决于等位基因编码，因为不同的等位基因编码方法意味着不同的模型。最后，等位基因编码会影响马尔可夫链蒙特卡罗算法的混合，其中中心化编码是最好的。

结论

当模型中包含固定的总体均值时，不同的等位基因编码方法在基于标记的模型和等效模型中会导致相同的推断。然而，基因组育种值的可靠性受所用等位基因编码方法的影响。当使用马尔可夫链蒙特卡罗方法时，中心化编码具有一些数值优势。

相似文献

Allele coding in genomic evaluation.

Genet Sel Evol. 2011 Jun 26;43(1):25. doi: 10.1186/1297-9686-43-25.

Variable selection models for genomic selection using whole-genome sequence data and singular value decomposition.

Genet Sel Evol. 2017 Dec 27;49(1):94. doi: 10.1186/s12711-017-0369-3.

Comparison of genomic predictions using genomic relationship matrices built with different weighting factors to account for locus-specific variances.

J Dairy Sci. 2014 Oct;97(10):6547-59. doi: 10.3168/jds.2014-8210. Epub 2014 Aug 14.

Using the unified relationship matrix adjusted by breed-wise allele frequencies in genomic evaluation of a multibreed population.

J Dairy Sci. 2014 Feb;97(2):1117-27. doi: 10.3168/jds.2013-7167. Epub 2013 Dec 15.

Marker genotyping error effects on genomic predictions under different genetic architectures.

Mol Genet Genomics. 2021 Jan;296(1):79-89. doi: 10.1007/s00438-020-01728-z. Epub 2020 Sep 29.

Genomic Model with Correlation Between Additive and Dominance Effects.

Genetics. 2018 Jul;209(3):711-723. doi: 10.1534/genetics.118.301015. Epub 2018 May 9.

Technical note: Automatic scaling in single-step genomic BLUP.

J Dairy Sci. 2021 Feb;104(2):2027-2031. doi: 10.3168/jds.2020-18969. Epub 2020 Dec 11.

Single-step genomic BLUP with genetic groups and automatic adjustment for allele coding.

Genet Sel Evol. 2022 Jun 2;54(1):38. doi: 10.1186/s12711-022-00721-x.

Accuracy of genomic breeding values for meat tenderness in Polled Nellore cattle.

J Anim Sci. 2016 Jul;94(7):2752-60. doi: 10.2527/jas.2016-0279.

Weighting genomic and genealogical information for genetic parameter estimation and breeding value prediction in tropical beef cattle.

J Anim Sci. 2018 Mar 6;96(2):612-617. doi: 10.1093/jas/skx027.

引用本文的文献

Integrative multi-environmental genomic prediction in apple.

Hortic Res. 2024 Nov 20;12(2):uhae319. doi: 10.1093/hr/uhae319. eCollection 2025 Feb.

Discovering non-additive heritability using additive GWAS summary statistics.

Elife. 2024 Jun 24;13:e90459. doi: 10.7554/eLife.90459.

Complex traits and candidate genes: estimation of genetic variance components across multiple genetic architectures.

G3 (Bethesda). 2023 Aug 30;13(9). doi: 10.1093/g3journal/jkad148.

Reliabilities of estimated breeding values in models with metafounders.

Genet Sel Evol. 2023 Jan 23;55(1):6. doi: 10.1186/s12711-023-00778-2.

Theoretical accuracy for indirect predictions based on SNP effects from single-step GBLUP.

Genet Sel Evol. 2022 Sep 27;54(1):66. doi: 10.1186/s12711-022-00752-4.

Single-step genomic BLUP with genetic groups and automatic adjustment for allele coding.

Genet Sel Evol. 2022 Jun 2;54(1):38. doi: 10.1186/s12711-022-00721-x.

MetaGS: an accurate method to impute and combine SNP effects across populations using summary statistics.

Genet Sel Evol. 2022 Jun 2;54(1):37. doi: 10.1186/s12711-022-00725-7.

The long-term effects of genomic selection: 1. Response to selection, additive genetic variance, and genetic architecture.

Genet Sel Evol. 2022 Mar 7;54(1):19. doi: 10.1186/s12711-022-00709-7.

Phantom Epistasis in Genomic Selection: On the Predictive Ability of Epistatic Models.

G3 (Bethesda). 2020 Sep 2;10(9):3137-3145. doi: 10.1534/g3.120.401300.

A Review of Genomic Models for the Analysis of Livestock Crossbred Data.

Front Genet. 2020 Jun 26;11:568. doi: 10.3389/fgene.2020.00568. eCollection 2020.

本文引用的文献

Reconciling the analysis of IBD and IBS in complex trait studies.

Nat Rev Genet. 2010 Nov;11(11):800-5. doi: 10.1038/nrg2865. Epub 2010 Sep 28.

Hot topic: a unified approach to utilize phenotypic, full pedigree, and genomic information for genetic evaluation of Holstein final score.

J Dairy Sci. 2010 Feb;93(2):743-52. doi: 10.3168/jds.2009-2730.

Genomic prediction when some animals are not genotyped.

Genet Sel Evol. 2010 Jan 27;42(1):2. doi: 10.1186/1297-9686-42-2.

A relationship matrix including full pedigree and genomic information.

J Dairy Sci. 2009 Sep;92(9):4656-63. doi: 10.3168/jds.2009-2061.

Technical note: Derivation of equivalent computing algorithms for genomic predictions and reliabilities of animal merit.

J Dairy Sci. 2009 Jun;92(6):2971-5. doi: 10.3168/jds.2008-1929.

Comparison of analyses of the QTLMAS XII common dataset. II: genome-wide association and fine mapping.

BMC Proc. 2009 Feb 23;3 Suppl 1(Suppl 1):S2. doi: 10.1186/1753-6561-3-s1-s2.

Increased accuracy of artificial selection by using the realized relationship matrix.

Genet Res (Camb). 2009 Feb;91(1):47-60. doi: 10.1017/S0016672308009981.

Efficient methods to compute genomic predictions.

J Dairy Sci. 2008 Nov;91(11):4414-23. doi: 10.3168/jds.2007-0980.

Genomic selection: prediction of accuracy and maximisation of long term response.

Genetica. 2009 Jun;136(2):245-57. doi: 10.1007/s10709-008-9308-0. Epub 2008 Aug 14.

Prediction of total genetic value using genome-wide dense marker maps.

Genetics. 2001 Apr;157(4):1819-29. doi: 10.1093/genetics/157.4.1819.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

基因组评估中的等位基因编码

Allele coding in genomic evaluation.

机构信息

出版信息

BACKGROUND

RESULTS

CONCLUSIONS

背景

结果

结论

相似文献

引用本文的文献

本文引用的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献