Deregressing estimated breeding values and weighting information for genomic regression analyses.

Affiliation

Department of Animal Science, Iowa State University, Ames, IA 50011, USA.

Publication information

Genet Sel Evol. 2009 Dec 31;41(1):55. doi: 10.1186/1297-9686-41-55.

DOI:10.1186/1297-9686-41-55

PMID:20043827

Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC2817680/

Abstract

BACKGROUND

Genomic prediction of breeding values involves a so-called training analysis that predicts the influence of small genomic regions by regression of observed information on marker genotypes for a given population of individuals. Available observations may take the form of individual phenotypes, repeated observations, records on close family members such as progeny, estimated breeding values (EBV) or their deregressed counterparts from genetic evaluations. The literature indicates that researchers are inconsistent in their approach to using EBV or deregressed data, and as to using the appropriate methods for weighting some data sources to account for heterogeneous variance.

METHODS

A logical approach to using information for genomic prediction is introduced, which demonstrates the appropriate weights for analyzing observations with heterogeneous variance and explains the need for and the manner in which EBV should have parent average effects removed, be deregressed and weighted.

RESULTS

An appropriate deregression for genomic regression analyses is $\mathrm{EBV}/r^2$, where EBV excludes parent information and $r^2$ is the reliability of that EBV. The appropriate weights for deregressed breeding values are neither the reliability nor the prediction error variance, two alternatives that have been used in published studies, but the ratio $(1 - h^2)/[(c + (1 - r^2)/r^2)h^2]$, where $h^2$ is the heritability of the trait and $c > 0$ is the fraction of genetic variance not explained by markers.
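The two expressions in the Results can be sketched in a few lines of Python. This is only an illustration of the arithmetic stated above, not the authors' software: the function names `deregress` and `debv_weight` are hypothetical, the input EBV is assumed to already have parent-average information removed, and the numeric values in the usage lines are made up for demonstration.

```python
def deregress(ebv_no_pa: float, r2: float) -> float:
    """Return EBV / r2 for an EBV that already excludes parent information.

    r2 is the reliability of that EBV (assumed to lie in (0, 1]).
    """
    if not 0.0 < r2 <= 1.0:
        raise ValueError("reliability r2 must lie in (0, 1]")
    return ebv_no_pa / r2


def debv_weight(r2: float, h2: float, c: float) -> float:
    """Return the weight (1 - h2) / [(c + (1 - r2) / r2) * h2].

    h2 is the heritability and c > 0 is the fraction of genetic
    variance not explained by markers.
    """
    if not 0.0 < r2 <= 1.0:
        raise ValueError("reliability r2 must lie in (0, 1]")
    if not 0.0 < h2 < 1.0:
        raise ValueError("heritability h2 must lie in (0, 1)")
    if c <= 0.0:
        raise ValueError("c must be positive")
    return (1.0 - h2) / ((c + (1.0 - r2) / r2) * h2)


if __name__ == "__main__":
    # Illustrative values only (not taken from the paper).
    print(deregress(ebv_no_pa=1.5, r2=0.5))
    print(debv_weight(r2=0.5, h2=0.25, c=0.5))
```

For fixed $h^2$ and $c$, the weight rises with reliability but stays bounded by $(1 - h^2)/(c\,h^2)$ as $r^2$ approaches 1, which is one way to see why it differs from using the reliability itself as the weight.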

CONCLUSIONS

Phenotypic information on some individuals and deregressed data on others can be combined in genomic analyses using appropriate weighting.
