比较五种方法从全基因组 SNP 标记预测奶牛公牛的基因组育种值。

A comparison of five methods to predict genomic breeding values of dairy bulls from genome-wide SNP markers.

机构信息

The CRC for Innovative Dairy Products, Australia.

出版信息

Genet Sel Evol. 2009 Dec 31;41(1):56. doi: 10.1186/1297-9686-41-56.

DOI:10.1186/1297-9686-41-56

PMID:20043835

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC2814805/

Abstract

BACKGROUND

Genomic selection (GS) uses molecular breeding values (MBV) derived from dense markers across the entire genome for selection of young animals. The accuracy of MBV prediction is important for a successful application of GS. Recently, several methods have been proposed to estimate MBV. Initial simulation studies have shown that these methods can accurately predict MBV. In this study we compared the accuracies and possible bias of five different regression methods in an empirical application in dairy cattle.

METHODS

Genotypes of 7,372 SNP and highly accurate EBV of 1,945 dairy bulls were used to predict MBV for protein percentage (PPT) and a profit index (Australian Selection Index, ASI). Marker effects were estimated by least squares regression (FR-LS), Bayesian regression (Bayes-R), random regression best linear unbiased prediction (RR-BLUP), partial least squares regression (PLSR) and nonparametric support vector regression (SVR) in a training set of 1,239 bulls. Accuracy and bias of MBV prediction were calculated from cross-validation of the training set and tested against a test team of 706 young bulls.

RESULTS

For both traits, FR-LS using a subset of SNP was significantly less accurate than all other methods which used all SNP. Accuracies obtained by Bayes-R, RR-BLUP, PLSR and SVR were very similar for ASI (0.39-0.45) and for PPT (0.55-0.61). Overall, SVR gave the highest accuracy.All methods resulted in biased MBV predictions for ASI, for PPT only RR-BLUP and SVR predictions were unbiased. A significant decrease in accuracy of prediction of ASI was seen in young test cohorts of bulls compared to the accuracy derived from cross-validation of the training set. This reduction was not apparent for PPT. Combining MBV predictions with pedigree based predictions gave 1.05 - 1.34 times higher accuracies compared to predictions based on pedigree alone. Some methods have largely different computational requirements, with PLSR and RR-BLUP requiring the least computing time.

CONCLUSIONS

The four methods which use information from all SNP namely RR-BLUP, Bayes-R, PLSR and SVR generate similar accuracies of MBV prediction for genomic selection, and their use in the selection of immediate future generations in dairy cattle will be comparable. The use of FR-LS in genomic selection is not recommended.

摘要

背景

基因组选择（GS）使用整个基因组中密集标记的分子育种值（MBV）对幼畜进行选择。MBV 预测的准确性对于 GS 的成功应用很重要。最近，已经提出了几种估计 MBV 的方法。初步模拟研究表明，这些方法可以准确地预测 MBV。在本研究中，我们在奶牛的实证应用中比较了五种不同回归方法的准确性和可能的偏差。

方法

使用 7372 个 SNP 的基因型和 1945 头公牛的高度准确 EBV，预测蛋白质百分比（PPT）和利润指数（澳大利亚选择指数，ASI）的 MBV。在 1239 头公牛的训练集中，通过最小二乘回归（FR-LS）、贝叶斯回归（Bayes-R）、随机回归最佳线性无偏预测（RR-BLUP）、偏最小二乘回归（PLSR）和非参数支持向量回归（SVR）估计标记效应。从训练集的交叉验证中计算 MBV 预测的准确性和偏差，并在 706 头年轻公牛的测试组中进行测试。

结果

对于两种性状，使用 SNP 子集的 FR-LS 明显不如使用所有 SNP 的所有其他方法准确。对于 ASI（0.39-0.45）和 PPT（0.55-0.61），Bayes-R、RR-BLUP、PLSR 和 SVR 获得的准确性非常相似。总体而言，SVR 的准确性最高。所有方法对 ASI 的 MBV 预测均存在偏差，而仅 RR-BLUP 和 SVR 预测对 PPT 无偏差。与从训练集交叉验证中得出的准确性相比，年轻公牛测试组中 ASI 的预测准确性显著降低。对于 PPT，这种降低并不明显。与仅基于系谱的预测相比，结合 MBV 预测和系谱预测可将准确性提高 1.05-1.34 倍。某些方法具有很大不同的计算要求，其中 PLSR 和 RR-BLUP 需要最少的计算时间。

结论

使用所有 SNP 信息的四种方法，即 RR-BLUP、Bayes-R、PLSR 和 SVR，对基因组选择的 MBV 预测具有相似的准确性，并且它们在奶牛的下一代选择中的使用将是可比的。不建议在基因组选择中使用 FR-LS。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f560/2814805/c394a89fc3b1/1297-9686-41-56-1.jpg

相似文献

A comparison of five methods to predict genomic breeding values of dairy bulls from genome-wide SNP markers.比较五种方法从全基因组 SNP 标记预测奶牛公牛的基因组育种值。

Genet Sel Evol. 2009 Dec 31;41(1):56. doi: 10.1186/1297-9686-41-56.

Comparison of methods for the implementation of genome-assisted evaluation of Spanish dairy cattle.比较基因组辅助评估西班牙奶牛的方法。

J Dairy Sci. 2013 Jan;96(1):625-34. doi: 10.3168/jds.2012-5631. Epub 2012 Oct 24.

Accuracy of direct genomic values in Holstein bulls and cows using subsets of SNP markers.使用 SNP 标记子集评估荷斯坦公牛和母牛的直接基因组估计值的准确性。

Genet Sel Evol. 2010 Oct 16;42(1):37. doi: 10.1186/1297-9686-42-37.

Genomic prediction of breeding values using previously estimated SNP variances.利用先前估计的单核苷酸多态性（SNP）方差进行育种值的基因组预测。

Genet Sel Evol. 2014 Sep 25;46(1):52. doi: 10.1186/s12711-014-0052-x.

Development of genomic predictions for Angus cattle in Brazil incorporating genotypes from related American sires.发展巴西安格斯牛的基因组预测，纳入相关美国父本的基因型。

J Anim Sci. 2022 Feb 1;100(2). doi: 10.1093/jas/skac009.

The accuracy of Genomic Selection in Norwegian red cattle assessed by cross-validation.基因组选择在挪威红牛中的准确性通过交叉验证评估。

Genetics. 2009 Nov;183(3):1119-26. doi: 10.1534/genetics.109.107391. Epub 2009 Aug 24.

Accuracy of genomic predictions in Bos indicus (Nellore) cattle.印度野牛（内洛尔牛）基因组预测的准确性。

Genet Sel Evol. 2014 Feb 27;46(1):17. doi: 10.1186/1297-9686-46-17.

Genomic prediction of milk-production traits and somatic cell score using single-step genomic best linear unbiased predictor with random regression test-day model in Thai dairy cattle.利用泰国奶牛随机回归测试日模型的一步基因组最佳线性无偏预测法对产奶性状和体细胞评分进行基因组预测。

J Dairy Sci. 2021 Dec;104(12):12713-12723. doi: 10.3168/jds.2021-20263. Epub 2021 Sep 16.

Application of Bayesian least absolute shrinkage and selection operator (LASSO) and BayesCπ methods for genomic selection in French Holstein and Montbéliarde breeds.贝叶斯最小绝对收缩和选择算子（LASSO）和 BayesCπ 方法在法国荷斯坦和蒙贝利亚德品种基因组选择中的应用。

J Dairy Sci. 2013 Jan;96(1):575-91. doi: 10.3168/jds.2011-5225. Epub 2012 Nov 3.

Controlling bias in genomic breeding values for young genotyped bulls.控制年轻基因型公牛基因组育种值中的偏差。

J Dairy Sci. 2019 Nov;102(11):9956-9970. doi: 10.3168/jds.2019-16789. Epub 2019 Sep 5.

引用本文的文献

Genomic prediction with NetGP based on gene network and multi-omics data in plants.基于植物基因网络和多组学数据的NetGP基因组预测

Plant Biotechnol J. 2025 Apr;23(4):1190-1201. doi: 10.1111/pbi.14577. Epub 2025 Feb 14.

Exploiting historical agronomic data to develop genomic prediction strategies for early clonal selection in the Louisiana sugarcane variety development program.利用历史农艺数据为路易斯安那甘蔗品种开发计划中的早期克隆选择制定基因组预测策略。

Plant Genome. 2025 Mar;18(1):e20545. doi: 10.1002/tpg2.20545.

An investigation of machine learning methods applied to genomic prediction in yellow-feathered broilers.应用于黄羽肉鸡基因组预测的机器学习方法研究。

Poult Sci. 2025 Jan;104(1):104489. doi: 10.1016/j.psj.2024.104489. Epub 2024 Nov 1.

Hybrid Prediction in Horticulture Crop Breeding: Progress and Challenges.园艺作物育种中的杂交预测：进展与挑战

Plants (Basel). 2024 Oct 4;13(19):2790. doi: 10.3390/plants13192790.

Supervised Machine Learning Techniques for Breeding Value Prediction in Horses: An Example Using Gait Visual Scores.用于马匹育种值预测的监督式机器学习技术：以步态视觉评分为例

Animals (Basel). 2024 Sep 20;14(18):2723. doi: 10.3390/ani14182723.

Combining phenotypic and genomic data to improve prediction of binary traits.结合表型和基因组数据以改善二元性状的预测。

J Appl Stat. 2023 May 16;51(8):1497-1523. doi: 10.1080/02664763.2023.2208773. eCollection 2024.

Oracle selection provides insight into how far off practice is from Utopia in plant breeding.选择合适的品种有助于洞察植物育种实践与理想状态之间的差距。

Front Plant Sci. 2023 Jul 21;14:1218665. doi: 10.3389/fpls.2023.1218665. eCollection 2023.

Association of Phenotypic Markers of Heat Tolerance with Australian Genomic Estimated Breeding Values and Dairy Cattle Selection Indices.耐热性表型标记与澳大利亚基因组估计育种值及奶牛选择指数的关联

Animals (Basel). 2023 Jul 10;13(14):2259. doi: 10.3390/ani13142259.

Dimensionality of genomic information and its impact on genome-wide associations and variant selection for genomic prediction: a simulation study.基因组信息的维度及其对全基因组关联和基因组预测中变异选择的影响：一项模拟研究。

Genet Sel Evol. 2023 Jul 17;55(1):49. doi: 10.1186/s12711-023-00823-0.

(Quasi) multitask support vector regression with heuristic hyperparameter optimization for whole-genome prediction of complex traits: a case study with carcass traits in broilers.基于启发式超参数优化的（准）多任务支持向量回归在复杂性状全基因组预测中的应用：以肉鸡胴体性状为例的研究

G3 (Bethesda). 2023 Aug 9;13(8). doi: 10.1093/g3journal/jkad109.

本文引用的文献

Distribution and location of genetic effects for dairy traits.奶牛性状遗传效应的分布与定位

J Dairy Sci. 2009 Jun;92(6):2931-46. doi: 10.3168/jds.2008-1762.

Genome-wide survey of SNP variation uncovers the genetic structure of cattle breeds.全基因组单核苷酸多态性变异调查揭示了牛品种的遗传结构。

Science. 2009 Apr 24;324(5926):528-32. doi: 10.1126/science.1167936.

Factors affecting accuracy from genomic selection in populations derived from multiple inbred lines: a Barley case study.影响多个自交系衍生群体基因组选择准确性的因素：大麦案例研究

Genetics. 2009 May;182(1):355-64. doi: 10.1534/genetics.108.098277. Epub 2009 Mar 18.

Genomic selection using low-density marker panels.使用低密度标记面板的基因组选择。

Genetics. 2009 May;182(1):343-53. doi: 10.1534/genetics.108.100289. Epub 2009 Mar 18.

Reducing dimensionality for prediction of genome-wide breeding values.降低维度以预测全基因组育种值。

Genet Sel Evol. 2009 Mar 18;41(1):29. doi: 10.1186/1297-9686-41-29.

Predicting quantitative traits with regression models for dense molecular markers and pedigree.使用针对密集分子标记和系谱的回归模型预测数量性状。

Genetics. 2009 May;182(1):375-85. doi: 10.1534/genetics.109.101501. Epub 2009 Mar 16.

Genomic breeding value estimation using nonparametric additive regression models.使用非参数加性回归模型进行基因组育种值估计。

Genet Sel Evol. 2009 Jan 27;41(1):20. doi: 10.1186/1297-9686-41-20.

Genome-assisted prediction of a quantitative trait measured in parents and progeny: application to food conversion rate in chickens.利用基因组辅助预测亲本和后代中测量的数量性状：在鸡的饲料转化率中的应用。

Genet Sel Evol. 2009 Jan 5;41(1):3. doi: 10.1186/1297-9686-41-3.

Reproducing kernel Hilbert spaces regression: a general framework for genetic evaluation.再生核希尔伯特空间回归：遗传评估的通用框架

J Anim Sci. 2009 Jun;87(6):1883-7. doi: 10.2527/jas.2008-1259. Epub 2009 Feb 11.

Invited review: Genomic selection in dairy cattle: progress and challenges.特邀综述：奶牛的基因组选择：进展与挑战

J Dairy Sci. 2009 Feb;92(2):433-43. doi: 10.3168/jds.2008-1646.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验

比较五种方法从全基因组 SNP 标记预测奶牛公牛的基因组育种值。

A comparison of five methods to predict genomic breeding values of dairy bulls from genome-wide SNP markers.

机构信息

出版信息

BACKGROUND

METHODS

RESULTS

CONCLUSIONS

背景

方法

结果

结论

相似文献

引用本文的文献

本文引用的文献

文献检索

文件翻译

深度研究

Suppr 超能文献

相似文献

引用本文的文献

本文引用的文献