Benchmarking machine learning and parametric methods for genomic prediction of feed efficiency-related traits in Nellore cattle.

Author affiliations

School of Agricultural and Veterinarian Sciences, São Paulo State University (UNESP), Jaboticabal, SP, 14884-900, Brazil.

Department of Animal and Dairy Sciences, University of Wisconsin, Madison, WI, 53706, USA.

Publication information

Sci Rep. 2024 Mar 17;14(1):6404. doi: 10.1038/s41598-024-57234-4.

DOI:10.1038/s41598-024-57234-4

PMID:38493207

Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC10944497/

Abstract

Genomic selection (GS) offers a promising opportunity to select animals that use consumed energy more efficiently for maintenance and growth, with consequences for profitability and environmental sustainability. Here, we compared the prediction accuracy of multi-layer neural networks (MLNN) and support vector regression (SVR) against single-trait genomic best linear unbiased prediction (STGBLUP), multi-trait genomic best linear unbiased prediction (MTGBLUP), and Bayesian regression (BayesA, BayesB, BayesC, BRR, and BLasso) for feed efficiency (FE) traits. FE-related traits were measured in 1156 Nellore cattle from an experimental breeding program, genotyped for ~300 K markers after quality control. Prediction accuracy (Acc) was evaluated by forward validation, splitting the dataset by birth year and using phenotypes adjusted for fixed effects and covariates as pseudo-phenotypes. For MLNN and SVR, the training population was randomly split into five folds to select the best hyperparameters. The machine learning methods (MLNN and SVR) and MTGBLUP outperformed STGBLUP and the Bayesian regression approaches, increasing Acc by approximately 8.9%, 14.6%, and 13.7% for MLNN, SVR, and MTGBLUP, respectively. Acc differed only slightly between SVR and MTGBLUP, ranging from 0.62 to 0.69 and from 0.62 to 0.68, respectively, and both models were empirically unbiased (0.97 and 1.09). Our results indicate that SVR and MTGBLUP were more accurate in predicting FE-related traits than Bayesian regression and STGBLUP, and they appear competitive for GS of complex phenotypes with varying degrees of inheritance.
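The validation design described in the abstract (train on older animals, validate on younger ones, tune hyperparameters by five-fold cross-validation inside the training set only) can be illustrated with a minimal sketch. Everything below is an assumption for illustration: the data are simulated, the birth-year cutoff and penalty grid are arbitrary, ridge regression on markers stands in for the models actually benchmarked, and Acc is taken here as the Pearson correlation between pseudo-phenotypes and predictions, with the slope of pseudo-phenotype on prediction as a bias indicator. The abstract does not give the paper's exact formulas, so the study's definitions may differ.

```python
import numpy as np

def forward_split(birth_year, cutoff_year):
    """Animals born before the cutoff train; those born on/after it validate."""
    birth_year = np.asarray(birth_year)
    train = np.where(birth_year < cutoff_year)[0]
    valid = np.where(birth_year >= cutoff_year)[0]
    return train, valid

def ridge_fit_predict(X_train, y_train, X_test, lam):
    """Ridge regression on markers (a stand-in model, not the paper's)."""
    mu = y_train.mean()
    col_means = X_train.mean(axis=0)
    Xc = X_train - col_means
    # Dual form: solve an n x n system, cheaper when markers >> animals.
    K = Xc @ Xc.T
    alpha = np.linalg.solve(K + lam * np.eye(K.shape[0]), y_train - mu)
    return mu + (X_test - col_means) @ (Xc.T @ alpha)

def choose_lambda(X, y, grid, n_folds=5, seed=0):
    """Pick the penalty by random k-fold CV within the training set."""
    rng = np.random.default_rng(seed)
    folds = np.array_split(rng.permutation(len(y)), n_folds)
    scores = []
    for lam in grid:
        errs = []
        for k in range(n_folds):
            test = folds[k]
            train = np.concatenate([folds[j] for j in range(n_folds) if j != k])
            pred = ridge_fit_predict(X[train], y[train], X[test], lam)
            errs.append(np.mean((y[test] - pred) ** 2))
        scores.append(np.mean(errs))
    return grid[int(np.argmin(scores))]

def accuracy_and_slope(y_obs, y_pred):
    """Pearson correlation, and slope of observed on predicted (1 = unbiased)."""
    acc = np.corrcoef(y_obs, y_pred)[0, 1]
    slope = np.cov(y_obs, y_pred)[0, 1] / np.var(y_pred, ddof=1)
    return acc, slope

# Simulated data (hypothetical sizes, far smaller than the study's).
rng = np.random.default_rng(42)
n_animals, n_markers = 300, 1000
X = rng.integers(0, 3, size=(n_animals, n_markers)).astype(float)  # 0/1/2 genotypes
marker_effects = rng.normal(0.0, 0.05, n_markers)
y = X @ marker_effects + rng.normal(0.0, 1.0, n_animals)  # pseudo-phenotypes
birth_year = rng.integers(2010, 2020, n_animals)

grid = [1.0, 10.0, 100.0, 1000.0]
train, valid = forward_split(birth_year, cutoff_year=2018)
lam = choose_lambda(X[train], y[train], grid)
pred = ridge_fit_predict(X[train], y[train], X[valid], lam)
acc, slope = accuracy_and_slope(y[valid], pred)
print(f"lambda={lam}, Acc={acc:.2f}, slope={slope:.2f}")
```

Because the penalty is chosen using training animals only, the validation animals never influence model selection, which is the point of combining a forward split with an inner cross-validation.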

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/49dc/10944497/d063d54d3f24/41598_2024_57234_Fig1_HTML.jpg
