School of Agricultural and Veterinarian Sciences, São Paulo State University (UNESP), Jaboticabal, SP, 14884-900, Brazil.
Department of Animal and Dairy Sciences, University of Wisconsin, Madison, WI, 53706, USA.
Sci Rep. 2024 Mar 17;14(1):6404. doi: 10.1038/s41598-024-57234-4.
Genomic selection (GS) offers a promising opportunity for selecting more efficient animals to use consumed energy for maintenance and growth functions, impacting profitability and environmental sustainability. Here, we compared the prediction accuracy of multi-layer neural network (MLNN) and support vector regression (SVR) against single-trait (STGBLUP), multi-trait genomic best linear unbiased prediction (MTGBLUP), and Bayesian regression (BayesA, BayesB, BayesC, BRR, and BLasso) for feed efficiency (FE) traits. FE-related traits were measured in 1156 Nellore cattle from an experimental breeding program genotyped for ~ 300 K markers after quality control. Prediction accuracy (Acc) was evaluated using a forward validation splitting the dataset based on birth year, considering the phenotypes adjusted for the fixed effects and covariates as pseudo-phenotypes. The MLNN and SVR approaches were trained by randomly splitting the training population into fivefold to select the best hyperparameters. The results show that the machine learning methods (MLNN and SVR) and MTGBLUP outperformed STGBLUP and the Bayesian regression approaches, increasing the Acc by approximately 8.9%, 14.6%, and 13.7% using MLNN, SVR, and MTGBLUP, respectively. Acc for SVR and MTGBLUP were slightly different, ranging from 0.62 to 0.69 and 0.62 to 0.68, respectively, with empirically unbiased for both models (0.97 and 1.09). Our results indicated that SVR and MTGBLUBP approaches were more accurate in predicting FE-related traits than Bayesian regression and STGBLUP and seemed competitive for GS of complex phenotypes with various degrees of inheritance.
基因组选择 (GS) 为选择更有效的动物提供了一个有前途的机会,使它们能够将消耗的能量用于维持和生长功能,从而影响盈利能力和环境可持续性。在这里,我们比较了多层神经网络 (MLNN) 和支持向量回归 (SVR) 与单一性状 (STGBLUP)、多性状基因组最佳线性无偏预测 (MTGBLUP) 和贝叶斯回归 (BayesA、BayesB、BayesC、BRR 和 BLasso) 在饲料效率 (FE) 性状上的预测准确性。FE 相关性状在经过质量控制后,对 1156 头Nellore 牛进行了实验性育种计划的测量,这些牛被用于约 300 K 个标记的基因型。使用向前验证根据出生年份分割数据集来评估预测准确性 (Acc),考虑到固定效应和协变量调整后的表型作为伪表型。通过随机将训练人群分为五折来训练 MLNN 和 SVR 方法,以选择最佳超参数。结果表明,机器学习方法 (MLNN 和 SVR) 和 MTGBLUP 优于 STGBLUP 和贝叶斯回归方法,使用 MLNN、SVR 和 MTGBLUP 分别将 Acc 提高了约 8.9%、14.6%和 13.7%。SVR 和 MTGBLUP 的 Acc 略有不同,范围分别为 0.62 到 0.69 和 0.62 到 0.68,两个模型的经验无偏性都为 0.97 和 1.09。我们的结果表明,SVR 和 MTGBLUBP 方法在预测 FE 相关性状方面比贝叶斯回归和 STGBLUP 更准确,并且在具有各种遗传程度的复杂表型的 GS 方面似乎具有竞争力。