• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

应用于动物育种基因组预测的机器学习模型综述。

A review of machine learning models applied to genomic prediction in animal breeding.

作者信息

Chafai Narjice, Hayah Ichrak, Houaga Isidore, Badaoui Bouabid

机构信息

Laboratory of Biodiversity, Ecology, and Genome, Department of Biology, Faculty of Sciences, Mohammed V University in Rabat, Rabat, Morocco.

Centre for Tropical Livestock Genetics and Health, The Roslin Institute, Royal (Dick) School of Veterinary Medicine, The University of Edinburgh, Edinburgh, United Kingdom.

出版信息

Front Genet. 2023 Sep 6;14:1150596. doi: 10.3389/fgene.2023.1150596. eCollection 2023.

DOI:10.3389/fgene.2023.1150596
PMID:37745853
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC10516561/
Abstract

The advent of modern genotyping technologies has revolutionized genomic selection in animal breeding. Large marker datasets have shown several drawbacks for traditional genomic prediction methods in terms of flexibility, accuracy, and computational power. Recently, the application of machine learning models in animal breeding has gained a lot of interest due to their tremendous flexibility and their ability to capture patterns in large noisy datasets. Here, we present a general overview of a handful of machine learning algorithms and their application in genomic prediction to provide a meta-picture of their performance in genomic estimated breeding values estimation, genotype imputation, and feature selection. Finally, we discuss a potential adoption of machine learning models in genomic prediction in developing countries. The results of the reviewed studies showed that machine learning models have indeed performed well in fitting large noisy data sets and modeling minor nonadditive effects in some of the studies. However, sometimes conventional methods outperformed machine learning models, which confirms that there's no universal method for genomic prediction. In summary, machine learning models have great potential for extracting patterns from single nucleotide polymorphism datasets. Nonetheless, the level of their adoption in animal breeding is still low due to data limitations, complex genetic interactions, a lack of standardization and reproducibility, and the lack of interpretability of machine learning models when trained with biological data. Consequently, there is no remarkable outperformance of machine learning methods compared to traditional methods in genomic prediction. Therefore, more research should be conducted to discover new insights that could enhance livestock breeding programs.

摘要

现代基因分型技术的出现彻底改变了动物育种中的基因组选择。大型标记数据集在灵活性、准确性和计算能力方面显示出传统基因组预测方法存在的几个缺点。最近,机器学习模型在动物育种中的应用因其极大的灵活性以及在大型噪声数据集中捕捉模式的能力而备受关注。在此,我们对一些机器学习算法及其在基因组预测中的应用进行了概述,以呈现它们在基因组估计育种值估计、基因型填充和特征选择方面的性能全貌。最后,我们讨论了机器学习模型在发展中国家基因组预测中的潜在应用。综述研究结果表明,机器学习模型在拟合大型噪声数据集以及在一些研究中对微小非加性效应建模方面确实表现良好。然而,有时传统方法优于机器学习模型,这证实了基因组预测不存在通用方法。总之,机器学习模型在从单核苷酸多态性数据集中提取模式方面具有巨大潜力。尽管如此,由于数据限制、复杂的基因相互作用、缺乏标准化和可重复性以及使用生物数据训练时机器学习模型缺乏可解释性,它们在动物育种中的应用水平仍然较低。因此,在基因组预测中,机器学习方法与传统方法相比并没有显著的优势。所以,应该进行更多研究以发现能够改进家畜育种计划的新见解。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f017/10516561/bf3bc58932ec/fgene-14-1150596-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f017/10516561/8666c45361ba/fgene-14-1150596-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f017/10516561/d1256bb9b99e/fgene-14-1150596-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f017/10516561/164ecba54cde/fgene-14-1150596-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f017/10516561/bf3bc58932ec/fgene-14-1150596-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f017/10516561/8666c45361ba/fgene-14-1150596-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f017/10516561/d1256bb9b99e/fgene-14-1150596-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f017/10516561/164ecba54cde/fgene-14-1150596-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f017/10516561/bf3bc58932ec/fgene-14-1150596-g004.jpg

相似文献

1
A review of machine learning models applied to genomic prediction in animal breeding.应用于动物育种基因组预测的机器学习模型综述。
Front Genet. 2023 Sep 6;14:1150596. doi: 10.3389/fgene.2023.1150596. eCollection 2023.
2
Accuracies of genomic predictions for disease resistance of striped catfish to Edwardsiella ictaluri using artificial intelligence algorithms.利用人工智能算法预测条纹斑竹鳖对爱德华氏菌病抗性的基因组准确性。
G3 (Bethesda). 2022 Jan 4;12(1). doi: 10.1093/g3journal/jkab361.
3
Genomic prediction using machine learning: a comparison of the performance of regularized regression, ensemble, instance-based and deep learning methods on synthetic and empirical data.基于机器学习的基因组预测:在合成数据和实际数据上,正则化回归、集成、基于实例和深度学习方法的性能比较。
BMC Genomics. 2024 Feb 7;25(1):152. doi: 10.1186/s12864-023-09933-x.
4
Multitrait machine- and deep-learning models for genomic selection using spectral information in a wheat breeding program.利用小麦育种计划中的光谱信息,基于多种性状的机器和深度学习模型进行基因组选择。
Plant Genome. 2021 Nov;14(3):e20119. doi: 10.1002/tpg2.20119. Epub 2021 Sep 5.
5
Using machine learning to realize genetic site screening and genomic prediction of productive traits in pigs.利用机器学习实现猪生产性状的遗传位点筛选和基因组预测。
FASEB J. 2023 Jun;37(6):e22961. doi: 10.1096/fj.202300245R.
6
A review of deep learning applications for genomic selection.深度学习在基因组选择中的应用综述。
BMC Genomics. 2021 Jan 6;22(1):19. doi: 10.1186/s12864-020-07319-x.
7
Genomic Prediction of Breeding Values Using a Subset of SNPs Identified by Three Machine Learning Methods.使用三种机器学习方法鉴定的单核苷酸多态性(SNP)子集对育种值进行基因组预测。
Front Genet. 2018 Jul 4;9:237. doi: 10.3389/fgene.2018.00237. eCollection 2018.
8
Genotyping by sequencing for genomic prediction in a soybean breeding population.大豆育种群体中用于基因组预测的测序基因分型
BMC Genomics. 2014 Aug 29;15(1):740. doi: 10.1186/1471-2164-15-740.
9
Application of ensemble learning to genomic selection in chinese simmental beef cattle.基于集成学习的基因组选择在西门塔尔牛肉用牛中的应用。
J Anim Breed Genet. 2021 May;138(3):291-299. doi: 10.1111/jbg.12514. Epub 2020 Oct 22.
10
Smart breeding driven by big data, artificial intelligence, and integrated genomic-enviromic prediction.由大数据、人工智能和综合基因组-环境预测驱动的智能育种。
Mol Plant. 2022 Nov 7;15(11):1664-1695. doi: 10.1016/j.molp.2022.09.001. Epub 2022 Sep 7.

引用本文的文献

1
Unlocking the African bioeconomy and strengthening biodiversity conservation through genomics and bioinformatics.通过基因组学和生物信息学开启非洲生物经济并加强生物多样性保护。
NPJ Biodivers. 2025 Jul 29;4(1):29. doi: 10.1038/s44185-025-00102-9.
2
The Use of Selected Machine Learning Methods in Dairy Cattle Farming: A Review.机器学习方法在奶牛养殖中的应用:综述
Animals (Basel). 2025 Jul 10;15(14):2033. doi: 10.3390/ani15142033.
3
Out of distribution learning in bioinformatics: advancements and challenges.生物信息学中的分布外学习:进展与挑战

本文引用的文献

1
Using machine learning to improve the accuracy of genomic prediction of reproduction traits in pigs.利用机器学习提高猪繁殖性状基因组预测的准确性。
J Anim Sci Biotechnol. 2022 May 17;13(1):60. doi: 10.1186/s40104-022-00708-0.
2
Indigenous Knowledge of Farmers in Breeding Practice and Selection Criteria of Dairy Cows at Chora and Gechi Districts of Ethiopia: An Implication for Genetic Improvements.埃塞俄比亚乔拉和格奇地区农民在奶牛育种实践和选择标准方面的本土知识:对遗传改良的启示
Vet Med Int. 2022 Feb 18;2022:3763724. doi: 10.1155/2022/3763724. eCollection 2022.
3
Prediction of Hanwoo Cattle Phenotypes from Genotypes Using Machine Learning Methods.
Brief Bioinform. 2025 May 1;26(3). doi: 10.1093/bib/bbaf294.
4
Artificial intelligence: the human response to approach the complexity of big data in biology.人工智能:人类应对生物学大数据复杂性的方式
Gigascience. 2025 Jan 6;14. doi: 10.1093/gigascience/giaf057.
5
ExAutoGP: Enhancing Genomic Prediction Stability and Interpretability with Automated Machine Learning and SHAP.ExAutoGP:通过自动机器学习和SHAP增强基因组预测的稳定性和可解释性
Animals (Basel). 2025 Apr 18;15(8):1172. doi: 10.3390/ani15081172.
6
Breaking down data silos across companies to train genome-wide predictions: A feasibility study in wheat.打破公司间的数据孤岛以训练全基因组预测:小麦的可行性研究
Plant Biotechnol J. 2025 Jul;23(7):2704-2719. doi: 10.1111/pbi.70095. Epub 2025 Apr 20.
7
Enhancing Genomic Prediction Accuracy of Reproduction Traits in Rongchang Pigs Through Machine Learning.通过机器学习提高荣昌猪繁殖性状的基因组预测准确性
Animals (Basel). 2025 Feb 12;15(4):525. doi: 10.3390/ani15040525.
8
Performance Comparison of Genomic Best Linear Unbiased Prediction and Four Machine Learning Models for Estimating Genomic Breeding Values in Working Dogs.基因组最佳线性无偏预测与四种机器学习模型在工作犬基因组育种值估计中的性能比较
Animals (Basel). 2025 Feb 2;15(3):408. doi: 10.3390/ani15030408.
9
The impact of dietary fiber on colorectal cancer patients based on machine learning.基于机器学习的膳食纤维对结直肠癌患者的影响
Front Nutr. 2025 Jan 24;12:1508562. doi: 10.3389/fnut.2025.1508562. eCollection 2025.
10
Integration of machine learning and genome-wide association study to explore the genomic prediction accuracy of agronomic trait in oats (Avena sativa L.).整合机器学习与全基因组关联研究以探究燕麦(Avena sativa L.)农艺性状的基因组预测准确性。
Plant Genome. 2025 Mar;18(1):e20549. doi: 10.1002/tpg2.20549.
使用机器学习方法从基因型预测韩牛表型
Animals (Basel). 2021 Jul 11;11(7):2066. doi: 10.3390/ani11072066.
4
The application of pangenomics and machine learning in genomic selection in plants.泛基因组学和机器学习在植物基因组选择中的应用。
Plant Genome. 2021 Nov;14(3):e20112. doi: 10.1002/tpg2.20112. Epub 2021 Jul 20.
5
G2PDeep: a web-based deep-learning framework for quantitative phenotype prediction and discovery of genomic markers.G2PDeep:一个基于网络的深度学习框架,用于定量表型预测和基因组标记的发现。
Nucleic Acids Res. 2021 Jul 2;49(W1):W228-W236. doi: 10.1093/nar/gkab407.
6
KCRR: a nonlinear machine learning with a modified genomic similarity matrix improved the genomic prediction efficiency.KCRR:一种利用改进的基因组相似性矩阵的非线性机器学习方法,提高了基因组预测效率。
Brief Bioinform. 2021 Nov 5;22(6). doi: 10.1093/bib/bbab132.
7
Feature Selection Stability and Accuracy of Prediction Models for Genomic Prediction of Residual Feed Intake in Pigs Using Machine Learning.使用机器学习对猪的剩余采食量进行基因组预测的预测模型的特征选择稳定性和准确性
Front Genet. 2021 Feb 22;12:611506. doi: 10.3389/fgene.2021.611506. eCollection 2021.
8
Identification of Predictor Genes for Feed Efficiency in Beef Cattle by Applying Machine Learning Methods to Multi-Tissue Transcriptome Data.通过将机器学习方法应用于多组织转录组数据来鉴定肉牛饲料效率的预测基因
Front Genet. 2021 Feb 16;12:619857. doi: 10.3389/fgene.2021.619857. eCollection 2021.
9
A review of deep learning applications for genomic selection.深度学习在基因组选择中的应用综述。
BMC Genomics. 2021 Jan 6;22(1):19. doi: 10.1186/s12864-020-07319-x.
10
Applications of Support Vector Machine in Genomic Prediction in Pig and Maize Populations.支持向量机在猪和玉米群体基因组预测中的应用
Front Genet. 2020 Dec 3;11:598318. doi: 10.3389/fgene.2020.598318. eCollection 2020.