• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于模拟数据和三种植物物种的经典与机器学习表型预测方法比较。

A comparison of classical and machine learning-based phenotype prediction methods on simulated data and three plant species.

作者信息

John Maura, Haselbeck Florian, Dass Rupashree, Malisi Christoph, Ricca Patrizia, Dreischer Christian, Schultheiss Sebastian J, Grimm Dominik G

机构信息

Technical University of Munich, Campus Straubing for Biotechnology and Sustainability, Bioinformatics, Straubing, Germany.

Weihenstephan-Triesdorf University of Applied Sciences, Bioinformatics, Straubing, Germany.

出版信息

Front Plant Sci. 2022 Nov 4;13:932512. doi: 10.3389/fpls.2022.932512. eCollection 2022.

DOI:10.3389/fpls.2022.932512
PMID:36407627
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC9673477/
Abstract

Genomic selection is an integral tool for breeders to accurately select plants directly from genotype data leading to faster and more resource-efficient breeding programs. Several prediction methods have been established in the last few years. These range from classical linear mixed models to complex non-linear machine learning approaches, such as Support Vector Regression, and modern deep learning-based architectures. Many of these methods have been extensively evaluated on different crop species with varying outcomes. In this work, our aim is to systematically compare 12 different phenotype prediction models, including basic genomic selection methods to more advanced deep learning-based techniques. More importantly, we assess the performance of these models on simulated phenotype data as well as on real-world data from and two breeding datasets from soy and corn. The synthetic phenotypic data allow us to analyze all prediction models and especially the selected markers under controlled and predefined settings. We show that Bayes B and linear regression models with sparsity constraints perform best under different simulation settings with respect to explained variance. Further, we can confirm results from other studies that there is no superiority of more complex neural network-based architectures for phenotype prediction compared to well-established methods. However, on real-world data, for which several prediction models yield comparable results with slight advantages for Elastic Net, this picture is less clear, suggesting that there is a lot of room for future research.

摘要

基因组选择是育种者直接根据基因型数据准确选择植物的一项重要工具,可带来更快且资源利用效率更高的育种计划。在过去几年中已经建立了多种预测方法。这些方法从经典的线性混合模型到复杂的非线性机器学习方法,如支持向量回归,以及基于现代深度学习的架构。其中许多方法已经在不同作物物种上进行了广泛评估,结果各异。在这项工作中,我们的目标是系统地比较12种不同的表型预测模型,包括基本的基因组选择方法到更先进的基于深度学习的技术。更重要的是,我们评估这些模型在模拟表型数据以及来自大豆和玉米的两个育种数据集的真实世界数据上的性能。合成表型数据使我们能够在受控和预定义的设置下分析所有预测模型,特别是所选标记。我们表明,贝叶斯B模型和具有稀疏性约束的线性回归模型在不同模拟设置下在解释方差方面表现最佳。此外,我们可以证实其他研究的结果,即与成熟方法相比,基于更复杂神经网络的架构在表型预测方面没有优势。然而,在真实世界数据上,几种预测模型产生了可比的结果,弹性网络略有优势,情况不太明确,这表明未来研究还有很大空间。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1362/9673477/15db1c082091/fpls-13-932512-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1362/9673477/36bc3e9d7d7d/fpls-13-932512-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1362/9673477/add048ca5d8b/fpls-13-932512-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1362/9673477/fc52ad550dd8/fpls-13-932512-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1362/9673477/15db1c082091/fpls-13-932512-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1362/9673477/36bc3e9d7d7d/fpls-13-932512-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1362/9673477/add048ca5d8b/fpls-13-932512-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1362/9673477/fc52ad550dd8/fpls-13-932512-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1362/9673477/15db1c082091/fpls-13-932512-g004.jpg

相似文献

1
A comparison of classical and machine learning-based phenotype prediction methods on simulated data and three plant species.基于模拟数据和三种植物物种的经典与机器学习表型预测方法比较。
Front Plant Sci. 2022 Nov 4;13:932512. doi: 10.3389/fpls.2022.932512. eCollection 2022.
2
Genomic prediction using machine learning: a comparison of the performance of regularized regression, ensemble, instance-based and deep learning methods on synthetic and empirical data.基于机器学习的基因组预测:在合成数据和实际数据上,正则化回归、集成、基于实例和深度学习方法的性能比较。
BMC Genomics. 2024 Feb 7;25(1):152. doi: 10.1186/s12864-023-09933-x.
3
Tabular deep learning: a comparative study applied to multi-task genome-wide prediction.表格深度学习:应用于多任务全基因组预测的比较研究。
BMC Bioinformatics. 2024 Oct 4;25(1):322. doi: 10.1186/s12859-024-05940-1.
4
Deep learning versus parametric and ensemble methods for genomic prediction of complex phenotypes.深度学习与参数化和集成方法在复杂表型基因组预测中的比较。
Genet Sel Evol. 2020 Feb 24;52(1):12. doi: 10.1186/s12711-020-00531-z.
5
Plant Genotype to Phenotype Prediction Using Machine Learning.利用机器学习进行植物基因型到表型的预测
Front Genet. 2022 May 18;13:822173. doi: 10.3389/fgene.2022.822173. eCollection 2022.
6
Improved genomic prediction using machine learning with Variational Bayesian sparsity.使用具有变分贝叶斯稀疏性的机器学习改进基因组预测。
Plant Methods. 2023 Sep 2;19(1):96. doi: 10.1186/s13007-023-01073-3.
7
Machine learning algorithms for outcome prediction in (chemo)radiotherapy: An empirical comparison of classifiers.机器学习算法在(放化疗)治疗结果预测中的应用:分类器的实证比较。
Med Phys. 2018 Jul;45(7):3449-3459. doi: 10.1002/mp.12967. Epub 2018 Jun 13.
8
Genomic prediction in animals and plants: simulation of data, validation, reporting, and benchmarking.动植物基因组预测:数据模拟、验证、报告和基准测试。
Genetics. 2013 Feb;193(2):347-65. doi: 10.1534/genetics.112.147983. Epub 2012 Dec 5.
9
Accounting for Correlation Between Traits in Genomic Prediction.基因组预测中性状间相关性的考量
Methods Mol Biol. 2022;2467:285-327. doi: 10.1007/978-1-0716-2205-6_10.
10
A review of machine learning models applied to genomic prediction in animal breeding.应用于动物育种基因组预测的机器学习模型综述。
Front Genet. 2023 Sep 6;14:1150596. doi: 10.3389/fgene.2023.1150596. eCollection 2023.

引用本文的文献

1
Semi-parametric validation of genomic predictions and polygenic risk scores with the Blupf90 software suite.使用Blupf90软件套件对基因组预测和多基因风险评分进行半参数验证。
G3 (Bethesda). 2025 Aug 6;15(8). doi: 10.1093/g3journal/jkaf136.
2
Distinguishing isotropic and anisotropic signals for X-ray total scattering using machine learning.利用机器学习区分X射线全散射中的各向同性和各向异性信号。
Acta Crystallogr A Found Adv. 2025 May 1;81(Pt 3):175-187. doi: 10.1107/S2053273325002438. Epub 2025 Apr 4.
3
Improved genomic prediction performance with ensembles of diverse models.

本文引用的文献

1
Efficient permutation-based genome-wide association studies for normal and skewed phenotypic distributions.高效的基于排列的全基因组关联研究,适用于正态和偏态表型分布。
Bioinformatics. 2022 Sep 16;38(Suppl_2):ii5-ii12. doi: 10.1093/bioinformatics/btac455.
2
Prediction of Maize Phenotypic Traits With Genomic and Environmental Predictors Using Gradient Boosting Frameworks.使用梯度提升框架,通过基因组和环境预测因子预测玉米表型性状
Front Plant Sci. 2021 Nov 11;12:699589. doi: 10.3389/fpls.2021.699589. eCollection 2021.
3
Deep neural networks for genomic prediction do not estimate marker effects.
通过多种不同模型的集成提高基因组预测性能。
G3 (Bethesda). 2025 May 8;15(5). doi: 10.1093/g3journal/jkaf048.
4
Enhancing Genomic Prediction Accuracy of Reproduction Traits in Rongchang Pigs Through Machine Learning.通过机器学习提高荣昌猪繁殖性状的基因组预测准确性
Animals (Basel). 2025 Feb 12;15(4):525. doi: 10.3390/ani15040525.
5
Explainable artificial intelligence for genotype-to-phenotype prediction in plant breeding: a case study with a dataset from an almond germplasm collection.用于植物育种中基因型到表型预测的可解释人工智能:以杏仁种质资源库数据集为例的研究
Front Plant Sci. 2024 Sep 9;15:1434229. doi: 10.3389/fpls.2024.1434229. eCollection 2024.
6
A review of artificial intelligence-assisted omics techniques in plant defense: current trends and future directions.植物防御中人工智能辅助组学技术综述:当前趋势与未来方向
Front Plant Sci. 2024 Mar 5;15:1292054. doi: 10.3389/fpls.2024.1292054. eCollection 2024.
7
Comparing feature selection and machine learning approaches for predicting methylation from genetic variation.比较用于从基因变异预测甲基化的特征选择和机器学习方法。
Front Neuroinform. 2024 Feb 21;17:1244336. doi: 10.3389/fninf.2023.1244336. eCollection 2023.
8
Review of applications of artificial intelligence (AI) methods in crop research.人工智能(AI)方法在作物研究中的应用综述。
J Appl Genet. 2024 May;65(2):225-240. doi: 10.1007/s13353-023-00826-z. Epub 2024 Jan 13.
9
PopGenAdapt: Semi-Supervised Domain Adaptation for Genotype-to-Phenotype Prediction in Underrepresented Populations.PopGenAdapt:用于代表性不足人群中基因型到表型预测的半监督领域自适应
Pac Symp Biocomput. 2024;29:327-340.
10
PopGenAdapt: Semi-Supervised Domain Adaptation for Genotype-to-Phenotype Prediction in Underrepresented Populations.PopGenAdapt:针对代表性不足人群中基因型到表型预测的半监督域适应方法
bioRxiv. 2023 Oct 10:2023.10.10.561715. doi: 10.1101/2023.10.10.561715.
深度神经网络在基因组预测中并不估计标记效应。
Plant Genome. 2021 Nov;14(3):e20147. doi: 10.1002/tpg2.20147. Epub 2021 Oct 1.
4
Multitrait machine- and deep-learning models for genomic selection using spectral information in a wheat breeding program.利用小麦育种计划中的光谱信息,基于多种性状的机器和深度学习模型进行基因组选择。
Plant Genome. 2021 Nov;14(3):e20119. doi: 10.1002/tpg2.20119. Epub 2021 Sep 5.
5
Why and How to Switch to Genomic Selection: Lessons From Plant and Animal Breeding Experience.为何以及如何转向基因组选择:来自植物和动物育种经验的教训。
Front Genet. 2021 Jul 9;12:629737. doi: 10.3389/fgene.2021.629737. eCollection 2021.
6
Deep Learning for Predicting Complex Traits in Spring Wheat Breeding Program.深度学习在春小麦育种计划中预测复杂性状的应用
Front Plant Sci. 2021 Jan 5;11:613325. doi: 10.3389/fpls.2020.613325. eCollection 2020.
7
A review of deep learning applications for genomic selection.深度学习在基因组选择中的应用综述。
BMC Genomics. 2021 Jan 6;22(1):19. doi: 10.1186/s12864-020-07319-x.
8
Using Local Convolutional Neural Networks for Genomic Prediction.使用局部卷积神经网络进行基因组预测。
Front Genet. 2020 Nov 12;11:561497. doi: 10.3389/fgene.2020.561497. eCollection 2020.
9
Deep learning versus parametric and ensemble methods for genomic prediction of complex phenotypes.深度学习与参数化和集成方法在复杂表型基因组预测中的比较。
Genet Sel Evol. 2020 Feb 24;52(1):12. doi: 10.1186/s12711-020-00531-z.
10
Deep Kernel and Deep Learning for Genome-Based Prediction of Single Traits in Multienvironment Breeding Trials.基于基因组的多环境育种试验单性状预测中的深度核与深度学习
Front Genet. 2019 Dec 9;10:1168. doi: 10.3389/fgene.2019.01168. eCollection 2019.