• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

利用机器学习进行植物基因型到表型的预测

Plant Genotype to Phenotype Prediction Using Machine Learning.

作者信息

Danilevicz Monica F, Gill Mitchell, Anderson Robyn, Batley Jacqueline, Bennamoun Mohammed, Bayer Philipp E, Edwards David

机构信息

School of Biological Sciences and Institute of Agriculture, University of Western Australia, Perth, WA, Australia.

School of Physics, Mathematics and Computing, University of Western Australia, Perth, WA, Australia.

出版信息

Front Genet. 2022 May 18;13:822173. doi: 10.3389/fgene.2022.822173. eCollection 2022.

DOI:10.3389/fgene.2022.822173
PMID:35664329
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC9159391/
Abstract

Genomic prediction tools support crop breeding based on statistical methods, such as the genomic best linear unbiased prediction (GBLUP). However, these tools are not designed to capture non-linear relationships within multi-dimensional datasets, or deal with high dimension datasets such as imagery collected by unmanned aerial vehicles. Machine learning (ML) algorithms have the potential to surpass the prediction accuracy of current tools used for genotype to phenotype prediction, due to their capacity to autonomously extract data features and represent their relationships at multiple levels of abstraction. This review addresses the challenges of applying statistical and machine learning methods for predicting phenotypic traits based on genetic markers, environment data, and imagery for crop breeding. We present the advantages and disadvantages of explainable model structures, discuss the potential of machine learning models for genotype to phenotype prediction in crop breeding, and the challenges, including the scarcity of high-quality datasets, inconsistent metadata annotation and the requirements of ML models.

摘要

基因组预测工具支持基于统计方法的作物育种,例如基因组最佳线性无偏预测(GBLUP)。然而,这些工具并非旨在捕捉多维数据集中的非线性关系,也无法处理高维数据集,如无人机收集的图像数据。机器学习(ML)算法有潜力超越当前用于基因型到表型预测的工具的预测准确性,因为它们能够自动提取数据特征并在多个抽象层次上表示其关系。本综述探讨了应用统计和机器学习方法基于遗传标记、环境数据和图像进行作物育种表型性状预测所面临的挑战。我们阐述了可解释模型结构的优缺点,讨论了机器学习模型在作物育种中进行基因型到表型预测的潜力以及面临的挑战,包括高质量数据集的稀缺、元数据注释不一致以及机器学习模型的要求等。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/64a8/9159391/db50656a40fb/fgene-13-822173-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/64a8/9159391/2ec7282ad2dd/fgene-13-822173-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/64a8/9159391/fc45a368ff9a/fgene-13-822173-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/64a8/9159391/6041d9e42c2f/fgene-13-822173-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/64a8/9159391/db50656a40fb/fgene-13-822173-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/64a8/9159391/2ec7282ad2dd/fgene-13-822173-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/64a8/9159391/fc45a368ff9a/fgene-13-822173-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/64a8/9159391/6041d9e42c2f/fgene-13-822173-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/64a8/9159391/db50656a40fb/fgene-13-822173-g004.jpg

相似文献

1
Plant Genotype to Phenotype Prediction Using Machine Learning.利用机器学习进行植物基因型到表型的预测
Front Genet. 2022 May 18;13:822173. doi: 10.3389/fgene.2022.822173. eCollection 2022.
2
A Comparison of Three Machine Learning Methods for Multivariate Genomic Prediction Using the Sparse Kernels Method (SKM) Library.三种机器学习方法在使用稀疏核方法 (SKM) 库进行多元基因组预测中的比较。
Genes (Basel). 2022 Aug 21;13(8):1494. doi: 10.3390/genes13081494.
3
A Comprehensive Review of High Throughput Phenotyping and Machine Learning for Plant Stress Phenotyping.植物胁迫表型高通量表型分析与机器学习综述
Phenomics. 2022 Apr 4;2(3):156-183. doi: 10.1007/s43657-022-00048-z. eCollection 2022 Jun.
4
Smart breeding driven by big data, artificial intelligence, and integrated genomic-enviromic prediction.由大数据、人工智能和综合基因组-环境预测驱动的智能育种。
Mol Plant. 2022 Nov 7;15(11):1664-1695. doi: 10.1016/j.molp.2022.09.001. Epub 2022 Sep 7.
5
A Bayesian Genomic Multi-output Regressor Stacking Model for Predicting Multi-trait Multi-environment Plant Breeding Data.一种用于预测多性状多环境植物育种数据的贝叶斯基因组多输出回归器堆叠模型。
G3 (Bethesda). 2019 Oct 7;9(10):3381-3393. doi: 10.1534/g3.119.400336.
6
Genomic prediction with machine learning in sugarcane, a complex highly polyploid clonally propagated crop with substantial non-additive variation for key traits.利用机器学习进行甘蔗基因组预测,甘蔗是一种复杂的高度多倍体克隆繁殖作物,其关键性状存在大量非加性变异。
Plant Genome. 2023 Dec;16(4):e20390. doi: 10.1002/tpg2.20390. Epub 2023 Sep 20.
7
A Benchmarking Between Deep Learning, Support Vector Machine and Bayesian Threshold Best Linear Unbiased Prediction for Predicting Ordinal Traits in Plant Breeding.深度学习、支持向量机和贝叶斯阈值最佳线性无偏预测在植物育种中预测有序性状的基准比较
G3 (Bethesda). 2019 Feb 7;9(2):601-618. doi: 10.1534/g3.118.200998.
8
Machine learning approaches for crop improvement: Leveraging phenotypic and genotypic big data.机器学习在作物改良中的应用:利用表型和基因型大数据。
J Plant Physiol. 2021 Feb;257:153354. doi: 10.1016/j.jplph.2020.153354. Epub 2020 Dec 29.
9
Deep learning methods improve genomic prediction of wheat breeding.深度学习方法改进了小麦育种的基因组预测。
Front Plant Sci. 2024 Mar 4;15:1324090. doi: 10.3389/fpls.2024.1324090. eCollection 2024.
10
Explainable artificial intelligence for genotype-to-phenotype prediction in plant breeding: a case study with a dataset from an almond germplasm collection.用于植物育种中基因型到表型预测的可解释人工智能:以杏仁种质资源库数据集为例的研究
Front Plant Sci. 2024 Sep 9;15:1434229. doi: 10.3389/fpls.2024.1434229. eCollection 2024.

引用本文的文献

1
Analyses of crop yield dynamics and the development of a multimodal neural network prediction model with G×E×M interactions.作物产量动态分析以及具有基因型×环境×管理相互作用的多模态神经网络预测模型的开发。
Front Plant Sci. 2025 Jul 31;16:1537990. doi: 10.3389/fpls.2025.1537990. eCollection 2025.
2
MDNN: memetic deep neural network for genomic prediction.MDNN:用于基因组预测的模因深度神经网络。
Brief Bioinform. 2025 Jul 2;26(4). doi: 10.1093/bib/bbaf352.
3
Integrating multi-omics and machine learning for disease resistance prediction in legumes.

本文引用的文献

1
Stop Explaining Black Box Machine Learning Models for High Stakes Decisions and Use Interpretable Models Instead.停止为高风险决策解释黑箱机器学习模型,转而使用可解释模型。
Nat Mach Intell. 2019 May;1(5):206-215. doi: 10.1038/s42256-019-0048-x. Epub 2019 May 13.
2
Deep-learning power and perspectives for genomic selection.深度学习在基因组选择中的应用与展望。
Plant Genome. 2021 Nov;14(3):e20122. doi: 10.1002/tpg2.20122. Epub 2021 Jul 26.
3
Crop yield prediction integrating genotype and weather variables using deep learning.
整合多组学和机器学习用于豆类抗病性预测
Theor Appl Genet. 2025 Jun 27;138(7):163. doi: 10.1007/s00122-025-04948-2.
4
Genomic selection: Essence, applications, and prospects.基因组选择:本质、应用与前景。
Plant Genome. 2025 Jun;18(2):e70053. doi: 10.1002/tpg2.70053.
5
WheatGP, a genomic prediction method based on CNN and LSTM.WheatGP,一种基于卷积神经网络(CNN)和长短期记忆网络(LSTM)的基因组预测方法。
Brief Bioinform. 2025 Mar 4;26(2). doi: 10.1093/bib/bbaf191.
6
National biodiversity data infrastructures: ten essential functions for science, policy, and practice.国家生物多样性数据基础设施:科学、政策与实践的十项基本功能。
Bioscience. 2024 Oct 30;75(2):139-151. doi: 10.1093/biosci/biae109. eCollection 2025 Feb.
7
Enhancing Genomic Prediction Accuracy of Reproduction Traits in Rongchang Pigs Through Machine Learning.通过机器学习提高荣昌猪繁殖性状的基因组预测准确性
Animals (Basel). 2025 Feb 12;15(4):525. doi: 10.3390/ani15040525.
8
Genomic and phenomic prediction for soybean seed yield, protein, and oil.大豆种子产量、蛋白质和油含量的基因组与表型预测
Plant Genome. 2025 Mar;18(1):e70002. doi: 10.1002/tpg2.70002.
9
A review of multimodal deep learning methods for genomic-enabled prediction in plant breeding.用于植物育种中基因组预测的多模态深度学习方法综述。
Genetics. 2024 Nov 5;228(4). doi: 10.1093/genetics/iyae161.
10
Visualizing Plant Responses: Novel Insights Possible Through Affordable Imaging Techniques in the Greenhouse.可视化植物响应:温室中经济实惠的成像技术带来新的见解。
Sensors (Basel). 2024 Oct 17;24(20):6676. doi: 10.3390/s24206676.
利用深度学习整合基因型和气象变量进行作物产量预测。
PLoS One. 2021 Jun 17;16(6):e0252402. doi: 10.1371/journal.pone.0252402. eCollection 2021.
4
Validation of functional polymorphisms affecting maize plant height by unoccupied aerial systems discovers novel temporal phenotypes.利用无人空中系统验证影响玉米株高的功能多态性,发现新的时间表型。
G3 (Bethesda). 2021 Jun 17;11(6). doi: 10.1093/g3journal/jkab075.
5
HARVESTMAN: a framework for hierarchical feature learning and selection from whole genome sequencing data.HARVESTMAN:一种从全基因组测序数据中进行层次特征学习和选择的框架。
BMC Bioinformatics. 2021 Apr 1;22(1):174. doi: 10.1186/s12859-021-04096-6.
6
Deep Learning for Predicting Complex Traits in Spring Wheat Breeding Program.深度学习在春小麦育种计划中预测复杂性状的应用
Front Plant Sci. 2021 Jan 5;11:613325. doi: 10.3389/fpls.2020.613325. eCollection 2020.
7
A review of deep learning applications for genomic selection.深度学习在基因组选择中的应用综述。
BMC Genomics. 2021 Jan 6;22(1):19. doi: 10.1186/s12864-020-07319-x.
8
Integrating multi-omics data for crop improvement.整合多组学数据以促进作物改良。
J Plant Physiol. 2021 Feb;257:153352. doi: 10.1016/j.jplph.2020.153352. Epub 2020 Dec 17.
9
Machine learning in agriculture: from silos to marketplaces.农业中的机器学习:从筒仓到市场
Plant Biotechnol J. 2021 Apr;19(4):648-650. doi: 10.1111/pbi.13521. Epub 2020 Dec 20.
10
GMStool: GWAS-based marker selection tool for genomic prediction from genomic data.GMStool:基于 GWAS 的基因组预测标记选择工具,用于从基因组数据中进行基因组预测。
Sci Rep. 2020 Nov 12;10(1):19653. doi: 10.1038/s41598-020-76759-y.