• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

多模态深度学习通过融合基因组学和表型组学提高小麦育种中的产量预测。

Multi-modal deep learning improves grain yield prediction in wheat breeding by fusing genomics and phenomics.

机构信息

Department of Biosystems Science and Engineering, ETH Zurich, Basel, Switzerland.

Swiss Institute of Bioinformatics, Lausanne, Switzerland.

出版信息

Bioinformatics. 2023 Jun 1;39(6). doi: 10.1093/bioinformatics/btad336.

DOI:10.1093/bioinformatics/btad336
PMID:37220903
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC10246581/
Abstract

MOTIVATION

Developing new crop varieties with superior performance is highly important to ensure robust and sustainable global food security. The speed of variety development is limited by long field cycles and advanced generation selections in plant breeding programs. While methods to predict yield from genotype or phenotype data have been proposed, improved performance and integrated models are needed.

RESULTS

We propose a machine learning model that leverages both genotype and phenotype measurements by fusing genetic variants with multiple data sources collected by unmanned aerial systems. We use a deep multiple instance learning framework with an attention mechanism that sheds light on the importance given to each input during prediction, enhancing interpretability. Our model reaches 0.754 ± 0.024 Pearson correlation coefficient when predicting yield in similar environmental conditions; a 34.8% improvement over the genotype-only linear baseline (0.559 ± 0.050). We further predict yield on new lines in an unseen environment using only genotypes, obtaining a prediction accuracy of 0.386 ± 0.010, a 13.5% improvement over the linear baseline. Our multi-modal deep learning architecture efficiently accounts for plant health and environment, distilling the genetic contribution and providing excellent predictions. Yield prediction algorithms leveraging phenotypic observations during training therefore promise to improve breeding programs, ultimately speeding up delivery of improved varieties.

AVAILABILITY AND IMPLEMENTATION

Available at https://github.com/BorgwardtLab/PheGeMIL (code) and https://doi.org/doi:10.5061/dryad.kprr4xh5p (data).

摘要

动机

开发具有卓越性能的新型作物品种对于确保全球粮食安全的稳健和可持续性至关重要。品种开发的速度受到植物育种计划中田间周期长和代际选择先进的限制。虽然已经提出了从基因型或表型数据预测产量的方法,但需要改进性能和综合模型。

结果

我们提出了一种机器学习模型,通过融合遗传变异与无人航空系统收集的多个数据源,利用基因型和表型测量值。我们使用具有注意力机制的深度多实例学习框架,该机制阐明了在预测过程中对每个输入的重视程度,提高了可解释性。当在相似的环境条件下预测产量时,我们的模型达到了 0.754±0.024 的皮尔逊相关系数;与仅基于基因型的线性基线(0.559±0.050)相比,提高了 34.8%。我们进一步仅使用基因型在看不见的环境中预测新的系谱产量,获得了 0.386±0.010 的预测精度,比线性基线提高了 13.5%。我们的多模态深度学习架构有效地考虑了植物健康和环境,提取了遗传贡献并提供了出色的预测。因此,在训练过程中利用表型观测值的产量预测算法有望改进育种计划,最终加快改良品种的交付。

可用性和实现

可在 https://github.com/BorgwardtLab/PheGeMIL(代码)和 https://doi.org/doi:10.5061/dryad.kprr4xh5p(数据)上获得。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3f3d/10246581/b1811f816f1d/btad336f4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3f3d/10246581/2044a144791d/btad336f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3f3d/10246581/25c7c0adcd1e/btad336f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3f3d/10246581/15a6ffbcb696/btad336f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3f3d/10246581/b1811f816f1d/btad336f4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3f3d/10246581/2044a144791d/btad336f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3f3d/10246581/25c7c0adcd1e/btad336f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3f3d/10246581/15a6ffbcb696/btad336f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3f3d/10246581/b1811f816f1d/btad336f4.jpg

相似文献

1
Multi-modal deep learning improves grain yield prediction in wheat breeding by fusing genomics and phenomics.多模态深度学习通过融合基因组学和表型组学提高小麦育种中的产量预测。
Bioinformatics. 2023 Jun 1;39(6). doi: 10.1093/bioinformatics/btad336.
2
Phenomic selection in wheat breeding: prediction of the genotype-by-environment interaction in multi-environment breeding trials.小麦育种中的表型组选择:多环境育种试验中基因型与环境互作的预测
Theor Appl Genet. 2022 Oct;135(10):3337-3356. doi: 10.1007/s00122-022-04170-4. Epub 2022 Aug 8.
3
Genomic assisted selection for enhancing line breeding: merging genomic and phenotypic selection in winter wheat breeding programs with preliminary yield trials.通过基因组辅助选择提高品系选育:在冬小麦育种计划中结合基因组和表型选择并进行初步产量试验
Theor Appl Genet. 2017 Feb;130(2):363-376. doi: 10.1007/s00122-016-2818-8. Epub 2016 Nov 8.
4
Spectral-genomic chain-model approach enhances the wheat yield component prediction under the Mediterranean climate.光谱基因组链模型方法提高了在地中海气候下小麦产量构成要素的预测能力。
Physiol Plant. 2024 Jul-Aug;176(4):e14480. doi: 10.1111/ppl.14480.
5
TrG2P: A transfer-learning-based tool integrating multi-trait data for accurate prediction of crop yield.TrG2P:一种基于迁移学习的工具,集成多性状数据,用于准确预测作物产量。
Plant Commun. 2024 Jul 8;5(7):100975. doi: 10.1016/j.xplc.2024.100975. Epub 2024 May 15.
6
Enhancing winter wheat prediction with genomics, phenomics and environmental data.利用基因组学、表型组学和环境数据提高冬小麦预测能力。
BMC Genomics. 2024 May 31;25(1):544. doi: 10.1186/s12864-024-10438-4.
7
Simultaneous improvement of grain yield and protein content in durum wheat by different phenotypic indices and genomic selection.通过不同的表型指标和基因组选择同时提高硬质小麦的籽粒产量和蛋白质含量。
Theor Appl Genet. 2018 Jun;131(6):1315-1329. doi: 10.1007/s00122-018-3080-z. Epub 2018 Mar 6.
8
Multi-Trait Genomic Prediction of Yield-Related Traits in US Soft Wheat under Variable Water Regimes.在不同水分条件下美国软小麦产量相关性状的多性状基因组预测。
Genes (Basel). 2020 Oct 28;11(11):1270. doi: 10.3390/genes11111270.
9
Multimodal deep learning methods enhance genomic prediction of wheat breeding.多模态深度学习方法提高了小麦育种的基因组预测。
G3 (Bethesda). 2023 May 2;13(5). doi: 10.1093/g3journal/jkad045.
10
Simultaneous selection for grain yield and protein content in genomics-assisted wheat breeding.在基因组辅助小麦育种中同时选择粒产量和蛋白质含量。
Theor Appl Genet. 2019 Jun;132(6):1745-1760. doi: 10.1007/s00122-019-03312-5. Epub 2019 Feb 27.

引用本文的文献

1
Artificial Intelligence-Assisted Breeding for Plant Disease Resistance.人工智能辅助的植物抗病育种
Int J Mol Sci. 2025 Jun 1;26(11):5324. doi: 10.3390/ijms26115324.
2
Improving plant breeding through AI-supported data integration.通过人工智能支持的数据整合改进植物育种。
Theor Appl Genet. 2025 Jun 2;138(6):132. doi: 10.1007/s00122-025-04910-2.
3
Application of machine learning and genomics for orphan crop improvement.机器学习与基因组学在小众作物改良中的应用。
Nat Commun. 2025 Jan 24;16(1):982. doi: 10.1038/s41467-025-56330-x.
4
A review of multimodal deep learning methods for genomic-enabled prediction in plant breeding.用于植物育种中基因组预测的多模态深度学习方法综述。
Genetics. 2024 Nov 5;228(4). doi: 10.1093/genetics/iyae161.
5
Big data and artificial intelligence-aided crop breeding: Progress and prospects.大数据与人工智能辅助作物育种:进展与展望
J Integr Plant Biol. 2025 Mar;67(3):722-739. doi: 10.1111/jipb.13791. Epub 2024 Oct 28.
6
Enhancing the potential of phenomic and genomic prediction in winter wheat breeding using high-throughput phenotyping and deep learning.利用高通量表型分析和深度学习提高冬小麦育种中表型组学和基因组预测的潜力。
Front Plant Sci. 2024 May 30;15:1410249. doi: 10.3389/fpls.2024.1410249. eCollection 2024.