• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

利用机器学习进行甘蔗基因组预测,甘蔗是一种复杂的高度多倍体克隆繁殖作物,其关键性状存在大量非加性变异。

Genomic prediction with machine learning in sugarcane, a complex highly polyploid clonally propagated crop with substantial non-additive variation for key traits.

机构信息

Queensland Alliance for Agriculture and Food Innovation, University of Queensland, Queensland, Australia.

Sugar Research Australia, Mackay, Australia.

出版信息

Plant Genome. 2023 Dec;16(4):e20390. doi: 10.1002/tpg2.20390. Epub 2023 Sep 20.

DOI:10.1002/tpg2.20390
PMID:37728221
Abstract

Sugarcane has a complex, highly polyploid genome with multi-species ancestry. Additive models for genomic prediction of clonal performance might not capture interactions between genes and alleles from different ploidies and ancestral species. As such, genomic prediction in sugarcane presents an interesting case for machine learning (ML) methods, which are purportedly able to deal with high levels of complexity in prediction. Here, we investigated deep learning (DL) neural networks, including multilayer networks (MLP) and convolution neural networks (CNN), and an ensemble machine learning approach, random forest (RF), for genomic prediction in sugarcane. The data set used was 2912 sugarcane clones, scored for 26,086 genome wide single nucleotide polymorphism markers, with final assessment trial data for total cane harvested (TCH), commercial cane sugar (CCS), and fiber content (Fiber). The clones in the latest trial (2017) were used as a validation set. We compared prediction accuracy of these methods to genomic best linear unbiased prediction (GBLUP) extended to include dominance and epistatic effects. The prediction accuracies from GBLUP models were up to 0.37 for TCH, 0.43 for CCS, and 0.48 for Fiber, while the optimized ML models had prediction accuracies of 0.35 for TCH, 0.38 for CCS, and 0.48 for Fiber. Both RF and DL neural network models have comparable predictive ability with the additive GBLUP model but are less accurate than the extended GBLUP model.

摘要

甘蔗具有复杂的、高度多倍体基因组,具有多物种起源。用于克隆性能基因组预测的加性模型可能无法捕捉来自不同倍性和祖先物种的基因和等位基因之间的相互作用。因此,甘蔗的基因组预测为机器学习 (ML) 方法提供了一个有趣的案例,据称这些方法能够处理预测中的高水平复杂性。在这里,我们研究了深度学习 (DL) 神经网络,包括多层网络 (MLP) 和卷积神经网络 (CNN),以及一种集成机器学习方法,随机森林 (RF),用于甘蔗的基因组预测。使用的数据集是 2912 个甘蔗克隆,对 26086 个全基因组单核苷酸多态性标记进行评分,最终评估试验数据为总 cane 收获量 (TCH)、商业 cane 糖 (CCS) 和纤维含量 (Fiber)。最新试验 (2017 年) 中的克隆被用作验证集。我们将这些方法的预测准确性与包括显性和上位性效应的基因组最佳线性无偏预测 (GBLUP) 进行了比较。GBLUP 模型的预测准确性高达 TCH 为 0.37,CCS 为 0.43,纤维为 0.48,而优化的 ML 模型的预测准确性为 TCH 为 0.35,CCS 为 0.38,纤维为 0.48。RF 和 DL 神经网络模型都具有与加性 GBLUP 模型相当的预测能力,但准确性低于扩展 GBLUP 模型。

相似文献

1
Genomic prediction with machine learning in sugarcane, a complex highly polyploid clonally propagated crop with substantial non-additive variation for key traits.利用机器学习进行甘蔗基因组预测,甘蔗是一种复杂的高度多倍体克隆繁殖作物,其关键性状存在大量非加性变异。
Plant Genome. 2023 Dec;16(4):e20390. doi: 10.1002/tpg2.20390. Epub 2023 Sep 20.
2
Improved genomic prediction of clonal performance in sugarcane by exploiting non-additive genetic effects.利用非加性遗传效应提高甘蔗无性系表现的基因组预测。
Theor Appl Genet. 2021 Jul;134(7):2235-2252. doi: 10.1007/s00122-021-03822-1. Epub 2021 Apr 26.
3
Accuracy of genomic prediction of complex traits in sugarcane.甘蔗复杂性状基因组预测的准确性。
Theor Appl Genet. 2021 May;134(5):1455-1462. doi: 10.1007/s00122-021-03782-6. Epub 2021 Feb 15.
4
Genomic prediction for sugarcane diseases including hybrid Bayesian-machine learning approaches.甘蔗病害的基因组预测,包括混合贝叶斯-机器学习方法。
Front Plant Sci. 2024 May 1;15:1398903. doi: 10.3389/fpls.2024.1398903. eCollection 2024.
5
Genome-wide prediction for complex traits under the presence of dominance effects in simulated populations using GBLUP and machine learning methods.使用 GBLUP 和机器学习方法在模拟群体中存在显性效应的情况下对复杂性状进行全基因组预测。
J Anim Sci. 2020 Jun 1;98(6). doi: 10.1093/jas/skaa179.
6
Optimising clonal performance in sugarcane: leveraging non-additive effects via mate-allocation strategies.优化甘蔗的克隆性能:通过交配分配策略利用非加性效应。
Front Plant Sci. 2023 Nov 10;14:1260517. doi: 10.3389/fpls.2023.1260517. eCollection 2023.
7
Deep learning versus parametric and ensemble methods for genomic prediction of complex phenotypes.深度学习与参数化和集成方法在复杂表型基因组预测中的比较。
Genet Sel Evol. 2020 Feb 24;52(1):12. doi: 10.1186/s12711-020-00531-z.
8
Use of continuous genotypes for genomic prediction in sugarcane.利用连续基因型进行甘蔗基因组预测。
Plant Genome. 2024 Mar;17(1):e20417. doi: 10.1002/tpg2.20417. Epub 2023 Dec 8.
9
A joint learning approach for genomic prediction in polyploid grasses.多倍体草基因组预测的联合学习方法。
Sci Rep. 2022 Jul 21;12(1):12499. doi: 10.1038/s41598-022-16417-7.
10
Accuracies of genomic predictions for disease resistance of striped catfish to Edwardsiella ictaluri using artificial intelligence algorithms.利用人工智能算法预测条纹斑竹鳖对爱德华氏菌病抗性的基因组准确性。
G3 (Bethesda). 2022 Jan 4;12(1). doi: 10.1093/g3journal/jkab361.

引用本文的文献

1
Genomic prediction for sugarcane diseases including hybrid Bayesian-machine learning approaches.甘蔗病害的基因组预测,包括混合贝叶斯-机器学习方法。
Front Plant Sci. 2024 May 1;15:1398903. doi: 10.3389/fpls.2024.1398903. eCollection 2024.