• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

自动化机器学习:玉米杂交种中基于基因组“图像”预测的案例研究。

Automated Machine Learning: A Case Study of Genomic "Image-Based" Prediction in Maize Hybrids.

作者信息

Galli Giovanni, Sabadin Felipe, Yassue Rafael Massahiro, Galves Cassia, Carvalho Humberto Fanelli, Crossa Jose, Montesinos-López Osval Antonio, Fritsche-Neto Roberto

机构信息

Department of Genetics, Luiz de Queiroz College of Agriculture, University of São Paulo, Piracicaba, Brazil.

School of Plant and Environmental Sciences, Virginia Tech, Blacksburg, VA, United States.

出版信息

Front Plant Sci. 2022 Mar 7;13:845524. doi: 10.3389/fpls.2022.845524. eCollection 2022.

DOI:10.3389/fpls.2022.845524
PMID:35321444
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC8936805/
Abstract

Machine learning methods such as multilayer perceptrons (MLP) and Convolutional Neural Networks (CNN) have emerged as promising methods for genomic prediction (GP). In this context, we assess the performance of MLP and CNN on regression and classification tasks in a case study with maize hybrids. The genomic information was provided to the MLP as a relationship matrix and to the CNN as "genomic images." In the regression task, the machine learning models were compared along with GBLUP. Under the classification task, MLP and CNN were compared. In this case, the traits (plant height and grain yield) were discretized in such a way to create balanced (moderate selection intensity) and unbalanced (extreme selection intensity) datasets for further evaluations. An automatic hyperparameter search for MLP and CNN was performed, and the best models were reported. For both task types, several metrics were calculated under a validation scheme to assess the effect of the prediction method and other variables. Overall, MLP and CNN presented competitive results to GBLUP. Also, we bring new insights on automated machine learning for genomic prediction and its implications to plant breeding.

摘要

诸如多层感知器(MLP)和卷积神经网络(CNN)之类的机器学习方法已成为基因组预测(GP)中很有前景的方法。在此背景下,我们在一个玉米杂交种的案例研究中评估了MLP和CNN在回归和分类任务上的性能。基因组信息以关系矩阵的形式提供给MLP,以“基因组图像”的形式提供给CNN。在回归任务中,将机器学习模型与GBLUP进行了比较。在分类任务下,对MLP和CNN进行了比较。在这种情况下,对性状(株高和籽粒产量)进行离散化处理,以创建平衡(中等选择强度)和不平衡(极端选择强度)数据集用于进一步评估。对MLP和CNN进行了自动超参数搜索,并报告了最佳模型。对于这两种任务类型,在验证方案下计算了几个指标,以评估预测方法和其他变量的效果。总体而言,MLP和CNN呈现出与GBLUP具有竞争力的结果。此外,我们为基因组预测的自动化机器学习及其对植物育种的影响带来了新的见解。

相似文献

1
Automated Machine Learning: A Case Study of Genomic "Image-Based" Prediction in Maize Hybrids.自动化机器学习:玉米杂交种中基于基因组“图像”预测的案例研究。
Front Plant Sci. 2022 Mar 7;13:845524. doi: 10.3389/fpls.2022.845524. eCollection 2022.
2
Machine learning methods for genomic prediction of cow behavioral traits measured by automatic milking systems in North American Holstein cattle.北美荷斯坦奶牛自动挤奶系统测量的奶牛行为性状基因组预测的机器学习方法
J Dairy Sci. 2024 Jul;107(7):4758-4771. doi: 10.3168/jds.2023-24082. Epub 2024 Feb 22.
3
Deep learning versus parametric and ensemble methods for genomic prediction of complex phenotypes.深度学习与参数化和集成方法在复杂表型基因组预测中的比较。
Genet Sel Evol. 2020 Feb 24;52(1):12. doi: 10.1186/s12711-020-00531-z.
4
Genomic prediction for sugarcane diseases including hybrid Bayesian-machine learning approaches.甘蔗病害的基因组预测,包括混合贝叶斯-机器学习方法。
Front Plant Sci. 2024 May 1;15:1398903. doi: 10.3389/fpls.2024.1398903. eCollection 2024.
5
Gut metagenome-derived image augmentation and deep learning improve prediction accuracy of metabolic disease classification.肠道宏基因组衍生的图像增强和深度学习提高代谢疾病分类预测准确性。
Yi Chuan. 2024 Oct;46(10):886-896. doi: 10.16288/j.yczz.24-086.
6
Deep Learning for Predicting Complex Traits in Spring Wheat Breeding Program.深度学习在春小麦育种计划中预测复杂性状的应用
Front Plant Sci. 2021 Jan 5;11:613325. doi: 10.3389/fpls.2020.613325. eCollection 2020.
7
Genomic prediction with machine learning in sugarcane, a complex highly polyploid clonally propagated crop with substantial non-additive variation for key traits.利用机器学习进行甘蔗基因组预测,甘蔗是一种复杂的高度多倍体克隆繁殖作物,其关键性状存在大量非加性变异。
Plant Genome. 2023 Dec;16(4):e20390. doi: 10.1002/tpg2.20390. Epub 2023 Sep 20.
8
Accurate genomic prediction for grain yield and grain moisture content of maize hybrids using multi-environment data.利用多环境数据对玉米杂交种的籽粒产量和籽粒含水量进行准确的基因组预测。
J Integr Plant Biol. 2025 May;67(5):1379-1394. doi: 10.1111/jipb.13857. Epub 2025 Feb 17.
9
HOMLC-Hyperparameter Optimization for Multi-Label Classification of Intrusion Detection Data for Internet of Things Network.用于物联网网络入侵检测数据多标签分类的HOMLC-超参数优化
Sensors (Basel). 2023 Oct 9;23(19):8333. doi: 10.3390/s23198333.
10
Machine Learning for the Genomic Prediction of Growth Traits in a Composite Beef Cattle Population.机器学习用于复合肉牛群体生长性状的基因组预测
Animals (Basel). 2024 Oct 18;14(20):3014. doi: 10.3390/ani14203014.

引用本文的文献

1
Modeling Chickpea Productivity with Artificial Image Objects and Convolutional Neural Network.利用人工图像对象和卷积神经网络对鹰嘴豆生产力进行建模
Plants (Basel). 2024 Sep 1;13(17):2444. doi: 10.3390/plants13172444.
2
Stacked ensembles on basis of parentage information can predict hybrid performance with an accuracy comparable to marker-based GBLUP.基于亲缘关系信息的堆叠集成方法能够预测杂种性能,其准确性与基于标记的基因组最佳线性无偏预测法相当。
Front Plant Sci. 2023 Jul 21;14:1178902. doi: 10.3389/fpls.2023.1178902. eCollection 2023.

本文引用的文献

1
A zero altered Poisson random forest model for genomic-enabled prediction.用于基因组辅助预测的零改变泊松随机森林模型。
G3 (Bethesda). 2021 Feb 9;11(2). doi: 10.1093/g3journal/jkaa057.
2
A review of deep learning applications for genomic selection.深度学习在基因组选择中的应用综述。
BMC Genomics. 2021 Jan 6;22(1):19. doi: 10.1186/s12864-020-07319-x.
3
Machine learning in plant science and plant breeding.植物科学与植物育种中的机器学习
iScience. 2020 Dec 5;24(1):101890. doi: 10.1016/j.isci.2020.101890. eCollection 2021 Jan 22.
4
Using Local Convolutional Neural Networks for Genomic Prediction.使用局部卷积神经网络进行基因组预测。
Front Genet. 2020 Nov 12;11:561497. doi: 10.3389/fgene.2020.561497. eCollection 2020.
5
Nonlinear kernels, dominance, and envirotyping data increase the accuracy of genome-based prediction in multi-environment trials.非线性核、优势和环境数据增加了基于基因组的多环境试验预测的准确性。
Heredity (Edinb). 2021 Jan;126(1):92-106. doi: 10.1038/s41437-020-00353-1. Epub 2020 Aug 27.
6
Opening the Black Box: Interpretable Machine Learning for Geneticists.打开黑箱:遗传学家的可解释机器学习。
Trends Genet. 2020 Jun;36(6):442-455. doi: 10.1016/j.tig.2020.03.005. Epub 2020 Apr 17.
7
Exploring Deep Learning for Complex Trait Genomic Prediction in Polyploid Outcrossing Species.探索深度学习用于多倍体异交物种复杂性状的基因组预测
Front Plant Sci. 2020 Feb 6;11:25. doi: 10.3389/fpls.2020.00025. eCollection 2020.
8
Deep learning versus parametric and ensemble methods for genomic prediction of complex phenotypes.深度学习与参数化和集成方法在复杂表型基因组预测中的比较。
Genet Sel Evol. 2020 Feb 24;52(1):12. doi: 10.1186/s12711-020-00531-z.
9
On the usefulness of parental lines GWAS for predicting low heritability traits in tropical maize hybrids.利用父本系 GWAS 预测热带玉米杂交种中低遗传力性状的有用性。
PLoS One. 2020 Feb 7;15(2):e0228724. doi: 10.1371/journal.pone.0228724. eCollection 2020.
10
Benchmarking Parametric and Machine Learning Models for Genomic Prediction of Complex Traits.基于参数和机器学习模型的复杂性状基因组预测的基准测试。
G3 (Bethesda). 2019 Nov 5;9(11):3691-3702. doi: 10.1534/g3.119.400498.