深度学习、支持向量机和贝叶斯阈值最佳线性无偏预测在植物育种中预测有序性状的基准比较

A Benchmarking Between Deep Learning, Support Vector Machine and Bayesian Threshold Best Linear Unbiased Prediction for Predicting Ordinal Traits in Plant Breeding.

作者信息

Montesinos-López Osval A, Martín-Vallejo Javier, Crossa José, Gianola Daniel, Hernández-Suárez Carlos M, Montesinos-López Abelardo, Juliana Philomin, Singh Ravi

机构信息

Facultad de Telemática.

Departamento de Estadística, Universidad de Salamanca, c/Espejo 2, Salamanca, 37007, España.

出版信息

G3 (Bethesda). 2019 Feb 7;9(2):601-618. doi: 10.1534/g3.118.200998.

DOI:10.1534/g3.118.200998

PMID:30593512

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC6385991/

Abstract

Genomic selection is revolutionizing plant breeding. However, still lacking are better statistical models for ordinal phenotypes to improve the accuracy of the selection of candidate genotypes. For this reason, in this paper we explore the genomic based prediction performance of two popular machine learning methods: the Multi Layer Perceptron (MLP) and support vector machine (SVM) methods the Bayesian threshold genomic best linear unbiased prediction (TGBLUP) model. We used the percentage of cases correctly classified (PCCC) as a metric to measure the prediction performance, and seven real data sets to evaluate the prediction accuracy, and found that the best predictions (in four out of the seven data sets) in terms of PCCC occurred under the TGLBUP model, while the worst occurred under the SVM method. Also, in general we found no statistical differences between using 1, 2 and 3 layers under the MLP models, which means that many times the conventional neuronal network model with only one layer is enough. However, although even that the TGBLUP model was better, we found that the predictions of MLP and SVM were very competitive with the advantage that the SVM was the most efficient in terms of the computational time required.

摘要

基因组选择正在彻底改变植物育种。然而，用于有序表型的更好统计模型仍然缺乏，以提高候选基因型选择的准确性。因此，在本文中，我们探讨了两种流行的机器学习方法基于基因组的预测性能：多层感知器（MLP）和支持向量机（SVM）方法以及贝叶斯阈值基因组最佳线性无偏预测（TGBLUP）模型。我们使用正确分类病例的百分比（PCCC）作为衡量预测性能的指标，并使用七个真实数据集来评估预测准确性，发现就PCCC而言，最佳预测（在七个数据集中的四个）出现在TGLBUP模型下，而最差预测出现在SVM方法下。此外，总体而言，我们发现在MLP模型下使用1层、2层和3层之间没有统计学差异，这意味着很多时候仅一层的传统神经网络模型就足够了。然而，尽管TGBLUP模型更好，但我们发现MLP和SVM的预测非常有竞争力，其优势在于SVM在所需计算时间方面效率最高。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5468/6385991/63e17f837c79/601f1.jpg

相似文献

A Benchmarking Between Deep Learning, Support Vector Machine and Bayesian Threshold Best Linear Unbiased Prediction for Predicting Ordinal Traits in Plant Breeding.

G3 (Bethesda). 2019 Feb 7;9(2):601-618. doi: 10.1534/g3.118.200998.

Comparing gradient boosting machine and Bayesian threshold BLUP for genome-based prediction of categorical traits in wheat breeding.

Plant Genome. 2022 Sep;15(3):e20214. doi: 10.1002/tpg2.20214. Epub 2022 May 10.

New Deep Learning Genomic-Based Prediction Model for Multiple Traits with Binary, Ordinal, and Continuous Phenotypes.

G3 (Bethesda). 2019 May 7;9(5):1545-1556. doi: 10.1534/g3.119.300585.

Maximum Threshold Genomic Prediction Model for Ordinal Traits.

G3 (Bethesda). 2020 Nov 5;10(11):4083-4102. doi: 10.1534/g3.120.401733.

Multi-environment Genomic Prediction of Plant Traits Using Deep Learners With Dense Architecture.

G3 (Bethesda). 2018 Dec 10;8(12):3813-3828. doi: 10.1534/g3.118.200740.

Multi-trait, Multi-environment Deep Learning Modeling for Genomic-Enabled Prediction of Plant Traits.

G3 (Bethesda). 2018 Dec 10;8(12):3829-3840. doi: 10.1534/g3.118.200728.

Applications of Support Vector Machine in Genomic Prediction in Pig and Maize Populations.

Front Genet. 2020 Dec 3;11:598318. doi: 10.3389/fgene.2020.598318. eCollection 2020.

A Bayesian Genomic Multi-output Regressor Stacking Model for Predicting Multi-trait Multi-environment Plant Breeding Data.

G3 (Bethesda). 2019 Oct 7;9(10):3381-3393. doi: 10.1534/g3.119.400336.

Identification of optimal prediction models using multi-omic data for selecting hybrid rice.

Heredity (Edinb). 2019 Sep;123(3):395-406. doi: 10.1038/s41437-019-0210-6. Epub 2019 Mar 25.

Multitrait machine- and deep-learning models for genomic selection using spectral information in a wheat breeding program.

Plant Genome. 2021 Nov;14(3):e20119. doi: 10.1002/tpg2.20119. Epub 2021 Sep 5.

引用本文的文献

Genomic prediction with kinship-based multiple kernel learning produces hypothesis on the underlying inheritance mechanisms of phenotypic traits.

Genome Biol. 2025 Apr 4;26(1):84. doi: 10.1186/s13059-025-03544-3.

Multiomics Research: Principles and Challenges in Integrated Analysis.

Biodes Res. 2024 Dec 5;6:0059. doi: 10.34133/bdr.0059. eCollection 2024.

Genomic prediction with NetGP based on gene network and multi-omics data in plants.

Plant Biotechnol J. 2025 Apr;23(4):1190-1201. doi: 10.1111/pbi.14577. Epub 2025 Feb 14.

Genomic Landscape and Prediction of Udder Traits in Saanen Dairy Goats.

Animals (Basel). 2025 Jan 17;15(2):261. doi: 10.3390/ani15020261.

Integration of machine learning and genome-wide association study to explore the genomic prediction accuracy of agronomic trait in oats (Avena sativa L.).

Plant Genome. 2025 Mar;18(1):e20549. doi: 10.1002/tpg2.20549.

Deep learning for genomic selection of aquatic animals.

Mar Life Sci Technol. 2024 Sep 27;6(4):631-650. doi: 10.1007/s42995-024-00252-y. eCollection 2024 Nov.

Big data and artificial intelligence-aided crop breeding: Progress and prospects.

J Integr Plant Biol. 2025 Mar;67(3):722-739. doi: 10.1111/jipb.13791. Epub 2024 Oct 28.

Machine Learning for the Genomic Prediction of Growth Traits in a Composite Beef Cattle Population.

Animals (Basel). 2024 Oct 18;14(20):3014. doi: 10.3390/ani14203014.

Evaluation of deep learning for predicting rice traits using structural and single-nucleotide genomic variants.

Plant Methods. 2024 Aug 10;20(1):121. doi: 10.1186/s13007-024-01250-y.

Enhancing genomic prediction with Stacking Ensemble Learning in Arabica Coffee.

Front Plant Sci. 2024 Jul 17;15:1373318. doi: 10.3389/fpls.2024.1373318. eCollection 2024.

本文引用的文献

New Deep Learning Genomic-Based Prediction Model for Multiple Traits with Binary, Ordinal, and Continuous Phenotypes.

G3 (Bethesda). 2019 May 7;9(5):1545-1556. doi: 10.1534/g3.119.300585.

Prospects and Challenges of Applied Genomic Selection-A New Paradigm in Breeding for Grain Yield in Bread Wheat.

Plant Genome. 2018 Nov;11(3). doi: 10.3835/plantgenome2018.03.0017.

Multi-trait, Multi-environment Deep Learning Modeling for Genomic-Enabled Prediction of Plant Traits.

G3 (Bethesda). 2018 Dec 10;8(12):3829-3840. doi: 10.1534/g3.118.200728.

Multi-environment Genomic Prediction of Plant Traits Using Deep Learners With Dense Architecture.

G3 (Bethesda). 2018 Dec 10;8(12):3813-3828. doi: 10.1534/g3.118.200740.

Can Deep Learning Improve Genomic Prediction of Complex Human Traits?

Genetics. 2018 Nov;210(3):809-819. doi: 10.1534/genetics.118.301298. Epub 2018 Aug 31.

Modern Machine Learning as a Benchmark for Fitting Neural Responses.

Front Comput Neurosci. 2018 Jul 19;12:56. doi: 10.3389/fncom.2018.00056. eCollection 2018.

Applications of Machine Learning Methods to Genomic Selection in Breeding Wheat for Rust Resistance.

Plant Genome. 2018 Jul;11(2). doi: 10.3835/plantgenome2017.11.0104.

Statistical and Machine Learning forecasting methods: Concerns and ways forward.

PLoS One. 2018 Mar 27;13(3):e0194889. doi: 10.1371/journal.pone.0194889. eCollection 2018.

Increasing Genomic-Enabled Prediction Accuracy by Modeling Genotype × Environment Interactions in Kansas Wheat.

Plant Genome. 2017 Jul;10(2). doi: 10.3835/plantgenome2016.12.0130.

Genome-Based Identification of Heterotic Patterns in Rice.

Rice (N Y). 2017 Dec;10(1):22. doi: 10.1186/s12284-017-0163-4. Epub 2017 May 19.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

深度学习、支持向量机和贝叶斯阈值最佳线性无偏预测在植物育种中预测有序性状的基准比较

A Benchmarking Between Deep Learning, Support Vector Machine and Bayesian Threshold Best Linear Unbiased Prediction for Predicting Ordinal Traits in Plant Breeding.

作者信息

机构信息

出版信息

相似文献

引用本文的文献

本文引用的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

相似文献

引用本文的文献

本文引用的文献