• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

梯度提升机和贝叶斯阈值 BLUP 用于小麦育种中基于基因组的分类性状预测的比较。

Comparing gradient boosting machine and Bayesian threshold BLUP for genome-based prediction of categorical traits in wheat breeding.

机构信息

Facultad de Telemática, Univ. de Colima, Colima, Colima, 28040, México.

Univ. Tecnológica de Manzanillo, Manzanillo, Colima, México.

出版信息

Plant Genome. 2022 Sep;15(3):e20214. doi: 10.1002/tpg2.20214. Epub 2022 May 10.

DOI:10.1002/tpg2.20214
PMID:35535459
Abstract

Genomic selection (GS) is a predictive methodology that is changing plant breeding. Genomic selection trains a statistical machine-learning model using available phenotypic and genotypic data with which predictions are performed for individuals that were only genotyped. For this reason, some statistical machine-learning methods are being implemented in GS, but in order to improve the selection of new genotypes early in the prediction process, the exploration of new statistical machine-learning algorithms must continue. In this paper, we performed a benchmarking study between the Bayesian threshold genomic best linear unbiased predictor model (TGBLUP; popular in GS) and the gradient boosting machine (GBM). This comparison was done using four real wheat (Triticum aestivum L.) data sets with categorical traits measured in terms of two metrics: the proportion of cases correctly classified (PCCC) and the Kappa coefficient in the testing set. Under 10 random partitions with four different sizes of testing proportions (20, 40, 60, and 80%), we compared the two algorithms and found that in three of the four data sets, the GBM outperformed the TGBLUP model in terms of both metrics (PCCC and Kappa coefficient). In the larger data sets (Data Sets 3 and 4), the gain in terms of prediction accuracy of the GBM was considerably significant. For this reason, we encourage more research using the GBM in GS to evaluate its virtues in terms of prediction performance in the context of GS.

摘要

基因组选择(GS)是一种改变植物育种的预测方法。基因组选择使用可用的表型和基因型数据训练统计机器学习模型,并用该模型对仅进行基因型分析的个体进行预测。出于这个原因,一些统计机器学习方法正在被应用于 GS,但为了在预测过程的早期更好地选择新的基因型,必须继续探索新的统计机器学习算法。在本文中,我们在贝叶斯阈值基因组最佳线性无偏预测模型(TGBLUP;在 GS 中很流行)和梯度提升机(GBM)之间进行了基准测试研究。这种比较是使用四个真实的小麦(Triticum aestivum L.)数据集进行的,这些数据集的分类性状是用两种度量来衡量的:测试集中正确分类的案例比例(PCCC)和 Kappa 系数。在 10 个具有 4 种不同测试比例(20%、40%、60%和 80%)的随机分区中,我们比较了两种算法,发现在四个数据集的三个中,GBM 在两个度量(PCCC 和 Kappa 系数)方面都优于 TGBLUP 模型。在较大的数据集(数据集 3 和数据集 4)中,GBM 在预测准确性方面的增益非常显著。因此,我们鼓励更多地使用 GBM 在 GS 中的研究,以评估其在 GS 背景下的预测性能方面的优势。

相似文献

1
Comparing gradient boosting machine and Bayesian threshold BLUP for genome-based prediction of categorical traits in wheat breeding.梯度提升机和贝叶斯阈值 BLUP 用于小麦育种中基于基因组的分类性状预测的比较。
Plant Genome. 2022 Sep;15(3):e20214. doi: 10.1002/tpg2.20214. Epub 2022 May 10.
2
A Benchmarking Between Deep Learning, Support Vector Machine and Bayesian Threshold Best Linear Unbiased Prediction for Predicting Ordinal Traits in Plant Breeding.深度学习、支持向量机和贝叶斯阈值最佳线性无偏预测在植物育种中预测有序性状的基准比较
G3 (Bethesda). 2019 Feb 7;9(2):601-618. doi: 10.1534/g3.118.200998.
3
Multitrait machine- and deep-learning models for genomic selection using spectral information in a wheat breeding program.利用小麦育种计划中的光谱信息,基于多种性状的机器和深度学习模型进行基因组选择。
Plant Genome. 2021 Nov;14(3):e20119. doi: 10.1002/tpg2.20119. Epub 2021 Sep 5.
4
A Comparison between Three Tuning Strategies for Gaussian Kernels in the Context of Univariate Genomic Prediction.三种高斯核调优策略在单变量基因组预测中的比较
Genes (Basel). 2022 Dec 3;13(12):2282. doi: 10.3390/genes13122282.
5
A Comparison of Three Machine Learning Methods for Multivariate Genomic Prediction Using the Sparse Kernels Method (SKM) Library.三种机器学习方法在使用稀疏核方法 (SKM) 库进行多元基因组预测中的比较。
Genes (Basel). 2022 Aug 21;13(8):1494. doi: 10.3390/genes13081494.
6
TrG2P: A transfer-learning-based tool integrating multi-trait data for accurate prediction of crop yield.TrG2P:一种基于迁移学习的工具,集成多性状数据,用于准确预测作物产量。
Plant Commun. 2024 Jul 8;5(7):100975. doi: 10.1016/j.xplc.2024.100975. Epub 2024 May 15.
7
Multi-Trait Genomic Prediction of Yield-Related Traits in US Soft Wheat under Variable Water Regimes.在不同水分条件下美国软小麦产量相关性状的多性状基因组预测。
Genes (Basel). 2020 Oct 28;11(11):1270. doi: 10.3390/genes11111270.
8
Using an incomplete block design to allocate lines to environments improves sparse genome-based prediction in plant breeding.使用不完全区组设计将品系分配到不同环境中可改善植物育种中基于稀疏基因组的预测。
Plant Genome. 2022 Mar;15(1):e20194. doi: 10.1002/tpg2.20194. Epub 2022 Feb 16.
9
Genomic Selection in Winter Wheat Breeding Using a Recommender Approach.利用推荐方法进行冬小麦育种中的基因组选择。
Genes (Basel). 2020 Jul 11;11(7):779. doi: 10.3390/genes11070779.
10
Genomic prediction of agronomic traits in wheat using different models and cross-validation designs.利用不同模型和交叉验证设计对小麦农艺性状进行基因组预测。
Theor Appl Genet. 2021 Jan;134(1):381-398. doi: 10.1007/s00122-020-03703-z. Epub 2020 Nov 1.

引用本文的文献

1
Genetic architecture and polygenic risk score prediction of degenerative suspensory ligament desmitis (DSLD) in the Peruvian Horse.秘鲁马退行性悬韧带腱炎(DSLD)的遗传结构与多基因风险评分预测
Front Genet. 2023 Aug 14;14:1201628. doi: 10.3389/fgene.2023.1201628. eCollection 2023.