遗传参数和超参数估计是面包小麦氮利用效率的基础。

Genetic Parameter and Hyper-Parameter Estimation Underlie Nitrogen Use Efficiency in Bread Wheat.

机构信息

INRES-Plant Breeding, Rheinische Friedrich-Wilhelms-Universität Bonn, 53113 Bonn, Germany.

INRES-Plant Nutrition, Rheinische Friedrich-Wilhelms-Universität Bonn, 53113 Bonn, Germany.

出版信息

Int J Mol Sci. 2023 Sep 19;24(18):14275. doi: 10.3390/ijms241814275.

DOI:10.3390/ijms241814275

PMID:37762585

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC10531695/

Abstract

Estimation and prediction play a key role in breeding programs. Currently, phenotyping of complex traits such as nitrogen use efficiency (NUE) in wheat is still expensive, requires high-throughput technologies and is very time consuming compared to genotyping. Therefore, researchers are trying to predict phenotypes based on marker information. Genetic parameters such as population structure, genomic relationship matrix, marker density and sample size are major factors that increase the performance and accuracy of a model. However, they play an important role in adjusting the statistically significant false discovery rate (FDR) threshold in estimation. In parallel, there are many genetic hyper-parameters that are hidden and not represented in the given genomic selection (GS) model but have significant effects on the results, such as panel size, number of markers, minor allele frequency, number of call rates for each marker, number of cross validations and batch size in the training set of the genomic file. The main challenge is to ensure the reliability and accuracy of predicted breeding values (BVs) as results. Our study has confirmed the results of bias-variance tradeoff and adaptive prediction error for the ensemble-learning-based model STACK, which has the highest performance when estimating genetic parameters and hyper-parameters in a given GS model compared to other models.

摘要

估计和预测在育种计划中起着关键作用。目前，与基因分型相比，对小麦等复杂性状（如氮利用效率[NUE]）的表型进行测定仍然很昂贵，需要高通量技术且非常耗时。因此，研究人员正试图根据标记信息来预测表型。群体结构、基因组关系矩阵、标记密度和样本大小等遗传参数是提高模型性能和准确性的主要因素。然而，它们在调整估计中统计上显著的错误发现率（FDR）阈值方面起着重要作用。同时，还有许多遗传超参数隐藏在给定的基因组选择（GS）模型中，并未表示出来，但对结果有重大影响，例如面板大小、标记数量、次要等位基因频率、每个标记的调用率数量、交叉验证次数和基因组文件训练集中的批量大小。主要的挑战是确保预测育种值（BV）的可靠性和准确性。我们的研究证实了基于集成学习的模型 STACK 的偏差-方差权衡和自适应预测误差的结果，与其他模型相比，该模型在估计给定 GS 模型中的遗传参数和超参数时具有最高的性能。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a509/10531695/b68ab757af1f/ijms-24-14275-g001a.jpg

相似文献

Genetic Parameter and Hyper-Parameter Estimation Underlie Nitrogen Use Efficiency in Bread Wheat.遗传参数和超参数估计是面包小麦氮利用效率的基础。

Int J Mol Sci. 2023 Sep 19;24(18):14275. doi: 10.3390/ijms241814275.

Phenomic selection in wheat breeding: identification and optimisation of factors influencing prediction accuracy and comparison to genomic selection.小麦育种中的表型组选择：影响预测准确性的因素的鉴定与优化以及与基因组选择的比较

Theor Appl Genet. 2022 Mar;135(3):895-914. doi: 10.1007/s00122-021-04005-8. Epub 2022 Jan 6.

A classic approach for determining genomic prediction accuracy under terminal drought stress and well-watered conditions in wheat landraces and cultivars.一种经典的方法，用于确定小麦地方品种和栽培品种在终末干旱胁迫和充分供水条件下的基因组预测准确性。

PLoS One. 2021 Mar 5;16(3):e0247824. doi: 10.1371/journal.pone.0247824. eCollection 2021.

Optimising Genomic Selection in Wheat: Effect of Marker Density, Population Size and Population Structure on Prediction Accuracy.优化小麦基因组选择：标记密度、群体大小和群体结构对预测准确性的影响

G3 (Bethesda). 2018 Aug 30;8(9):2889-2899. doi: 10.1534/g3.118.200311.

Genomic prediction of agronomic traits in wheat using different models and cross-validation designs.利用不同模型和交叉验证设计对小麦农艺性状进行基因组预测。

Theor Appl Genet. 2021 Jan;134(1):381-398. doi: 10.1007/s00122-020-03703-z. Epub 2020 Nov 1.

Development and validation of KASP assays for genes underpinning key economic traits in bread wheat.开发和验证 KASP 分析用于支持面包小麦主要经济性状的关键基因。

Theor Appl Genet. 2016 Oct;129(10):1843-60. doi: 10.1007/s00122-016-2743-x. Epub 2016 Jun 15.

Resource allocation optimization with multi-trait genomic prediction for bread wheat (Triticum aestivum L.) baking quality.利用多性状基因组预测优化面包小麦（Triticum aestivum L.）烘焙品质的资源分配。

Theor Appl Genet. 2018 Dec;131(12):2719-2731. doi: 10.1007/s00122-018-3186-3. Epub 2018 Sep 19.

Genomic Selection for Processing and End-Use Quality Traits in the CIMMYT Spring Bread Wheat Breeding Program.利用基因组选择改良 CIMMYT 春小麦育种计划的加工和用途品质性状

Plant Genome. 2016 Jul;9(2). doi: 10.3835/plantgenome2016.01.0005.

Validating the prediction accuracies of marker-assisted and genomic selection of Fusarium head blight resistance in wheat using an independent sample.使用独立样本验证小麦赤霉病抗性的标记辅助选择和基因组选择的预测准确性。

Theor Appl Genet. 2017 Mar;130(3):471-482. doi: 10.1007/s00122-016-2827-7. Epub 2016 Nov 17.

Genome-wide association mapping and genomic prediction of agronomical traits and breeding values in Iranian wheat under rain-fed and well-watered conditions.在雨养和充分灌溉条件下，对伊朗小麦的农艺性状和育种值进行全基因组关联图谱绘制和基因组预测。

BMC Genomics. 2022 Dec 15;23(1):831. doi: 10.1186/s12864-022-08968-w.

本文引用的文献

Improving Genomic Prediction with Machine Learning Incorporating TPE for Hyperparameters Optimization.通过结合树状 Parzen 估计器进行超参数优化的机器学习改进基因组预测。

Biology (Basel). 2022 Nov 11;11(11):1647. doi: 10.3390/biology11111647.

Stacked kinship CNN vs. GBLUP for genomic predictions of additive and complex continuous phenotypes.堆叠亲缘关系 CNN 与 GBLUP 用于加性和复杂连续表型的基因组预测。

Sci Rep. 2022 Nov 18;12(1):19889. doi: 10.1038/s41598-022-24405-0.

Editorial: Genomic Selection: Lessons Learned and Perspectives.社论：基因组选择：经验教训与展望

Front Plant Sci. 2022 May 27;13:890434. doi: 10.3389/fpls.2022.890434. eCollection 2022.

Genome-Enabled Prediction Methods Based on Machine Learning.基于机器学习的基因组预测方法

Methods Mol Biol. 2022;2467:189-218. doi: 10.1007/978-1-0716-2205-6_7.

Efficient learning rate adaptation based on hierarchical optimization approach.基于层次优化方法的高效学习率自适应。

Neural Netw. 2022 Jun;150:326-335. doi: 10.1016/j.neunet.2022.02.014. Epub 2022 Feb 25.

Prediction performance of linear models and gradient boosting machine on complex phenotypes in outbred mice.线性模型和梯度提升机在外交小鼠复杂表型上的预测性能。

G3 (Bethesda). 2022 Apr 4;12(4). doi: 10.1093/g3journal/jkac039.

Heuristic hyperparameter optimization of deep learning models for genomic prediction.启发式深度学习模型的基因组预测超参数优化。

G3 (Bethesda). 2021 Jul 14;11(7). doi: 10.1093/g3journal/jkab032.

Genomic Prediction Using Bayesian Regression Models With Global-Local Prior.使用具有全局-局部先验的贝叶斯回归模型进行基因组预测

Front Genet. 2021 Apr 15;12:628205. doi: 10.3389/fgene.2021.628205. eCollection 2021.

A Stacking Ensemble Learning Framework for Genomic Prediction.一种用于基因组预测的堆叠集成学习框架。

Front Genet. 2021 Mar 4;12:600040. doi: 10.3389/fgene.2021.600040. eCollection 2021.

Feature Selection Stability and Accuracy of Prediction Models for Genomic Prediction of Residual Feed Intake in Pigs Using Machine Learning.使用机器学习对猪的剩余采食量进行基因组预测的预测模型的特征选择稳定性和准确性

Front Genet. 2021 Feb 22;12:611506. doi: 10.3389/fgene.2021.611506. eCollection 2021.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验

遗传参数和超参数估计是面包小麦氮利用效率的基础。

Genetic Parameter and Hyper-Parameter Estimation Underlie Nitrogen Use Efficiency in Bread Wheat.

机构信息

出版信息

相似文献

本文引用的文献

文献检索

文件翻译

深度研究

Suppr 超能文献

相似文献

本文引用的文献