利用参数和非参数稀疏选择指数进行玉米产量的多世代基因组预测。

Multi-generation genomic prediction of maize yield using parametric and non-parametric sparse selection indices.

机构信息

Department of Plant, Soil and Microbial Sciences, Michigan State University, East Lansing, MI, USA.

Department of Epidemiology and Biostatistics, Michigan State University, East Lansing, MI, USA.

出版信息

Heredity (Edinb). 2021 Nov;127(5):423-432. doi: 10.1038/s41437-021-00474-1. Epub 2021 Sep 25.

DOI:10.1038/s41437-021-00474-1

PMID:34564692

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC8551287/

Abstract

Genomic prediction models are often calibrated using multi-generation data. Over time, as data accumulates, training data sets become increasingly heterogeneous. Differences in allele frequency and linkage disequilibrium patterns between the training and prediction genotypes may limit prediction accuracy. This leads to the question of whether all available data or a subset of it should be used to calibrate genomic prediction models. Previous research on training set optimization has focused on identifying a subset of the available data that is optimal for a given prediction set. However, this approach does not contemplate the possibility that different training sets may be optimal for different prediction genotypes. To address this problem, we recently introduced a sparse selection index (SSI) that identifies an optimal training set for each individual in a prediction set. Using additive genomic relationships, the SSI can provide increased accuracy relative to genomic-BLUP (GBLUP). Non-parametric genomic models using Gaussian kernels (KBLUP) have, in some cases, yielded higher prediction accuracies than standard additive models. Therefore, here we studied whether combining SSIs and kernel methods could further improve prediction accuracy when training genomic models using multi-generation data. Using four years of doubled haploid maize data from the International Maize and Wheat Improvement Center (CIMMYT), we found that when predicting grain yield the KBLUP outperformed the GBLUP, and that using SSI with additive relationships (GSSI) lead to 5-17% increases in accuracy, relative to the GBLUP. However, differences in prediction accuracy between the KBLUP and the kernel-based SSI were smaller and not always significant.

摘要

基因组预测模型通常使用多代数据进行校准。随着时间的推移，随着数据的积累，训练数据集变得越来越不均匀。训练和预测基因型之间等位基因频率和连锁不平衡模式的差异可能会限制预测准确性。这就提出了一个问题，即应该使用所有可用数据还是其中的一个子集来校准基因组预测模型。以前关于训练集优化的研究主要集中在确定给定预测集的最佳可用数据子集上。然而，这种方法并没有考虑到不同的训练集可能对不同的预测基因型是最优的。为了解决这个问题，我们最近引入了一种稀疏选择指数（SSI），它可以为预测集中的每个个体确定最佳的训练集。使用加性基因组关系，SSI 可以相对于基因组-BLUP（GBLUP）提供更高的准确性。使用高斯核（KBLUP）的非参数基因组模型在某些情况下产生的预测准确性高于标准加性模型。因此，在这里，我们研究了在使用多代数据训练基因组模型时，结合 SSI 和核方法是否可以进一步提高预测准确性。我们使用来自国际玉米小麦改良中心（CIMMYT）的四年双倍单倍体玉米数据，发现当预测谷物产量时，KBLUP 优于 GBLUP，而使用加性关系的 SSI（GSSI）相对于 GBLUP 可将准确性提高 5-17%。然而，KBLUP 和基于核的 SSI 之间的预测准确性差异较小，并不总是显著。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2d5d/8551287/2d3a83ba286f/41437_2021_474_Fig1_HTML.jpg

相似文献

Multi-generation genomic prediction of maize yield using parametric and non-parametric sparse selection indices.利用参数和非参数稀疏选择指数进行玉米产量的多世代基因组预测。

Heredity (Edinb). 2021 Nov;127(5):423-432. doi: 10.1038/s41437-021-00474-1. Epub 2021 Sep 25.

Genomic-Enabled Prediction in Maize Using Kernel Models with Genotype × Environment Interaction.利用具有基因型×环境互作的核模型对玉米进行基因组预测

G3 (Bethesda). 2017 Jun 7;7(6):1995-2014. doi: 10.1534/g3.117.042341.

Bayesian Genomic Prediction with Genotype × Environment Interaction Kernel Models.使用基因型×环境互作核模型的贝叶斯基因组预测

G3 (Bethesda). 2017 Jan 5;7(1):41-53. doi: 10.1534/g3.116.035584.

Sparse kernel models provide optimization of training set design for genomic prediction in multiyear wheat breeding data.稀疏核模型为多年小麦育种数据中的基因组预测提供了训练集设计的优化。

Plant Genome. 2022 Dec;15(4):e20254. doi: 10.1002/tpg2.20254. Epub 2022 Aug 31.

Optimal breeding-value prediction using a sparse selection index.利用稀疏选择指数进行最优育种值预测。

Genetics. 2021 May 17;218(1). doi: 10.1093/genetics/iyab030.

Enviromic-based kernels may optimize resource allocation with multi-trait multi-environment genomic prediction for tropical Maize.基于环境组学的核函数可通过热带玉米的多性状多环境基因组预测来优化资源分配。

BMC Plant Biol. 2023 Jan 5;23(1):10. doi: 10.1186/s12870-022-03975-1.

Using markers with large effect in genetic and genomic predictions.在遗传和基因组预测中使用具有大效应的标记。

J Anim Sci. 2017 Jan;95(1):59-71. doi: 10.2527/jas.2016.0754.

Calibration and validation of predicted genomic breeding values in an advanced cycle maize population.在一个先进的循环玉米群体中预测基因组育种值的校准和验证。

Theor Appl Genet. 2021 Sep;134(9):3069-3081. doi: 10.1007/s00122-021-03880-5. Epub 2021 Jun 12.

Accuracy of Genomic Prediction in Synthetic Populations Depending on the Number of Parents, Relatedness, and Ancestral Linkage Disequilibrium.取决于亲本数量、亲缘关系和祖先连锁不平衡的合成群体中基因组预测的准确性。

Genetics. 2017 Jan;205(1):441-454. doi: 10.1534/genetics.116.193243. Epub 2016 Nov 9.

Genomic prediction in maize breeding populations with genotyping-by-sequencing.基于测序的基因型鉴定在玉米育种群体中的基因组预测。

G3 (Bethesda). 2013 Nov 6;3(11):1903-26. doi: 10.1534/g3.113.008227.

引用本文的文献

GWAS-assisted and multitrait genomic prediction for improvement of seed yield and canning quality traits in a black bean breeding panel.在一个黑豆育种群体中，利用全基因组关联研究辅助和多性状基因组预测来改良种子产量和罐头品质性状。

G3 (Bethesda). 2025 Mar 18;15(3). doi: 10.1093/g3journal/jkaf007.

Experimental evaluation of effectiveness of genomic selection for resistance to northern corn leaf blight in maize.玉米对北方玉米叶斑病抗性的基因组选择有效性的实验评估。

J Appl Genet. 2024 Oct 24. doi: 10.1007/s13353-024-00911-x.

Utilizing genomic prediction to boost hybrid performance in a sweet corn breeding program.利用基因组预测提升甜玉米育种计划中的杂种性能。

Front Plant Sci. 2024 Apr 25;15:1293307. doi: 10.3389/fpls.2024.1293307. eCollection 2024.

Genome-wide association and genomic prediction for iron and zinc concentration and iron bioavailability in a collection of yellow dry beans.对一批黄干豆中铁和锌浓度以及铁生物利用度的全基因组关联研究和基因组预测

Front Genet. 2024 Feb 6;15:1330361. doi: 10.3389/fgene.2024.1330361. eCollection 2024.

BMC Plant Biol. 2023 Jan 5;23(1):10. doi: 10.1186/s12870-022-03975-1.

本文引用的文献

Maximizing efficiency of genomic selection in CIMMYT's tropical maize breeding program.最大限度地提高 CIMMYT 热带玉米育种计划中基因组选择的效率。

Theor Appl Genet. 2021 Jan;134(1):279-294. doi: 10.1007/s00122-020-03696-9. Epub 2020 Oct 10.

Accounting for Group-Specific Allele Effects and Admixture in Genomic Predictions: Theory and Experimental Evaluation in Maize.在基因组预测中考虑群体特异性等位基因效应和混合：玉米中的理论和实验评估。

Genetics. 2020 Sep;216(1):27-41. doi: 10.1534/genetics.120.303278. Epub 2020 Jul 17.

Will Big Data Close the Missing Heritability Gap?大数据能否弥合遗传缺失的鸿沟？

Genetics. 2017 Nov;207(3):1135-1145. doi: 10.1534/genetics.117.300271. Epub 2017 Sep 11.

Updating the reference population to achieve constant genomic prediction reliability across generations.更新参考群体以实现跨世代基因组预测可靠性的恒定。

Animal. 2016 Jun;10(6):1018-24. doi: 10.1017/S1751731115002785. Epub 2015 Dec 29.

Assessment of Genetic Heterogeneity in Structured Plant Populations Using Multivariate Whole-Genome Regression Models.使用多变量全基因组回归模型评估结构化植物群体中的遗传异质性。

Genetics. 2015 Sep;201(1):323-37. doi: 10.1534/genetics.115.177394. Epub 2015 Jun 29.

Genome-wide regression and prediction with the BGLR statistical package.使用BGLR统计软件包进行全基因组回归与预测。

Genetics. 2014 Oct;198(2):483-95. doi: 10.1534/genetics.114.164442. Epub 2014 Jul 9.

Genomic predictability of interconnected biparental maize populations.玉米双亲亲本群体的基因组可预测性。

Genetics. 2013 Jun;194(2):493-503. doi: 10.1534/genetics.113.150227. Epub 2013 Mar 27.

The effect of linkage disequilibrium and family relationships on the reliability of genomic prediction.连锁不平衡和家族关系对基因组预测可靠性的影响。

Genetics. 2013 Feb;193(2):621-31. doi: 10.1534/genetics.112.146290. Epub 2012 Dec 24.

Multibreed genomic evaluations using purebred Holsteins, Jerseys, and Brown Swiss.使用纯种荷斯坦牛、娟姗牛和瑞士褐牛进行多品种基因组评估。

J Dairy Sci. 2012 Sep;95(9):5378-5383. doi: 10.3168/jds.2011-5006.

Maximizing the reliability of genomic selection by optimizing the calibration set of reference individuals: comparison of methods in two diverse groups of maize inbreds (Zea mays L.).通过优化参考个体的校准集来提高基因组选择的可靠性：两种不同群体的玉米自交系（Zea mays L.）中的方法比较。

Genetics. 2012 Oct;192(2):715-28. doi: 10.1534/genetics.112.141473. Epub 2012 Aug 3.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

利用参数和非参数稀疏选择指数进行玉米产量的多世代基因组预测。

Multi-generation genomic prediction of maize yield using parametric and non-parametric sparse selection indices.

机构信息

出版信息

相似文献

引用本文的文献

本文引用的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

相似文献

引用本文的文献

本文引用的文献