Widener Sarah, Graef George, Lipka Alexander E, Jarquin Diego
Department of Crop Sciences, University of Illinois, Urbana, IL, United States.
Department of Agronomy and Horticulture, University of Nebraska, Lincoln, NE, United States.
Front Genet. 2021 Jul 23;12:689319. doi: 10.3389/fgene.2021.689319. eCollection 2021.
The effects of climate change create formidable challenges for breeders striving to produce sufficient food quantities in rapidly changing environments. It is therefore critical to investigate the ability of multi-environment genomic prediction (GP) models to predict genomic estimated breeding values (GEBVs) in extreme environments. Exploration of the impact of training set composition on the accuracy of such GEBVs is also essential. Accordingly, we examined the influence of the number of training environments and the use of environmental covariates (ECs) in GS models on four subsets of = 500 lines of the soybean nested association mapping (SoyNAM) panel grown in nine environments in the US-North Central Region. The ensuing analyses provided insights into the influence of both of these factors for predicting grain yield in the most and the least extreme of these environments. We found that only a subset of the available environments was needed to obtain the highest observed prediction accuracies. The inclusion of ECs in the GP model did not substantially increase prediction accuracies relative to competing models, and instead more often resulted in negative prediction accuracies. Combined with the overall low prediction accuracies for grain yield in the most extreme environment, our findings highlight weaknesses in current GP approaches for prediction in extreme environments, and point to specific areas on which to focus future research efforts.
气候变化的影响给育种者带来了巨大挑战,他们努力在快速变化的环境中生产足够数量的粮食。因此,研究多环境基因组预测(GP)模型在极端环境中预测基因组估计育种值(GEBVs)的能力至关重要。探索训练集组成对这类GEBVs准确性的影响也必不可少。相应地,我们在美国中北部地区的9个环境中种植了500个大豆嵌套关联作图(SoyNAM)群体品系的四个子集,研究了训练环境数量和基因组选择(GS)模型中环境协变量(ECs)的使用对其的影响。随后的分析深入了解了这两个因素对这些环境中最极端和最不极端环境下预测籽粒产量的影响。我们发现,只需可用环境的一个子集就能获得最高的观测预测准确性。与竞争模型相比,在GP模型中纳入ECs并没有显著提高预测准确性,反而更常导致负的预测准确性。结合最极端环境下籽粒产量的总体预测准确性较低,我们的研究结果凸显了当前GP方法在极端环境预测中的弱点,并指出了未来研究工作应关注的具体领域。