Ankamah-Yeboah Theresa, Janss Lucas Lodewijk, Jensen Jens Due, Hjortshøj Rasmus Lund, Rasmussen Søren Kjærsgaard
Department of Plant and Environmental Sciences, Faculty of Science, University of Copenhagen, Frederiksberg, Denmark.
Center for Quantitative Genetics and Genomics, Department of Molecular Biology and Genetics, Aarhus University, Aarhus, Denmark.
Front Plant Sci. 2020 May 8;11:539. doi: 10.3389/fpls.2020.00539. eCollection 2020.
With the current advances in the development of low-cost high-density array-based DNA marker technologies, cereal breeding programs are increasingly relying on genomic selection as a tool to accelerate the rate of genetic gain in seed quality traits. Different sources of genetic information are being explored, with the most prevalent being combined additive information from marker and pedigree-based data, and their interaction with the environment. In this, there has been mixed evidence on the performance of use of these data. This study undertook an extensive analysis of 907 elite winter barley ( L.) lines across multiple environments from two breeding companies. Six genomic prediction models were evaluated to demonstrate the effect of using pedigree and marker information individually and in combination, as well their interactions with the environment. Each model was evaluated using three cross-validation schemes that allows the prediction of newly developed lines (lines that have not been evaluated in any environment), prediction of new or unobserved years, and prediction of newly developed lines in unobserved years. The results showed that the best prediction model depends on the cross-validation scheme employed. In predicting newly developed lines in known environments, marker information had no advantage to pedigree information. Predictions in this scenario also benefited from including genotype-by-environment interaction in the models. However, when predicting lines and years not observed previously, marker information was superior to pedigree data. Nonetheless, such scenarios did not benefit from the addition of genotype-by-environment interaction. A combination of pedigree-based and marker-based information produced a similar or only marginal improvement in prediction ability. It was also discovered that combining populations from the different breeding programs to increase training population size had no advantage in prediction.
随着低成本高密度阵列DNA标记技术的不断发展,谷物育种计划越来越依赖基因组选择作为加速种子品质性状遗传增益的工具。目前正在探索不同的遗传信息来源,其中最普遍的是来自标记和系谱数据的综合加性信息,以及它们与环境的相互作用。在这方面,关于使用这些数据的表现存在不同的证据。本研究对来自两家育种公司的907个优良冬大麦(L.)品系在多个环境下进行了广泛分析。评估了六种基因组预测模型,以证明单独使用和组合使用系谱和标记信息的效果,以及它们与环境的相互作用。每个模型使用三种交叉验证方案进行评估,这些方案可以预测新育成的品系(未在任何环境中评估过的品系)、预测新的或未观察到的年份,以及预测未观察到的年份中新育成的品系。结果表明,最佳预测模型取决于所采用的交叉验证方案。在已知环境中预测新育成的品系时,标记信息相对于系谱信息没有优势。在这种情况下,模型中纳入基因型与环境的相互作用也有助于预测。然而,当预测以前未观察到的品系和年份时,标记信息优于系谱数据。尽管如此,这种情况并没有因添加基因型与环境的相互作用而受益。基于系谱和基于标记的信息相结合,在预测能力上产生了相似的或仅略有改善。还发现,将来自不同育种计划的群体组合起来以增加训练群体规模,在预测方面没有优势。