Wang Yu, Mette Michael Florian, Miedaner Thomas, Gottwald Marlen, Wilde Peer, Reif Jochen C, Zhao Yusheng
Leibniz Institute of Plant Breeding and Crop Plant Research (IPK), Gatersleben 06466, Germany.
BMC Genomics. 2014 Jul 4;15(1):556. doi: 10.1186/1471-2164-15-556.
Marker-assisted selection (MAS) and genomic selection (GS) based on genome-wide marker data provide powerful tools to predict the genotypic value of selection material in plant breeding. However, case-to-case optimization of these approaches is required to achieve maximum accuracy of prediction with reasonable input.
Based on extended field evaluation data for grain yield, plant height, starch content and total pentosan content of elite hybrid rye derived from testcrosses involving two bi-parental populations that were genotyped with 1048 molecular markers, we compared the accuracy of prediction of MAS and GS in a cross-validation approach. MAS delivered generally lower and in addition potentially over-estimated accuracies of prediction than GS by ridge regression best linear unbiased prediction (RR-BLUP). The grade of relatedness of the plant material included in the estimation and test sets clearly affected the accuracy of prediction of GS. Within each of the two bi-parental populations, accuracies differed depending on the relatedness of the respective parental lines. Across populations, accuracy increased when both populations contributed to estimation and test set. In contrast, accuracy of prediction based on an estimation set from one population to a test set from the other population was low despite that the two bi-parental segregating populations under scrutiny shared one parental line. Limiting the number of locations or years in field testing reduced the accuracy of prediction of GS equally, supporting the view that to establish robust GS calibration models a sufficient number of test locations is of similar importance as extended testing for more than one year.
In hybrid rye, genomic selection is superior to marker-assisted selection. However, it achieves high accuracies of prediction only for selection candidates closely related to the plant material evaluated in field trials, resulting in a rather pessimistic prognosis for distantly related material. Both, the numbers of evaluation locations and testing years in trials contribute equally to prediction accuracy.
基于全基因组标记数据的标记辅助选择(MAS)和基因组选择(GS)为预测植物育种中选择材料的基因型值提供了强大工具。然而,需要针对具体情况对这些方法进行优化,以便在合理投入的情况下实现最高的预测准确性。
基于对涉及两个双亲群体的测交后代的优良杂交黑麦的籽粒产量、株高、淀粉含量和总戊聚糖含量的扩展田间评估数据,这两个双亲群体用1048个分子标记进行了基因分型,我们采用交叉验证方法比较了MAS和GS的预测准确性。通过岭回归最佳线性无偏预测(RR-BLUP),MAS的预测准确性通常低于GS,而且可能存在高估。估计集和测试集中包含的植物材料的亲缘关系等级明显影响GS的预测准确性。在两个双亲群体中的每一个群体内,准确性因各自亲本系的亲缘关系而异。在不同群体间,当两个群体都对估计集和测试集有贡献时,准确性会提高。相比之下,尽管所研究的两个双亲分离群体共享一个亲本系,但基于一个群体的估计集对另一个群体的测试集进行预测的准确性很低。在田间试验中限制地点或年份的数量会同样降低GS的预测准确性,这支持了这样一种观点,即要建立稳健的GS校准模型,足够数量的测试地点与超过一年的扩展测试同样重要。
在杂交黑麦中,基因组选择优于标记辅助选择。然而,它仅对与田间试验中评估的植物材料密切相关的选择候选者实现了较高的预测准确性,这对远缘相关材料的预后相当悲观。试验中的评估地点数量和测试年份对预测准确性的贡献相同。