Mangin Brigitte, Bonnafous Fanny, Blanchet Nicolas, Boniface Marie-Claude, Bret-Mestries Emmanuelle, Carrère Sébastien, Cottret Ludovic, Legrand Ludovic, Marage Gwenola, Pegot-Espagnet Prune, Munos Stéphane, Pouilly Nicolas, Vear Felicity, Vincourt Patrick, Langlade Nicolas B
LIPM, Université de Toulouse, INRA, Centre National de la Recherche ScientifiqueCastanet-Tolosan, France.
Terres Inovia, AGIRCastanet-Tolosan, France.
Front Plant Sci. 2017 Sep 21;8:1633. doi: 10.3389/fpls.2017.01633. eCollection 2017.
Prediction of hybrid performance using incomplete factorial mating designs is widely used in breeding programs including different heterotic groups. Based on the general combining ability (GCA) of the parents, predictions are accurate only if the genetic variance resulting from the specific combining ability is small and both parents have phenotyped descendants. Genomic selection (GS) can predict performance using a model trained on both phenotyped and genotyped hybrids that do not necessarily include all hybrid parents. Therefore, GS could overcome the issue of unknown parent GCA. Here, we compared the accuracy of classical GCA-based and genomic predictions for oil content of sunflower seeds using several GS models. Our study involved 452 sunflower hybrids from an incomplete factorial design of 36 female and 36 male lines. Re-sequencing of parental lines allowed to identify 468,194 non-redundant SNPs and to infer the hybrid genotypes. Oil content was observed in a multi-environment trial (MET) over 3 years, leading to nine different environments. We compared GCA-based model to different GS models including female and male genomic kinships with the addition of the female-by-male interaction genomic kinship, the use of functional knowledge as SNPs in genes of oil metabolic pathways, and with epistasis modeling. When both parents have descendants in the training set, the predictive ability was high even for GCA-based prediction, with an average MET value of 0.782. GS performed slightly better (+0.2%). Neither the inclusion of the female-by-male interaction, nor functional knowledge of oil metabolism, nor epistasis modeling improved the GS accuracy. GS greatly improved predictive ability when one or both parents were untested in the training set, increasing GCA-based predictive ability by 10.4% from 0.575 to 0.635 in the MET. In this scenario, performing GS only considering SNPs in oil metabolic pathways did not improve whole genome GS prediction but increased GCA-based prediction ability by 6.4%. Our results show that GS is a major improvement to breeding efficiency compared to the classical GCA modeling when either one or both parents are not well-characterized. This finding could therefore accelerate breeding through reducing phenotyping efforts and more effectively targeting for the most promising crosses.
利用不完全析因交配设计预测杂种性能在包括不同杂种优势群的育种计划中被广泛应用。基于亲本的一般配合力(GCA),只有当特殊配合力产生的遗传方差较小时且双亲都有表型后代时,预测才准确。基因组选择(GS)可以使用在表型和基因型杂种上训练的模型来预测性能,这些杂种不一定包括所有的杂种亲本。因此,GS可以克服亲本GCA未知的问题。在此,我们使用几种GS模型比较了基于经典GCA和基因组预测的向日葵种子含油量的准确性。我们的研究涉及来自36个雌性系和36个雄性系的不完全析因设计的452个向日葵杂种。亲本系的重测序能够鉴定出468,194个非冗余单核苷酸多态性(SNP)并推断杂种基因型。在3年的多环境试验(MET)中观察含油量,产生了9种不同的环境。我们将基于GCA的模型与不同的GS模型进行比较,包括雌性和雄性基因组亲缘关系,并添加雌性×雄性互作基因组亲缘关系,使用功能知识作为油脂代谢途径基因中的SNP,以及进行上位性建模。当双亲在训练集中都有后代时,即使是基于GCA的预测,预测能力也很高,MET平均值为0.782。GS的表现略好(提高0.2%)。雌性×雄性互作、油脂代谢的功能知识或上位性建模均未提高GS的准确性。当一个或两个亲本在训练集中未被测试时,GS极大地提高了预测能力,在MET中基于GCA的预测能力从0.575提高到0.635,提高了10.4%。在这种情况下,仅考虑油脂代谢途径中的SNP进行GS并没有改善全基因组GS预测,但基于GCA的预测能力提高了6.4%。我们的结果表明,当一个或两个亲本特征不明确时,与经典GCA建模相比,GS是育种效率的一项重大改进。因此,这一发现可以通过减少表型分析工作并更有效地针对最有前景的杂交组合来加速育种进程。