Kristensen Peter S, Jahoor Ahmed, Andersen Jeppe R, Cericola Fabio, Orabi Jihad, Janss Luc L, Jensen Just
Nordic Seed A/S, Odder, Denmark.
Department of Molecular Biology and Genetics, Center for Quantitative Genetics and Genomics, Aarhus University, Tjele, Denmark.
Front Plant Sci. 2018 Feb 2;9:69. doi: 10.3389/fpls.2018.00069. eCollection 2018.
The aim of the this study was to identify SNP markers associated with five important wheat quality traits (grain protein content, Zeleny sedimentation, test weight, thousand-kernel weight, and falling number), and to investigate the predictive abilities of GBLUP and Bayesian Power Lasso models for genomic prediction of these traits. In total, 635 winter wheat lines from two breeding cycles in the Danish plant breeding company Nordic Seed A/S were phenotyped for the quality traits and genotyped for 10,802 SNPs. GWAS were performed using single marker regression and Bayesian Power Lasso models. SNPs with large effects on Zeleny sedimentation were found on chromosome 1B, 1D, and 5D. However, GWAS failed to identify single SNPs with significant effects on the other traits, indicating that these traits were controlled by many QTL with small effects. The predictive abilities of the models for genomic prediction were studied using different cross-validation strategies. Leave-One-Out cross-validations resulted in correlations between observed phenotypes corrected for fixed effects and genomic estimated breeding values of 0.50 for grain protein content, 0.66 for thousand-kernel weight, 0.70 for falling number, 0.71 for test weight, and 0.79 for Zeleny sedimentation. Alternative cross-validations showed that the genetic relationship between lines in training and validation sets had a bigger impact on predictive abilities than the number of lines included in the training set. Using Bayesian Power Lasso instead of GBLUP models, gave similar or slightly higher predictive abilities. Genomic prediction based on all SNPs was more effective than prediction based on few associated SNPs.
本研究的目的是鉴定与五个重要小麦品质性状(籽粒蛋白质含量、泽伦尼沉降值、容重、千粒重和降落数值)相关的单核苷酸多态性(SNP)标记,并研究基因组最佳线性无偏预测(GBLUP)和贝叶斯功率套索模型对这些性状进行基因组预测的能力。丹麦植物育种公司北欧种子有限公司(Nordic Seed A/S)两个育种周期的635个冬小麦品系进行了品质性状表型分析和10,802个SNP的基因分型。使用单标记回归和贝叶斯功率套索模型进行全基因组关联研究(GWAS)。在1B、1D和5D染色体上发现了对泽伦尼沉降值有重大影响的SNP。然而,GWAS未能鉴定出对其他性状有显著影响的单个SNP,表明这些性状受许多微效数量性状位点(QTL)控制。使用不同的交叉验证策略研究了模型的基因组预测能力。留一法交叉验证得出,校正固定效应后的观测表型与基因组估计育种值之间的相关性,籽粒蛋白质含量为0.50,千粒重为0.66,降落数值为0.70,容重为0.71,泽伦尼沉降值为0.79。其他交叉验证表明,训练集和验证集中品系之间的遗传关系对预测能力的影响大于训练集中品系的数量。使用贝叶斯功率套索模型而非GBLUP模型,预测能力相似或略高。基于所有SNP的基因组预测比基于少数相关SNP的预测更有效。