Department of Animal Science and Center for Integrated Animal Genomics, Iowa State University, Ames, Iowa 50011, USA.
Genetics. 2013 Jul;194(3):597-607. doi: 10.1534/genetics.113.152207. Epub 2013 May 2.
Genomic best linear unbiased prediction (BLUP) is a statistical method that uses relationships between individuals calculated from single-nucleotide polymorphisms (SNPs) to capture relationships at quantitative trait loci (QTL). We show that genomic BLUP exploits not only linkage disequilibrium (LD) and additive-genetic relationships, but also cosegregation to capture relationships at QTL. Simulations were used to study the contributions of those types of information to accuracy of genomic estimated breeding values (GEBVs), their persistence over generations without retraining, and their effect on the correlation of GEBVs within families. We show that accuracy of GEBVs based on additive-genetic relationships can decline with increasing training data size and speculate that modeling polygenic effects via pedigree relationships jointly with genomic breeding values using Bayesian methods may prevent that decline. Cosegregation information from half sibs contributes little to accuracy of GEBVs in current dairy cattle breeding schemes but from full sibs it contributes considerably to accuracy within family in corn breeding. Cosegregation information also declines with increasing training data size, and its persistence over generations is lower than that of LD, suggesting the need to model LD and cosegregation explicitly. The correlation between GEBVs within families depends largely on additive-genetic relationship information, which is determined by the effective number of SNPs and training data size. As genomic BLUP cannot capture short-range LD information well, we recommend Bayesian methods with t-distributed priors.
基因组最佳线性无偏预测(BLUP)是一种统计方法,它利用从单核苷酸多态性(SNP)计算得出的个体间关系来捕获数量性状基因座(QTL)上的关系。我们表明,基因组 BLUP 不仅利用了连锁不平衡(LD)和加性遗传关系,还利用了共分离来捕获 QTL 上的关系。模拟研究了这些信息类型对基因组估计育种值(GEBV)准确性的贡献、它们在没有重新训练的情况下在几代中的持久性,以及它们对家族内 GEBV 相关性的影响。我们表明,基于加性遗传关系的 GEBV 准确性可能会随着训练数据大小的增加而下降,我们推测通过贝叶斯方法联合系谱关系和基因组育种值来模拟多基因效应,可能会防止这种下降。半同胞的共分离信息对当前奶牛育种计划中 GEBV 的准确性贡献不大,但全同胞的共分离信息对玉米育种中家族内的准确性贡献很大。共分离信息也随着训练数据大小的增加而下降,其在几代中的持久性低于 LD,这表明需要明确地对 LD 和共分离进行建模。家族内 GEBV 之间的相关性在很大程度上取决于加性遗传关系信息,该信息由有效 SNP 数量和训练数据大小决定。由于基因组 BLUP 不能很好地捕捉短程 LD 信息,因此我们建议使用具有 t 分布先验的贝叶斯方法。