Cuyabano Beatriz C D, Su Guosheng, Lund Mogens S
Center for Quantitative Genetics and Genomics, Department of Molecular Biology and Genetics, Aarhus University, Denmark.
BMC Genomics. 2014 Dec 23;15(1):1171. doi: 10.1186/1471-2164-15-1171.
A haplotype approach to genomic prediction using high density data in dairy cattle as an alternative to single-marker methods is presented. With the assumption that haplotypes are in stronger linkage disequilibrium (LD) with quantitative trait loci (QTL) than single markers, this study focuses on the use of haplotype blocks (haploblocks) as explanatory variables for genomic prediction. Haploblocks were built based on the LD between markers, which allowed variable reduction. The haploblocks were then used to predict three economically important traits (milk protein, fertility and mastitis) in the Nordic Holstein population.
The haploblock approach improved prediction accuracy compared with the commonly used individual single nucleotide polymorphism (SNP) approach. Furthermore, using an average LD threshold to define the haploblocks (LD≥0.45 between any two markers) increased the prediction accuracies for all three traits, although the improvement was most significant for milk protein (up to 3.1% improvement in prediction accuracy, compared with the individual SNP approach). Hotelling's t-tests were performed, confirming the improvement in prediction accuracy for milk protein. Because the phenotypic values were in the form of de-regressed proofs, the improved accuracy for milk protein may be due to higher reliability of the data for this trait compared with the reliability of the mastitis and fertility data. Comparisons between best linear unbiased prediction (BLUP) and Bayesian mixture models also indicated that the Bayesian model produced the most accurate predictions in every scenario for the milk protein trait, and in some scenarios for fertility.
The haploblock approach to genomic prediction is a promising method for genomic selection in animal breeding. Building haploblocks based on LD reduced the number of variables without the loss of information. This method may play an important role in the future genomic prediction involving while genome sequences.
提出了一种在奶牛中使用高密度数据进行基因组预测的单倍型方法,作为单标记方法的替代方法。假设单倍型与数量性状基因座(QTL)的连锁不平衡(LD)比单标记更强,本研究重点关注使用单倍型块(单倍型框)作为基因组预测的解释变量。单倍型框基于标记之间的LD构建,这允许减少变量。然后使用单倍型框预测北欧荷斯坦牛群体中的三个经济重要性状(乳蛋白、繁殖力和乳腺炎)。
与常用的单个单核苷酸多态性(SNP)方法相比,单倍型框方法提高了预测准确性。此外,使用平均LD阈值定义单倍型框(任意两个标记之间的LD≥0.45)提高了所有三个性状的预测准确性,尽管对乳蛋白的改善最为显著(与单个SNP方法相比,预测准确性提高了3.1%)。进行了霍特林t检验,证实了乳蛋白预测准确性的提高。由于表型值采用去回归证明的形式,乳蛋白准确性的提高可能是由于该性状数据的可靠性高于乳腺炎和繁殖力数据的可靠性。最佳线性无偏预测(BLUP)和贝叶斯混合模型之间的比较还表明,贝叶斯模型在乳蛋白性状的每种情况下以及在某些繁殖力情况下产生了最准确的预测。
基因组预测的单倍型框方法是动物育种中基因组选择的一种有前途的方法。基于LD构建单倍型框减少了变量数量而不损失信息。该方法可能在未来涉及全基因组序列的基因组预测中发挥重要作用。