Department of Animal Sciences, University of Wisconsin-Madison, Wisconsin, USA.
BMC Genomics. 2014 Feb 7;15:109. doi: 10.1186/1471-2164-15-109.
Genome-wide association studies have been deemed successful for identifying statistically associated genetic variants of large effects on complex traits. Past studies have found enrichment of trait-associated SNPs in functionally annotated regions, while depletion was reported for intergenic regions (IGR). However, no systematic examination of connections between genomic regions and predictive ability of complex phenotypes has been carried out.
In this study, we partitioned SNPs based on their annotation to characterize genomic regions that deliver low and high predictive power for three broiler traits in chickens using a whole-genome approach. Additive genomic relationship kernels were constructed for each of the genic regions considered, and a kernel-based Bayesian ridge regression was employed as prediction machine. We found that the predictive performance for ultrasound area of breast meat from using genic regions marked by SNPs was consistently better than that from SNPs in IGR, while IGR tagged by SNPs were better than the genic regions for body weight and hen house egg production. We also noted that predictive ability delivered by the whole battery of markers was close to the best prediction achieved by one of the genomic regions.
Whole-genome regression methods use all available quality filtered SNPs into a model, contrary to accommodating only validated SNPs from exonic or coding regions. Our results suggest that, while differences among genomic regions in terms of predictive ability were observed, the whole-genome approach remains as a promising tool if interest is on prediction of complex traits.
全基因组关联研究已被证明成功地鉴定了对复杂性状具有大效应的统计学关联的遗传变异。过去的研究发现,与性状相关的 SNP 在功能注释区域中富集,而在基因间区域(IGR)中则减少。然而,还没有系统地研究基因组区域与复杂表型预测能力之间的联系。
在这项研究中,我们根据注释将 SNPs 进行分区,使用全基因组方法对三个肉鸡性状的低预测能力和高预测能力的基因组区域进行特征描述。对考虑的每个基因区域都构建了加性基因组关系核,并使用核基贝叶斯岭回归作为预测机。我们发现,使用 SNP 标记的基因区域进行乳房肉超声面积预测的表现始终优于 IGR 中的 SNP,而 SNP 标记的 IGR 优于体重和母鸡舍产蛋的基因区域。我们还注意到,整套标记物提供的预测能力接近通过一个基因组区域实现的最佳预测。
全基因组回归方法将所有可用的经过质量过滤的 SNP 纳入模型中,而不是仅从外显子或编码区域中纳入经过验证的 SNP。我们的结果表明,尽管观察到基因组区域在预测能力方面存在差异,但如果关注的是复杂性状的预测,全基因组方法仍然是一种很有前途的工具。