Pausch Hubert, Emmerling Reiner, Schwarzenbacher Hermann, Fries Ruedi
Lehrstuhl fuer Tierzucht, Technische Universitaet Muenchen, 85354, Freising, Germany.
Institut fuer Tierzucht, Bayerische Landesanstalt fuer Landwirtschaft, 85586, Poing, Germany.
Genet Sel Evol. 2016 Feb 16;48:14. doi: 10.1186/s12711-016-0190-4.
The availability of whole-genome sequence data from key ancestors in bovine populations provides an exhaustive catalogue of polymorphic sites that segregate within and across cattle breeds. Sequence variants identified from the sequenced genome of key ancestors can be imputed into animals that have been genotyped using medium- and high-density genotyping arrays. Association analysis with imputed sequences, particularly when applied to multiple traits simultaneously, is a very powerful approach to detect candidate causal variants that underlie complex phenotypes.
We used whole-genome sequence data from 157 key ancestors of the German Fleckvieh cattle population to impute 20,561,798 sequence variants into 10,363 animals that had (partly imputed) genotypes based on 634,109 single nucleotide polymorphisms (SNPs). Rare variants were more frequent among the sequence-derived than the array-derived genotypes. Association studies with imputed sequence variants were performed using seven correlated udder conformation traits as response variables. The calculation of an approximate multi-trait test statistic enabled us to detect 12 quantitative trait loci (QTL) (P < 2.97 × 10(-9)) that affect different morphological features of the mammary gland. Among the tested variants, the most significant associations were found for imputed sequence variants at 11 QTL, whereas the top association signal was observed for an array-derived variant at a QTL on bovine chromosome 14. Seven QTL were associated with multiple phenotypes. Most QTL were located in non-coding regions of the genome but in close proximity of candidate genes that could be involved in mammary gland morphology (SP5, GC, NPFFR2, CRIM1, RXFP2, TBX5, RBM19 and ADAM12).
Using imputed sequence variants in association analyses allows the detection of QTL at maximum resolution. Multi-trait approaches can reveal QTL that are not detected in single-trait association studies. Most QTL for udder conformation traits were located in non-coding regions of the genome, which suggests that mutations in regulatory sequences are the major determinants of variation in mammary gland morphology in cattle.
牛群体中关键祖先的全基因组序列数据的可用性提供了一个详尽的多态性位点目录,这些位点在牛品种内部和之间分离。从关键祖先的测序基因组中鉴定出的序列变异可以推算到使用中高密度基因分型阵列进行基因分型的动物中。与推算序列的关联分析,特别是当同时应用于多个性状时,是检测复杂表型潜在候选因果变异的一种非常强大的方法。
我们使用来自德国弗莱维赫牛群体157个关键祖先的全基因组序列数据,将20,561,798个序列变异推算到10,363头具有(部分推算)基于634,109个单核苷酸多态性(SNP)的基因型的动物中。稀有变异在序列衍生的基因型中比在阵列衍生的基因型中更频繁。使用七个相关的乳房形态性状作为反应变量,对推算的序列变异进行了关联研究。近似多性状检验统计量的计算使我们能够检测到12个影响乳腺不同形态特征的数量性状位点(QTL)(P < 2.97 × 10(-9))。在测试的变异中,在11个QTL的推算序列变异中发现了最显著的关联,而在牛14号染色体上一个QTL的阵列衍生变异中观察到了最强的关联信号。七个QTL与多种表型相关。大多数QTL位于基因组的非编码区域,但靠近可能参与乳腺形态的候选基因(SP5、GC、NPFFR2、CRIM1、RXFP2、TBX5、RBM19和ADAM12)。
在关联分析中使用推算的序列变异可以以最大分辨率检测QTL。多性状方法可以揭示单性状关联研究中未检测到的QTL。大多数乳房形态性状的QTL位于基因组的非编码区域,这表明调控序列中的突变是牛乳腺形态变异的主要决定因素。