Lipkin E, Straus K, Stein R Tal, Bagnato A, Schiavini F, Fontanesi L, Russo V, Medugorac I, Foerster M, Sölkner J, Dolezal M, Medrano J F, Friedmann A, Soller M
Department of Animal Science, University of California, Davis, California 95616, USA.
Genetics. 2009 Feb;181(2):691-9. doi: 10.1534/genetics.108.097402. Epub 2008 Dec 15.
Great interest was aroused by reports, based on microsatellite markers, of high levels of statistically significant long-range and nonsyntenic linkage disequilibrium (LD) in livestock. Simulation studies showed that this could result from population family structure. In contrast, recent SNP-based studies of livestock populations report much lower levels of LD. In this study we show, on the basis of microsatellite data from four cattle populations, that high levels of long-range LD are indeed obtained when using the multi-allelic D' measure of LD. Long-range and nonsyntenic LD are exceedingly low, however, when evaluated by the standardized chi-square measure of LD, which stands in relation to the predictive ability of LD. Furthermore, specially constructed study populations provided no evidence for appreciable LD resulting from family structure at the grandparent level. We propose that the high statistical significance and family structure effects observed in the earlier studies are due to the use of large sample sizes, which accord high statistical significance to even slight deviations from asymptotic expectations under the null hypothesis. Nevertheless, even after taking sample size into account, our results indicate that microsatellites testify to the presence of usable LD at considerably wider separation distances than SNPs, suggesting that use of SNP haplotypes may considerably increase the usefulness of a given fixed SNP array.
基于微卫星标记的报告显示,家畜中存在高水平具有统计学显著意义的长程和非共线性连锁不平衡(LD),这引起了人们极大的兴趣。模拟研究表明,这可能是由群体家族结构导致的。相比之下,近期基于单核苷酸多态性(SNP)对家畜群体的研究报告显示,LD水平要低得多。在本研究中,我们基于四个牛群体的微卫星数据表明,当使用多等位基因的LD衡量指标D'时,确实会得到高水平的长程LD。然而,当通过与LD预测能力相关的标准化卡方衡量指标来评估时,长程和非共线性LD极低。此外,专门构建的研究群体没有提供证据表明祖父母级别的家族结构会导致明显的LD。我们认为,早期研究中观察到的高统计学显著性和家族结构效应是由于使用了大样本量,这使得即使在零假设下与渐近期望的轻微偏差也具有很高的统计学显著性。尽管如此,即使考虑了样本量,我们的结果表明,微卫星证明在比SNP大得多的间隔距离处存在可用的LD,这表明使用SNP单倍型可能会大大提高给定固定SNP阵列的有用性。