Du Feng-Xing, Clutter Archie C, Lohuis Michael M
Monsanto Company, St. Louis, MO 63137, USA.
Int J Biol Sci. 2007 Feb 10;3(3):166-78. doi: 10.7150/ijbs.3.166.
Knowledge of the extent and range of linkage disequilibrium (LD), defined as non-random association of alleles at two or more loci, in animal populations is extremely valuable in localizing genes affecting quantitative traits, identifying chromosomal regions under selection, studying population history, and characterizing/managing genetic resources and diversity. Two commonly used LD measures, r(2) and D', and their permutation based adjustments, were evaluated using genotypes of more than 6,000 pigs from six commercial lines (two terminal sire lines and four maternal lines) at ~4,500 autosomal SNPs (single nucleotide polymorphisms). The results indicated that permutation only partially removed the dependency of D' on allele frequency and that r(2) is a considerably more robust LD measure. The maximum r(2) was derived as a function of allele frequency. Using the same genotype dataset, the extent of LD in these pig populations was estimated for all possible syntenic SNP pairs using r(2) and the ratio of r(2) over its theoretical maximum. As expected, the extent of LD highest for SNP pairs was found in tightest linkage and decreased as their map distance increased. The level of LD found in these pig populations appears to be lower than previously implied in several other studies using microsatellite genotype data. For all pairs of SNPs approximately 3 centiMorgan (cM) apart, the average r(2) was equal to 0.1. Based on the average population-wise LD found in these six commercial pig lines, we recommend a spacing of 0.1 to 1 cM for a whole genome association study in pig populations.
连锁不平衡(LD)是指两个或多个位点上等位基因的非随机关联。了解动物群体中LD的程度和范围,对于定位影响数量性状的基因、识别受选择的染色体区域、研究群体历史以及表征/管理遗传资源和多样性具有极其重要的价值。我们使用来自六个商业品系(两个终端父本品系和四个母本品系)的6000多头猪在约4500个常染色体单核苷酸多态性(SNP)位点的基因型,对两种常用的LD测量方法r(2)和D'及其基于排列的调整方法进行了评估。结果表明,排列仅部分消除了D'对等位基因频率的依赖性,并且r(2)是一种更为稳健的LD测量方法。最大r(2)是作为等位基因频率的函数得出的。使用相同的基因型数据集,利用r(2)及其与理论最大值的比值,估计了这些猪群体中所有可能的同线SNP对的LD程度。正如预期的那样,SNP对的LD程度在最紧密的连锁中最高,并随着它们的图谱距离增加而降低。这些猪群体中发现的LD水平似乎低于之前其他一些使用微卫星基因型数据的研究中所暗示的水平。对于所有相距约3厘摩(cM)的SNP对,平均r(2)等于0.1。基于在这六个商业猪品系中发现的全群体平均LD,我们建议在猪群体的全基因组关联研究中,SNP的间距为0.1至1 cM。