Zapata C, Carollo C, Rodriguez S
Departamento de Biología Fundamental, Universidad de Santiago, Santiago de Compostela, Spain.
Ann Hum Genet. 2001 Jul;65(Pt 4):395-406. doi: 10.1017/S0003480001008697.
The development of the theory of estimation of gametic disequilibrium for multiallelic systems is particularly necessary, since a large number of the genetic markers available at present are highly polymorphic multiallelic systems. The D' coefficient is one of the most commonly used measures of the extent of overall disequilibrium between all possible pairs of alleles at two multiallelic loci. Nevertheless, the sampling properties of this measure of overall disequilibrium, are to date, unknown. In this work, we have derived explicit expressions by large-sample theory to compute the approximate sampling variance of Dhat' between pairs of multiallelic loci, when samples of haplotypes are taken from populations. Formulae for calculating the asymptotic sampling variance were checked by Monte Carlo simulation. In addition, the magnitude of the sampling variance of Dhat' was investigated under different scenarios of disequilibrium between multiallelic loci. Extensive simulations were also carried out for describing the sampling distribution of Dhat', conditioned on the sample size, number of alleles and their frequencies, and disequilibrium components. It was found that the sampling distribution of Dhat' generally approaches well the theoretical normal distribution for experimental sample sizes, particularly when loci have many alleles. Disequilibrium data between microsatellite loci of human chromosome 11p are used for illustration. These investigations increase substantially our knowledge about this widely used measure of overall disequilibrium, which is relevant to evaluate disequilibrium between multiallelic loci in populations.
多等位基因系统配子不平衡估计理论的发展尤为必要,因为目前可用的大量遗传标记都是高度多态的多等位基因系统。D'系数是衡量两个多等位基因座上所有可能等位基因对之间总体不平衡程度最常用的指标之一。然而,这种总体不平衡指标的抽样特性至今仍不清楚。在这项工作中,我们通过大样本理论推导出了明确的表达式,用于计算从群体中抽取单倍型样本时,多等位基因座对之间Dhat'的近似抽样方差。通过蒙特卡罗模拟检验了计算渐近抽样方差的公式。此外,还研究了在多等位基因座之间不同不平衡情况下Dhat'抽样方差的大小。还进行了广泛的模拟,以描述Dhat'的抽样分布,该分布取决于样本大小、等位基因数量及其频率以及不平衡成分。结果发现,对于实验样本大小,Dhat'的抽样分布通常很好地接近理论正态分布,特别是当基因座有许多等位基因时。以人类11号染色体短臂微卫星基因座之间的不平衡数据为例进行说明。这些研究大大增加了我们对这种广泛使用的总体不平衡指标的了解,这对于评估群体中多等位基因座之间的不平衡是相关的。