Rogers, Alan R.; Huff, Chad
Department of Anthropology, University of Utah, Salt Lake City, Utah 84112, USA.
Genetics. 2009 Jul;182(3):839-44. doi: 10.1534/genetics.108.093153. Epub 2009 May 11.
Linkage disequilibrium is often measured by two statistics, D and r, which can be interpreted as the covariance and the correlation between loci and across gametes. When data consist of diploid genotypes, however, gametes cannot be identified. A variety of iterative statistical methods are used in such cases, all of which assume random mating. Previous work has shown that D and r can be expressed as covariances and correlations across diploid genotypes, provided that mating is random. We show here that this result also holds approximately when mating is nonrandom. This provides a means of estimating these parameters without iteration and without assuming random mating. This estimator is nearly as accurate as the widely used EM estimator and is many times faster.