Weir B S
Biometrics. 1979 Mar;35(1):235-54.
Existing theory for inferences about linkage disequilibrium is restricted to a measure defined on gametic frequencies. Unless gametic frequencies are directly observable, they are inferred from genotypic frequencies under the assumption of random union of gametes. Primary emphasis in this paper is given to genotypic data, and disequilibrium coefficients are defined for all subsets of two or more of the four genes, two at each of two loci, carried by an individual. Linkage disequilibrium coefficients are defined for genes within and between gametes, and methods of estimating and testing these coefficients are given for gametic data. For genotypic data, when coupling and repulsion double heterozygotes cannot be distinguished. Burrows' composite measure of linkage disequilibrium is discussed. In particular, the estimate for this measure and hypothesis tests based on it are compared to the usual maximum likelihood estimate of gametic linkage disequilibrium, and corresponding likelihood ratio or contingency chi-square tests. General use of the composite measure, whether or not random union of gametes is an appropriate assumption, is recommended. Attention is given to small samples, where the non-normality of gene frequencies will have greatest effect on methods of inference based on normal theory. Even tools such as Fisher's z-transformation for the correlation of gene frequencies are found to perform quite satisfactorily.
关于连锁不平衡推断的现有理论仅限于基于配子频率定义的一种度量。除非配子频率可以直接观测到,否则它们是在配子随机结合的假设下从基因型频率推断出来的。本文主要关注基因型数据,并为个体携带的四个基因(两个位点各两个)中两个或更多基因的所有子集定义了不平衡系数。为配子内和配子间的基因定义了连锁不平衡系数,并给出了配子数据中这些系数的估计和检验方法。对于基因型数据,当无法区分相引和相斥双杂合子时,讨论了伯罗斯连锁不平衡的综合度量。特别地,将该度量的估计值和基于它的假设检验与配子连锁不平衡的常用最大似然估计值以及相应的似然比或列联卡方检验进行了比较。建议无论配子随机结合是否是一个合适的假设,都普遍使用这种综合度量。本文关注小样本情况,在小样本中基因频率的非正态性对基于正态理论的推断方法影响最大。甚至像用于基因频率相关性的费希尔z变换这样的工具也被发现表现得相当令人满意。