Rosenberg Noah A, Jakobsson Mattias
Department of Human Genetics, Center for Computational Medicine and Biology, and the Life Sciences Institute, University of Michigan, Ann Arbor, Michigan 48109-2218, USA.
Genetics. 2008 Aug;179(4):2027-36. doi: 10.1534/genetics.107.084772. Epub 2008 Aug 9.
Homozygosity is a commonly used summary of allele-frequency distributions at polymorphic loci. Because high-frequency alleles contribute disproportionately to the homozygosity of a locus, it often occurs that most homozygotes are homozygous for the most frequent allele. To assess the relationship between homozygosity and the highest allele frequency at a locus, for a given homozygosity value, we determine the lower and upper bounds on the frequency of the most frequent allele. These bounds suggest tight constraints on the frequency of the most frequent allele as a function of homozygosity, differing by at most 1/4 and having an average difference of 2/3 - π²/18 ≈ 0.1184. The close connection between homozygosity and the frequency of the most frequent allele, which we illustrate using allele frequencies from human populations, has the consequence that when one of these two quantities is known, considerable information is available about the other quantity. This relationship also explains the similar performance of statistical tests of population-genetic models that rely on homozygosity and those that rely on the frequency of the most frequent allele, and it provides a basis for understanding the utility of extended homozygosity statistics in identifying haplotypes that have been elevated to high frequency as a result of positive selection.
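The abstract does not state the bounds themselves, so the following Python sketch rests on an assumption: it reconstructs them from two elementary constraints. With H the sum of squared allele frequencies and M the largest frequency, M² ≤ H gives the upper bound √H; the largest H attainable for a given M (as many alleles as possible at frequency M plus one remainder allele) gives the lower bound (1 + √((KH - 1)/(K - 1)))/K with K = ⌈1/H⌉. The function names are hypothetical and are not taken from the paper. Under these assumed formulas, the widest gap (1/4) and the average gap (about 0.1184) quoted above can be checked numerically.

```python
import math


def homozygosity(freqs):
    """Homozygosity of a locus: the sum of squared allele frequencies."""
    return sum(p * p for p in freqs)


def max_freq_bounds(h):
    """Return (lower, upper) bounds on the most frequent allele's frequency.

    Assumed reconstruction, not quoted from the abstract:
      upper = sqrt(h)
      lower = (1 + sqrt((k*h - 1) / (k - 1))) / k, with k = ceil(1 / h)
    """
    if not 0.0 < h <= 1.0:
        raise ValueError("homozygosity must lie in (0, 1]")
    if h == 1.0:
        # A single allele at frequency 1: both bounds coincide.
        return 1.0, 1.0
    k = math.ceil(1.0 / h)  # k >= 2 whenever h < 1
    # Clamp guards against tiny negative values from rounding at h = 1/k.
    inner = max((k * h - 1.0) / (k - 1.0), 0.0)
    lower = (1.0 + math.sqrt(inner)) / k
    upper = math.sqrt(h)
    return lower, upper


def mean_gap(n=100_000):
    """Midpoint-rule average of (upper - lower) over h in (0, 1)."""
    total = 0.0
    for i in range(n):
        lower, upper = max_freq_bounds((i + 0.5) / n)
        total += upper - lower
    return total / n


if __name__ == "__main__":
    freqs = [0.5, 0.3, 0.2]  # illustrative frequencies, not real data
    h = homozygosity(freqs)
    lower, upper = max_freq_bounds(h)
    print(f"H = {h:.4f}, bounds on M = [{lower:.4f}, {upper:.4f}], M = {max(freqs)}")
    print(f"gap at H = 1/4: {max_freq_bounds(0.25)[1] - max_freq_bounds(0.25)[0]:.4f}")
    print(f"mean gap: {mean_gap():.4f}")
```

For the illustrative frequencies above, the observed maximum frequency 0.5 falls inside the computed interval, which is the sense in which knowing H constrains M.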