Slatkin M, Rannala B
Department of Integrative Biology, University of California, Berkeley 94720-3140, USA.
Genetics. 1997 Dec;147(4):1855-61. doi: 10.1093/genetics/147.4.1855.
A theory is developed that provides the sampling distribution of low frequency alleles at a single locus under the assumption that each allele is the result of a unique mutation. The numbers of copies of each allele is assumed to follow a linear birth-death process with sampling. If the population is of constant size, standard results from theory of birth-death processes show that the distribution of numbers of copies of each allele is logarithmic and that the joint distribution of numbers of copies of k alleles found in a sample of size n follows the Ewens sampling distribution. If the population from which the sample was obtained was increasing in size, if there are different selective classes of alleles, or if there are differences in penetrance among alleles, the Ewens distribution no longer applies. Likelihood functions for a given set of observations are obtained under different alternative hypotheses. These results are applied to published data from the BRCA1 locus (associated with early onset breast cancer) and the factor VIII locus (associated with hemophilia A) in humans. In both cases, the sampling distribution of alleles allows rejection of the null hypothesis, but relatively small deviations from the null model can account for the data. In particular, roughly the same population growth rate appears consistent with both data sets.
本文提出了一种理论,该理论在每个等位基因都是唯一突变结果的假设下,给出了单个基因座低频等位基因的抽样分布。假设每个等位基因的拷贝数遵循带抽样的线性生死过程。如果种群大小恒定,生死过程理论的标准结果表明,每个等位基因拷贝数的分布是对数分布,并且在大小为(n)的样本中发现的(k)个等位基因拷贝数的联合分布遵循尤恩斯抽样分布。如果获取样本的种群大小在增加,如果存在不同的等位基因选择类别,或者如果等位基因之间的外显率存在差异,尤恩斯分布就不再适用。在不同的替代假设下,得到了给定观测集的似然函数。这些结果应用于人类中与早发性乳腺癌相关的BRCA1基因座和与甲型血友病相关的凝血因子VIII基因座的已发表数据。在这两种情况下,等位基因的抽样分布都允许拒绝原假设,但与原模型的相对小偏差就可以解释这些数据。特别是,大致相同的种群增长率似乎与这两个数据集都一致。