Liu Xiran, Ahsan Zarif, Rosenberg Noah A
Institute for Computational and Mathematical Engineering, Stanford University, Stanford, CA 94305, United States of America.
Department of Biology, Stanford University, Stanford, CA 94305, United States of America.
Theor Popul Biol. 2025 Jun 17. doi: 10.1016/j.tpb.2025.05.002.
Allele-sharing dissimilarity (ASD) statistics are measures of genetic differentiation for pairs of individuals or populations. Given the allele-frequency distributions of two populations - possibly the same population - the expected value of an ASD statistic is computed by evaluating the expectation of the pairwise dissimilarity between two individuals drawn at random, each from its associated allele-frequency distribution. For each of two ASD statistics, which we term D and D, we investigate the extent to which the expected ASD is constrained by allele frequencies in the two populations; in other words, how is the magnitude of the measure bounded as a function of the frequency of the most frequent allelic type? We first consider dissimilarity of a population with itself, obtaining bounds on expected ASD in terms of the frequency of the most frequent allelic type in the population. We then examine pairs of populations that might or might not possess the same most frequent allelic type. Across the unit interval for the frequency of the most frequent allelic type, the expected allele-sharing dissimilarity has a range that is more restricted than the [0,1] interval. The mathematical constraints on expected ASD assist in explaining a pattern observed empirically in human populations, namely that when averaging across loci, allele-sharing dissimilarities between pairs of individuals often tend to vary only within a relatively narrow range.
等位基因共享差异(ASD)统计量是衡量个体或群体对之间遗传分化的指标。给定两个群体(可能是同一个群体)的等位基因频率分布,通过评估从其相关等位基因频率分布中随机抽取的两个个体之间成对差异的期望值,来计算ASD统计量的期望值。对于我们称为D和D的两个ASD统计量中的每一个,我们研究期望的ASD在多大程度上受到两个群体中等位基因频率的限制;换句话说,该度量的大小如何作为最常见等位基因类型频率的函数受到限制?我们首先考虑一个群体与其自身的差异,根据群体中最常见等位基因类型的频率获得期望ASD的界限。然后我们研究可能具有或不具有相同最常见等位基因类型的群体对。在最常见等位基因类型频率的单位区间内,期望的等位基因共享差异的范围比[0,1]区间更受限制。对期望ASD的数学约束有助于解释在人类群体中凭经验观察到的一种模式,即当对各个基因座进行平均时,个体对之间的等位基因共享差异通常仅在相对较窄的范围内变化。