F(ST)与最常见等位基因频率之间的关系。

The relationship between F(ST) and the frequency of the most frequent allele.

机构信息

Department of Evolutionary Biology and Science for Life Laboratory, Uppsala University, SE-752 36, Uppsala, Sweden.

出版信息

Genetics. 2013 Feb;193(2):515-28. doi: 10.1534/genetics.112.144758. Epub 2012 Nov 19.

DOI:10.1534/genetics.112.144758

PMID:23172852

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC3567740/

Abstract

F(ST) is frequently used as a summary of genetic differentiation among groups. It has been suggested that F(ST) depends on the allele frequencies at a locus, as it exhibits a variety of peculiar properties related to genetic diversity: higher values for biallelic single-nucleotide polymorphisms (SNPs) than for multiallelic microsatellites, low values among high-diversity populations viewed as substantially distinct, and low values for populations that differ primarily in their profiles of rare alleles. A full mathematical understanding of the dependence of F(ST) on allele frequencies, however, has been elusive. Here, we examine the relationship between F(ST) and the frequency of the most frequent allele, demonstrating that the range of values that F(ST) can take is restricted considerably by the allele-frequency distribution. For a two-population model, we derive strict bounds on F(ST) as a function of the frequency M of the allele with highest mean frequency between the pair of populations. Using these bounds, we show that for a value of M chosen uniformly between 0 and 1 at a multiallelic locus whose number of alleles is left unspecified, the mean maximum F(ST) is ∼0.3585. Further, F(ST) is restricted to values much less than 1 when M is low or high, and the contribution to the maximum F(ST) made by the most frequent allele is on average ∼0.4485. Using bounds on homozygosity that we have previously derived as functions of M, we describe strict bounds on F(ST) in terms of the homozygosity of the total population, finding that the mean maximum F(ST) given this homozygosity is 1 - ln 2 ≈ 0.3069. Our results provide a conceptual basis for understanding the dependence of F(ST) on allele frequencies and genetic diversity and for interpreting the roles of these quantities in computations of F(ST) from population-genetic data. Further, our analysis suggests that many unusual observations of F(ST), including the relatively low F(ST) values in high-diversity human populations from Africa and the relatively low estimates of F(ST) for microsatellites compared to SNPs, can be understood not as biological phenomena associated with different groups of populations or classes of markers but rather as consequences of the intrinsic mathematical dependence of F(ST) on the properties of allele-frequency distributions.

摘要

F(ST) 通常被用作群体间遗传分化的总结。有人认为，F(ST) 取决于基因座的等位基因频率，因为它表现出与遗传多样性有关的多种特殊性质：双等位基因单核苷酸多态性（SNP）的 F(ST) 值高于多等位基因微卫星，高度多样化的群体之间的 F(ST) 值较低，主要在稀有等位基因谱上存在差异的群体的 F(ST) 值较低。然而，人们一直难以完全理解 F(ST) 对等位基因频率的依赖关系。在这里，我们研究了 F(ST) 与最常见等位基因频率之间的关系，证明 F(ST) 的取值范围受到等位基因频率分布的极大限制。对于两群体模型，我们推导出了 F(ST) 作为两个群体之间具有最高平均频率的等位基因频率 M 的函数的严格边界。使用这些边界，我们表明，对于在多等位基因基因座上选择均匀分布在 0 到 1 之间的 M 值，其等位基因数量未指定，平均最大 F(ST) 约为 0.3585。此外，当 M 较低或较高时，F(ST) 受到限制，最常见等位基因对最大 F(ST) 的贡献平均约为 0.4485。使用我们之前推导的作为 M 的函数的杂合度边界，我们用种群的杂合度来描述 F(ST) 的严格边界，发现给定这个杂合度的平均最大 F(ST) 为 1-ln2≈0.3069。我们的结果为理解 F(ST) 对等位基因频率和遗传多样性的依赖关系以及解释这些数量在种群遗传数据中计算 F(ST) 的作用提供了概念基础。此外，我们的分析表明，许多 F(ST) 的异常观察结果，包括来自非洲的高度多样化人类群体中相对较低的 F(ST) 值以及与 SNP 相比微卫星中相对较低的 F(ST) 值，并不是与不同群体或标记类别的生物现象相关，而是 F(ST) 对等位基因频率分布特性的内在数学依赖关系的结果。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d5b7/3567740/cef4f40b176c/515fig1.jpg

相似文献

The relationship between F(ST) and the frequency of the most frequent allele.

Genetics. 2013 Feb;193(2):515-28. doi: 10.1534/genetics.112.144758. Epub 2012 Nov 19.

The relationship between homozygosity and the frequency of the most frequent allele.

Genetics. 2008 Aug;179(4):2027-36. doi: 10.1534/genetics.107.084772. Epub 2008 Aug 9.

Upper bounds on FST in terms of the frequency of the most frequent allele and total homozygosity: the case of a specified number of alleles.

Theor Popul Biol. 2014 Nov;97:20-34. doi: 10.1016/j.tpb.2014.08.001. Epub 2014 Aug 14.

Empirical distributions of F(ST) from large-scale human polymorphism data.

PLoS One. 2012;7(11):e49837. doi: 10.1371/journal.pone.0049837. Epub 2012 Nov 21.

Refining the relationship between homozygosity and the frequency of the most frequent allele.

J Math Biol. 2012 Jan;64(1-2):87-108. doi: 10.1007/s00285-011-0406-8. Epub 2011 Feb 9.

Haplotypic background of a private allele at high frequency in the Americas.

Mol Biol Evol. 2009 May;26(5):995-1016. doi: 10.1093/molbev/msp024. Epub 2009 Feb 12.

Bounding measures of genetic similarity and diversity using majorization.

J Math Biol. 2018 Sep;77(3):711-737. doi: 10.1007/s00285-018-1226-x. Epub 2018 Mar 22.

Evolution of Pacific/Asian populations inferred from HLA class II allele frequency distributions.

Tissue Antigens. 2000 May;55(5):383-400. doi: 10.1034/j.1399-0039.2000.550501.x.

, Jost's D, and F are similarly constrained by allele frequencies: A mathematical, simulation, and empirical study.

Mol Ecol. 2019 Apr;28(7):1624-1636. doi: 10.1111/mec.15000.

Inferring microevolutionary patterns from allele-size frequency distributions of minisatellite loci: a worldwide study of the APOB 3' hypervariable region polymorphism.

Hum Biol. 2000 Oct;72(5):733-51.

引用本文的文献

Using mathematical constraints to explain narrow ranges for allele-sharing dissimilarities.

Theor Popul Biol. 2025 Jun 17. doi: 10.1016/j.tpb.2025.05.002.

Mathematical bounds on r and the effect size in case-control genome-wide association studies.

Theor Popul Biol. 2025 Aug;164:1-11. doi: 10.1016/j.tpb.2025.04.003. Epub 2025 May 15.

Error rates in QST-FST comparisons depend on genetic architecture and estimation procedures.

Genetics. 2025 Apr 17;229(4). doi: 10.1093/genetics/iyaf034.

Comprehensive elucidation on the genetic profile of the Hezhou Han population an efficient InDel panel.

Forensic Sci Res. 2024 Apr 9;10(1):owae021. doi: 10.1093/fsr/owae021. eCollection 2025 Mar.

High Diversity and Low Genetic Differentiation Among Geographic Populations of in Western Canada.

Animals (Basel). 2025 Feb 18;15(4):578. doi: 10.3390/ani15040578.

Mathematical bounds on and the effect size in case-control genome-wide association studies.

bioRxiv. 2024 Dec 17:2024.12.17.628943. doi: 10.1101/2024.12.17.628943.

Using mathematical constraints to explain narrow ranges for allele-sharing dissimilarities.

bioRxiv. 2024 Nov 21:2024.11.19.624404. doi: 10.1101/2024.11.19.624404.

Error rates in - comparisons depend on genetic architecture and estimation procedures.

bioRxiv. 2024 Nov 1:2024.10.28.620737. doi: 10.1101/2024.10.28.620737.

Characterization of fine geographic scale population genetics in sugar kelp (Saccharina latissima) using genome-wide markers.

BMC Genomics. 2024 Sep 30;25(1):901. doi: 10.1186/s12864-024-10793-2.

Wright's Hierarchical F-Statistics.

Mol Biol Evol. 2024 May 3;41(5). doi: 10.1093/molbev/msae083.

本文引用的文献

PERSPECTIVE: HIGHLY VARIABLE LOCI AND THEIR INTERPRETATION IN EVOLUTION AND CONSERVATION.

Evolution. 1999 Apr;53(2):313-318. doi: 10.1111/j.1558-5646.1999.tb03767.x.

The genetical structure of populations.

Ann Eugen. 1951 Mar;15(4):323-54. doi: 10.1111/j.1469-1809.1949.tb02451.x.

Genomic patterns of homozygosity in worldwide human populations.

Am J Hum Genet. 2012 Aug 10;91(2):275-92. doi: 10.1016/j.ajhg.2012.06.014.

An abundance of rare functional variants in 202 drug target genes sequenced in 14,002 people.

Science. 2012 Jul 6;337(6090):100-4. doi: 10.1126/science.1217876. Epub 2012 May 17.

Evolution and functional impact of rare coding variation from deep sequencing of human exomes.

Science. 2012 Jul 6;337(6090):64-9. doi: 10.1126/science.1219240. Epub 2012 May 17.

Recent explosive human population growth has resulted in an excess of rare genetic variants.

Science. 2012 May 11;336(6082):740-3. doi: 10.1126/science.1217283.

Mathematical properties of Fst between admixed populations and their parental source populations.

Theor Popul Biol. 2011 Nov;80(3):208-16. doi: 10.1016/j.tpb.2011.05.003. Epub 2011 May 25.

Assessing population structure: F(ST) and related measures.

Mol Ecol Resour. 2011 Jan;11(1):5-18. doi: 10.1111/j.1755-0998.2010.02927.x. Epub 2010 Oct 26.

G'ST and D do not replace FST.

Mol Ecol. 2011 Mar;20(6):1083-91. doi: 10.1111/j.1365-294X.2010.04996.x. Epub 2011 Jan 19.

Refining the relationship between homozygosity and the frequency of the most frequent allele.

J Math Biol. 2012 Jan;64(1-2):87-108. doi: 10.1007/s00285-011-0406-8. Epub 2011 Feb 9.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

F(ST)与最常见等位基因频率之间的关系。

The relationship between F(ST) and the frequency of the most frequent allele.

机构信息

Department of Evolutionary Biology and Science for Life Laboratory, Uppsala University, SE-752 36, Uppsala, Sweden.

出版信息

Genetics. 2013 Feb;193(2):515-28. doi: 10.1534/genetics.112.144758. Epub 2012 Nov 19.

DOI:10.1534/genetics.112.144758

PMID:23172852

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC3567740/

Abstract

摘要

F(ST)与最常见等位基因频率之间的关系。

The relationship between F(ST) and the frequency of the most frequent allele.

机构信息

出版信息

相似文献

引用本文的文献

本文引用的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

F(ST)与最常见等位基因频率之间的关系。

The relationship between F(ST) and the frequency of the most frequent allele.

机构信息

出版信息

相似文献

引用本文的文献

本文引用的文献