对FST-杂合性异常值方法的限制。

Constraints on the FST-Heterozygosity Outlier Approach.

作者信息

Flanagan Sarah P, Jones Adam G

机构信息

Biology Department, Texas A&M University, 3258 TAMU, College Station, TX 77843; and National Institute for Mathematical and Biological Synthesis, University of Tennessee, Knoxville, TN.

出版信息

J Hered. 2017 Jul 1;108(5):561-573. doi: 10.1093/jhered/esx048.

DOI:10.1093/jhered/esx048

PMID:28486592

Abstract

The FST-heterozygosity outlier approach has been a popular method for identifying loci under balancing and positive selection since Beaumont and Nichols first proposed it in 1996 and recommended its use for studies sampling a large number of independent populations (at least 10). Since then, their program FDIST2 and a user-friendly program optimized for large datasets, LOSITAN, have been used widely in the population genetics literature, often without the requisite number of samples. We observed empirical datasets whose distributions could not be reconciled with the confidence intervals generated by the null coalescent island model. Here, we use forward-in-time simulations to investigate circumstances under which the FST-heterozygosity outlier approach performs poorly for next-generation single nucleotide polymorphism (SNP) datasets. Our results show that samples involving few independent populations, particularly when migration rates are low, result in distributions of the FST-heterozygosity relationship that are not described by the null model implemented in LOSITAN. In addition, even under favorable conditions LOSITAN rarely provides confidence intervals that precisely fit SNP data, making the associated P-values only roughly valid at best. We present an alternative method, implemented in a new R package named fsthet, which uses the raw empirical data to generate smoothed outlier plots for the FST-heterozygosity relationship.

摘要

自1996年博蒙特和尼科尔斯首次提出FST杂合度异常值方法，并建议将其用于对大量独立群体（至少10个）进行抽样的研究以来，该方法一直是识别平衡选择和正选择位点的常用方法。从那时起，他们的程序FDIST2以及为大型数据集优化的用户友好程序LOSITAN，在群体遗传学文献中被广泛使用，而这些文献往往没有达到所需的样本数量。我们观察到一些经验数据集，其分布与零合并岛模型生成的置信区间不一致。在这里，我们使用时间向前模拟来研究FST杂合度异常值方法在下一代单核苷酸多态性（SNP）数据集上表现不佳的情况。我们的结果表明，涉及少量独立群体的样本，特别是当迁移率较低时，会导致FST杂合度关系的分布无法用LOSITAN中实现的零模型来描述。此外，即使在有利条件下，LOSITAN也很少能提供精确拟合SNP数据的置信区间，这使得相关的P值充其量也只是大致有效。我们提出了一种替代方法，在一个名为fsthet的新R包中实现，该方法使用原始经验数据生成FST杂合度关系的平滑异常值图。

相似文献

Constraints on the FST-Heterozygosity Outlier Approach.对FST-杂合性异常值方法的限制。

J Hered. 2017 Jul 1;108(5):561-573. doi: 10.1093/jhered/esx048.

LOSITAN: a workbench to detect molecular adaptation based on a Fst-outlier method.LOSITAN：一个基于Fst异常值法检测分子适应性的工作台。

BMC Bioinformatics. 2008 Jul 28;9:323. doi: 10.1186/1471-2105-9-323.

GppFst: genomic posterior predictive simulations of FST and dXY for identifying outlier loci from population genomic data.GppFst：FST和dXY的基因组后验预测模拟，用于从群体基因组数据中识别异常位点。

Bioinformatics. 2017 May 1;33(9):1414-1415. doi: 10.1093/bioinformatics/btw795.

Evaluation of demographic history and neutral parameterization on the performance of FST outlier tests.评估人口历史和中性参数化对 FST 异常值检验性能的影响。

Mol Ecol. 2014 May;23(9):2178-92. doi: 10.1111/mec.12725. Epub 2014 Apr 11.

Identifying outlier loci in admixed and in continuous populations using ancestral population differentiation statistics.使用祖先群体分化统计量在混合群体和连续群体中识别异常位点。

Mol Ecol. 2016 Oct;25(20):5029-5042. doi: 10.1111/mec.13822. Epub 2016 Sep 14.

Measuring population differentiation using GST or D? A simulation study with microsatellite DNA markers under a finite island model and nonequilibrium conditions.使用 GST 或 D 衡量种群分化？在有限岛屿模型和非平衡条件下使用微卫星 DNA 标记的模拟研究。

Mol Ecol. 2011 Jun;20(12):2494-509. doi: 10.1111/j.1365-294X.2011.05108.x. Epub 2011 May 9.

The empirical Bayes estimators of fine-scale population structure in high gene flow species.高基因流物种中精细种群结构的经验贝叶斯估计。

Mol Ecol Resour. 2017 Nov;17(6):1210-1222. doi: 10.1111/1755-0998.12663. Epub 2017 Apr 5.

Comparison of F(ST) outlier tests for SNP loci under selection.比较 SNP 位点选择下的 F(ST) 异常值检验。

Mol Ecol Resour. 2011 Mar;11 Suppl 1:184-94. doi: 10.1111/j.1755-0998.2011.02987.x. Epub 2011 Feb 6.

A robust statistical method to detect null alleles in microsatellite and SNP datasets in both panmictic and inbred populations.一种强大的统计方法，用于在随机交配群体和近交群体的微卫星和单核苷酸多态性数据集中检测无效等位基因。

Stat Appl Genet Mol Biol. 2011;10:Article 9. doi: 10.2202/1544-6115.1620.

Development and preliminary evaluation of a genomewide single nucleotide polymorphisms resource generated by RAD-seq for the small yellow croaker (Larimichthys polyactis).利用 RAD-seq 技术开发并初步评估小黄鱼（Larimichthys polyactis）全基因组单核苷酸多态性资源。

Mol Ecol Resour. 2016 May;16(3):755-68. doi: 10.1111/1755-0998.12476. Epub 2015 Oct 29.

引用本文的文献

Genomic Signals of Local Adaptation Associated With Environmental Variables in From Northern Chilean Patagonia.智利北部巴塔哥尼亚地区与环境变量相关的局部适应的基因组信号

Ecol Evol. 2025 Jun 24;15(6):e71524. doi: 10.1002/ece3.71524. eCollection 2025 Jun.

Genetic variations associated with adaptation in Acrocomia palms: A comparative study across the Neotropics for crop improvement.与桄榔属棕榈适应相关的遗传变异：一项针对新热带地区作物改良的比较研究。

PLoS One. 2025 Jun 13;20(6):e0324340. doi: 10.1371/journal.pone.0324340. eCollection 2025.

Genome-wide study for signatures of selection identifies genomic regions and candidate genes associated with milk traits in sheep.全基因组选择特征研究确定了与绵羊产奶性状相关的基因组区域和候选基因。

Mamm Genome. 2025 Mar;36(1):140-150. doi: 10.1007/s00335-025-10107-1. Epub 2025 Feb 4.

Population genomics of a natural Cannabis sativa L. collection from Iran identifies novel genetic loci for flowering time, morphology, sex and chemotyping.来自伊朗的天然大麻（Cannabis sativa L.）群体基因组学研究确定了开花时间、形态、性别和化学分型的新基因座。

BMC Plant Biol. 2025 Jan 21;25(1):80. doi: 10.1186/s12870-025-06045-4.

Delimitation of Endangered Species (Anura: Telmatobiidae) of the Chilean Salt Puna.智利盐普纳地区濒危物种（无尾目：雨蛙科）的划定

Animals (Basel). 2024 Dec 15;14(24):3612. doi: 10.3390/ani14243612.

Genotyping by sequencing reveals lack of local genetic structure between two German L. populations.测序基因分型显示两个德国莱茵衣藻种群之间缺乏局部遗传结构。

For Res (Fayettev). 2022 Jan 26;2:1. doi: 10.48130/FR-2022-0001. eCollection 2022.

Hardy-Weinberg Equilibrium in Meta-Analysis Studies and Large-Scale Genomic Sequencing Era.荟萃分析研究和大规模基因组测序时代的哈迪-温伯格平衡。

Asian Pac J Cancer Prev. 2024 Jul 1;25(7):2229-2235. doi: 10.31557/APJCP.2024.25.7.2229.

Genome scans reveal signals of selection associated with pollution in fish populations of Basilichthys microlepidotus, an endemic species of Chile.基因组扫描揭示了与智利特有物种巴氏南美脂鲤种群中污染相关的选择信号。

Sci Rep. 2024 Jul 8;14(1):15727. doi: 10.1038/s41598-024-66121-x.

Low Diversity and High Genetic Structure for Mart., an Endangered Fruit Tree Species.濒危果树物种马氏栲（Castanopsis lamontii Hance）的低多样性和高遗传结构

Plants (Basel). 2024 Apr 6;13(7):1033. doi: 10.3390/plants13071033.

Capra hircus outliers markers in Brazil: Searching for genomic regions under the action of natural selection.巴西山羊的异常标记：寻找自然选择作用下的基因组区域。

Genet Mol Biol. 2023 Oct 20;46(3):e20230084. doi: 10.1590/1678-4685-GMB-2023-0084. eCollection 2023.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验

对FST-杂合性异常值方法的限制。

Constraints on the FST-Heterozygosity Outlier Approach.

作者信息

机构信息

出版信息

相似文献

引用本文的文献

文献检索

文件翻译

深度研究

Suppr 超能文献

相似文献

引用本文的文献