Searching for footprints of positive selection in whole-genome SNP data from nonequilibrium populations.

Affiliation

Department of Biology II, Ludwig-Maximilians-University Munich, 82152 Planegg, Germany.

Publication information

Genetics. 2010 Jul;185(3):907-22. doi: 10.1534/genetics.110.116459. Epub 2010 Apr 20.

PMID:20407129

Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC2907208/

Abstract

A major goal of population genomics is to reconstruct the history of natural populations and to infer the neutral and selective scenarios that can explain the present-day polymorphism patterns. However, the separation between neutral and selective hypotheses has proven hard, mainly because both may predict similar patterns in the genome. This study focuses on the development of methods that can be used to distinguish neutral from selective hypotheses in equilibrium and nonequilibrium populations. These methods utilize a combination of statistics on the basis of the site frequency spectrum (SFS) and linkage disequilibrium (LD). We investigate the patterns of genetic variation along recombining chromosomes using a multitude of comparisons between neutral and selective hypotheses, such as selection or neutrality in equilibrium and nonequilibrium populations and recurrent selection models. We perform hypothesis testing using the classical P-value approach, but we also introduce methods from the machine-learning field. We demonstrate that the combination of SFS- and LD-based statistics increases the power to detect recent positive selection in populations that have experienced past demographic changes.
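To make the two families of statistics concrete, the following is a minimal sketch of one SFS-based and one LD-based summary computed from a small haplotype matrix. The abstract does not say which specific statistics the study combines, so the choice here (the unfolded site frequency spectrum and the mean pairwise r² between segregating sites) is an assumption made for illustration, and all function names are hypothetical.

```python
from itertools import combinations


def segregating_columns(haps):
    """Return the per-site allele columns that are polymorphic in the sample.

    `haps` is a list of chromosomes, each a list of 0 (ancestral) / 1 (derived).
    """
    n = len(haps)
    cols = [list(col) for col in zip(*haps)]
    return [c for c in cols if 0 < sum(c) < n]


def site_frequency_spectrum(haps):
    """Unfolded SFS: entry i-1 counts sites whose derived allele occurs i times."""
    n = len(haps)
    sfs = [0] * (n - 1)
    for col in segregating_columns(haps):
        sfs[sum(col) - 1] += 1
    return sfs


def r_squared(a, b):
    """LD between two biallelic sites: r^2 = D^2 / (pA(1-pA) pB(1-pB))."""
    n = len(a)
    pa = sum(a) / n
    pb = sum(b) / n
    pab = sum(x * y for x, y in zip(a, b)) / n
    d = pab - pa * pb
    return d * d / (pa * (1 - pa) * pb * (1 - pb))


def mean_r_squared(haps):
    """Average r^2 over all pairs of segregating sites (NaN if fewer than two)."""
    pairs = list(combinations(segregating_columns(haps), 2))
    if not pairs:
        return float("nan")
    return sum(r_squared(a, b) for a, b in pairs) / len(pairs)


# Toy sample: 4 chromosomes (rows) typed at 3 SNPs (columns).
haps = [
    [1, 1, 0],
    [1, 1, 0],
    [0, 0, 1],
    [0, 0, 0],
]
sfs = site_frequency_spectrum(haps)
ld = mean_r_squared(haps)
```

In a scan, summaries like these would typically be computed in windows along the chromosome and compared against simulations under neutral and selective scenarios. The windowing and the simulation step are not shown here.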
