从基因频率数据中检测和衡量选择。

Detecting and measuring selection from gene frequency data.

机构信息

Institut National de la Recherche Agronomique, Unité Mixte de Recherche CBGP, (Inra, Ird, Cirad, Montpellier-SupAgro) 34988 Montferrier-sur-Lez Cedex, France.

出版信息

Genetics. 2014 Mar;196(3):799-817. doi: 10.1534/genetics.113.152991. Epub 2013 Dec 20.

DOI:10.1534/genetics.113.152991

PMID:24361938

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC3948807/

Abstract

The recent advent of high-throughput sequencing and genotyping technologies makes it possible to produce, easily and cost effectively, large amounts of detailed data on the genotype composition of populations. Detecting locus-specific effects may help identify those genes that have been, or are currently, targeted by natural selection. How best to identify these selected regions, loci, or single nucleotides remains a challenging issue. Here, we introduce a new model-based method, called SelEstim, to distinguish putative selected polymorphisms from the background of neutral (or nearly neutral) ones and to estimate the intensity of selection at the former. The underlying population genetic model is a diffusion approximation for the distribution of allele frequency in a population subdivided into a number of demes that exchange migrants. We use a Markov chain Monte Carlo algorithm for sampling from the joint posterior distribution of the model parameters, in a hierarchical Bayesian framework. We present evidence from stochastic simulations, which demonstrates the good power of SelEstim to identify loci targeted by selection and to estimate the strength of selection acting on these loci, within each deme. We also reanalyze a subset of SNP data from the Stanford HGDP-CEPH Human Genome Diversity Cell Line Panel to illustrate the performance of SelEstim on real data. In agreement with previous studies, our analyses point to a very strong signal of positive selection upstream of the LCT gene, which encodes for the enzyme lactase-phlorizin hydrolase and is associated with adult-type hypolactasia. The geographical distribution of the strength of positive selection across the Old World matches the interpolated map of lactase persistence phenotype frequencies, with the strongest selection coefficients in Europe and in the Indus Valley.

摘要

高通量测序和基因分型技术的出现使得人们可以轻松、经济高效地产生大量有关人群基因型组成的详细数据。检测特定基因座的效应有助于识别那些曾被自然选择或当前正被自然选择靶向的基因。如何最好地识别这些选择区域、基因座或单核苷酸仍然是一个具有挑战性的问题。在这里，我们引入了一种新的基于模型的方法，称为 SelEstim，用于区分假定的选择多态性与中性（或近乎中性）背景下的多态性，并估计前者的选择强度。潜在的群体遗传模型是一个在划分为若干交换移民群体的种群中等位基因频率分布的扩散近似模型。我们在分层贝叶斯框架中使用马尔可夫链蒙特卡罗算法从模型参数的联合后验分布中进行采样。我们从随机模拟中提供了证据，证明了 SelEstim 在识别被选择靶向的基因座和估计这些基因座上选择作用的强度方面具有良好的功效，在每个群体中都是如此。我们还重新分析了斯坦福人类基因组多样性细胞系面板（Stanford HGDP-CEPH Human Genome Diversity Cell Line Panel）中 SNP 数据的一个子集，以说明 SelEstim 在真实数据上的性能。与先前的研究一致，我们的分析表明，在编码乳糖酶-植酸钠水解酶的 LCT 基因上游存在非常强烈的正选择信号，该基因与成人型乳糖不耐受有关。在旧世界范围内，正选择的强度在地理上的分布与乳糖持续存在表型频率的插值图相匹配，在欧洲和印度河流域最强。

相似文献

Detecting and measuring selection from gene frequency data.

Genetics. 2014 Mar;196(3):799-817. doi: 10.1534/genetics.113.152991. Epub 2013 Dec 20.

Bayesian inference of selection in the Wright-Fisher diffusion model.

Stat Appl Genet Mol Biol. 2018 Jun 6;17(3):sagmb-2017-0046. doi: 10.1515/sagmb-2017-0046.

Identifying adaptive genetic divergence among populations from genome scans.

Mol Ecol. 2004 Apr;13(4):969-80. doi: 10.1111/j.1365-294x.2004.02125.x.

Detecting Selection from Linked Sites Using an -Model.

Genetics. 2020 Dec;216(4):1205-1215. doi: 10.1534/genetics.120.303780. Epub 2020 Oct 16.

Transcriptional regulation of the lactase-phlorizin hydrolase gene by polymorphisms associated with adult-type hypolactasia.

Gut. 2003 May;52(5):647-52. doi: 10.1136/gut.52.5.647.

A genome-wide scan shows evidence for local adaptation in a widespread keystone Neotropical forest tree.

Heredity (Edinb). 2019 Aug;123(2):117-137. doi: 10.1038/s41437-019-0188-0. Epub 2019 Feb 12.

A Bayesian outlier criterion to detect SNPs under selection in large data sets.

PLoS One. 2010 Aug 2;5(8):e11913. doi: 10.1371/journal.pone.0011913.

Detecting and Quantifying Natural Selection at Two Linked Loci from Time Series Data of Allele Frequencies with Forward-in-Time Simulations.

Genetics. 2020 Oct;216(2):521-541. doi: 10.1534/genetics.120.303463. Epub 2020 Aug 21.

Genetics of Lactose Intolerance: An Updated Review and Online Interactive World Maps of Phenotype and Genotype Frequencies.

Nutrients. 2020 Sep 3;12(9):2689. doi: 10.3390/nu12092689.

Accuracy of prediction of simulated polygenic phenotypes and their underlying quantitative trait loci genotypes using real or imputed whole-genome markers in cattle.

Genet Sel Evol. 2015 Dec 23;47:99. doi: 10.1186/s12711-015-0179-4.

引用本文的文献

Using landscape genomics to assess local adaptation and genomic vulnerability of a perennial herb (Vitaceae) in subtropical China.

Front Genet. 2023 Apr 18;14:1150704. doi: 10.3389/fgene.2023.1150704. eCollection 2023.

Fast and accurate joint inference of coancestry parameters for populations and/or individuals.

PLoS Genet. 2023 Jan 19;19(1):e1010054. doi: 10.1371/journal.pgen.1010054. eCollection 2023 Jan.

Discovering candidate SNPs for resilience breeding of red clover.

Front Plant Sci. 2022 Sep 28;13:997860. doi: 10.3389/fpls.2022.997860. eCollection 2022.

A brief history and popularity of methods and tools used to estimate micro-evolutionary forces.

Ecol Evol. 2021 Sep 16;11(20):13723-13743. doi: 10.1002/ece3.8076. eCollection 2021 Oct.

Population structure, adaptation and divergence of the meadow spittlebug, (Hemiptera, Aphrophoridae), revealed by genomic and morphological data.

PeerJ. 2021 Jun 1;9:e11425. doi: 10.7717/peerj.11425. eCollection 2021.

bric à brac controls sex pheromone choice by male European corn borer moths.

Nat Commun. 2021 May 14;12(1):2818. doi: 10.1038/s41467-021-23026-x.

Genomics of Clinal Local Adaptation in Under Continuous Environmental and Spatial Genetic Setting.

G3 (Bethesda). 2020 Aug 5;10(8):2683-2696. doi: 10.1534/g3.120.401285.

Preliminary insights into the genetics of bank vole tolerance to Puumala hantavirus in Sweden.

Ecol Evol. 2018 Oct 26;8(22):11273-11292. doi: 10.1002/ece3.4603. eCollection 2018 Nov.

Generalization of the Ewens sampling formula to arbitrary fitness landscapes.

PLoS One. 2018 Jan 11;13(1):e0190186. doi: 10.1371/journal.pone.0190186. eCollection 2018.

Landscape genomic approach to detect selection signatures in locally adapted Brazilian swine genetic groups.

Ecol Evol. 2017 Oct 12;7(22):9544-9556. doi: 10.1002/ece3.3323. eCollection 2017 Nov.

本文引用的文献

Robust identification of local adaptation from allele frequencies.

Genetics. 2013 Sep;195(1):205-20. doi: 10.1534/genetics.113.152462. Epub 2013 Jul 2.

Testing for associations between loci and environmental gradients using latent factor mixed models.

Mol Biol Evol. 2013 Jul;30(7):1687-99. doi: 10.1093/molbev/mst063. Epub 2013 Mar 29.

Detecting signatures of selection through haplotype differentiation among hierarchically structured populations.

Genetics. 2013 Mar;193(3):929-41. doi: 10.1534/genetics.112.147231. Epub 2013 Jan 10.

Inference of population splits and mixtures from genome-wide allele frequency data.

PLoS Genet. 2012;8(11):e1002967. doi: 10.1371/journal.pgen.1002967. Epub 2012 Nov 15.

Inferring population histories using genome-wide allele frequency data.

Mol Biol Evol. 2013 Mar;30(3):654-68. doi: 10.1093/molbev/mss257. Epub 2012 Nov 15.

rehh: an R package to detect footprints of selection in genome-wide SNP data from haplotype structure.

Bioinformatics. 2012 Apr 15;28(8):1176-7. doi: 10.1093/bioinformatics/bts115. Epub 2012 Mar 7.

Herders of Indian and European cattle share their predominant allele for lactase persistence.

Mol Biol Evol. 2012 Jan;29(1):249-60. doi: 10.1093/molbev/msr190. Epub 2011 Aug 11.

Quantifying population structure using the F-model.

Mol Ecol Resour. 2010 Sep;10(5):821-30. doi: 10.1111/j.1755-0998.2010.02873.x. Epub 2010 May 13.

Adaptations to climate-mediated selective pressures in humans.

PLoS Genet. 2011 Apr;7(4):e1001375. doi: 10.1371/journal.pgen.1001375. Epub 2011 Apr 21.

Evolution of lactase persistence: an example of human niche construction.

Philos Trans R Soc Lond B Biol Sci. 2011 Mar 27;366(1566):863-77. doi: 10.1098/rstb.2010.0268.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

从基因频率数据中检测和衡量选择。

Detecting and measuring selection from gene frequency data.

机构信息

出版信息

相似文献

引用本文的文献

本文引用的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献