Likelihood-free inference of population structure and local adaptation in a Bayesian hierarchical model.

Affiliation

School of Biological Sciences, University of Reading, Whiteknights, Reading RG6 6BX, United Kingdom.

Publication information

Genetics. 2010 Jun;185(2):587-602. doi: 10.1534/genetics.109.112391. Epub 2010 Apr 9.

DOI:10.1534/genetics.109.112391

PMID:20382835

Original article: https://pmc.ncbi.nlm.nih.gov/articles/PMC2881139/

Abstract

We address the problem of finding evidence of natural selection from genetic data, accounting for the confounding effects of demographic history. In the absence of natural selection, gene genealogies should all be sampled from the same underlying distribution, often approximated by a coalescent model. Selection at a particular locus will lead to a modified genealogy, and this motivates a number of recent approaches for detecting the effects of natural selection in the genome as "outliers" under some models. The demographic history of a population affects the sampling distribution of genealogies, and therefore the observed genotypes and the classification of outliers. Since we cannot see genealogies directly, we have to infer them from the observed data under some model of mutation and demography. Thus the accuracy of an outlier-based approach depends to a greater or a lesser extent on the uncertainty about the demographic and mutational model. A natural modeling framework for this type of problem is provided by Bayesian hierarchical models, in which parameters, such as mutation rates and selection coefficients, are allowed to vary across loci. It has proved quite difficult computationally to implement fully probabilistic genealogical models with complex demographies, and this has motivated the development of approximations such as approximate Bayesian computation (ABC). In ABC the data are compressed into summary statistics, and computation of the likelihood function is replaced by simulation of data under the model. In a hierarchical setting one may be interested both in hyperparameters and parameters, and there may be very many of the latter--for example, in a genetic model, these may be parameters describing each of many loci or populations. This poses a problem for ABC in that one then requires summary statistics for each locus, which, if used naively, leads to a consequent difficulty in conditional density estimation. 
We develop a general method for applying ABC to Bayesian hierarchical models, and we apply it to detect microsatellite loci influenced by local selection. We demonstrate using receiver operating characteristic (ROC) analysis that this approach has comparable performance to a full-likelihood method and outperforms it when mutation rates are variable across loci.
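The basic ABC idea described in the abstract (compress the data into summary statistics and replace evaluation of the likelihood with simulation under the model) can be illustrated with a minimal rejection sampler. This is only a sketch on a toy problem, not the hierarchical method developed in the paper: the Normal(theta, 1) model, the uniform prior on [-5, 5], the sample mean as the single summary statistic, the tolerance, and all function names are assumptions chosen for illustration.

```python
import random
import statistics


def simulate(theta, n, rng):
    """Simulate n observations from the toy model Normal(theta, 1)."""
    return [rng.gauss(theta, 1.0) for _ in range(n)]


def summary(data):
    """Compress a data set into one summary statistic (the sample mean)."""
    return statistics.fmean(data)


def abc_rejection(observed, n_draws, tolerance, rng):
    """Keep prior draws whose simulated summary is within tolerance of the observed one."""
    s_obs = summary(observed)
    accepted = []
    for _ in range(n_draws):
        theta = rng.uniform(-5.0, 5.0)  # assumed uniform prior (illustrative)
        s_sim = summary(simulate(theta, len(observed), rng))
        if abs(s_sim - s_obs) <= tolerance:  # no likelihood is ever evaluated
            accepted.append(theta)
    return accepted


rng = random.Random(1)
observed = simulate(2.0, 50, rng)  # stand-in "observed" data, true theta = 2.0
posterior_sample = abc_rejection(observed, n_draws=20000, tolerance=0.1, rng=rng)
print(len(posterior_sample), statistics.fmean(posterior_sample))
```

The accepted values approximate draws from the posterior of theta given the summary statistic. With one parameter and one statistic this works well; the difficulty the abstract raises is that a hierarchical model with many loci would need summary statistics for every locus, so a naive version of this accept/reject step becomes very inefficient.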

Similar articles

Identifying loci under selection via explicit demographic models.

Mol Ecol Resour. 2021 Nov;21(8):2719-2737. doi: 10.1111/1755-0998.13415. Epub 2021 Jun 3.

AABC: approximate approximate Bayesian computation for inference in population-genetic models.

Theor Popul Biol. 2015 Feb;99:31-42. doi: 10.1016/j.tpb.2014.09.002. Epub 2014 Sep 26.

Estimating demographic parameters from large-scale population genomic data using Approximate Bayesian Computation.

BMC Genet. 2012 Mar 27;13:22. doi: 10.1186/1471-2156-13-22.

The Promise of Inferring the Past Using the Ancestral Recombination Graph.

Genome Biol Evol. 2024 Feb 1;16(2). doi: 10.1093/gbe/evae005.

Inference with selection, varying population size, and evolving population structure: application of ABC to a forward-backward coalescent process with interactions.

Heredity (Edinb). 2021 Feb;126(2):335-350. doi: 10.1038/s41437-020-00381-x. Epub 2020 Oct 30.

A fast and reliable computational method for estimating population genetic parameters.

Genetics. 2008 Jun;179(2):951-63. doi: 10.1534/genetics.108.087049. Epub 2008 May 27.

Joint inference of microsatellite mutation models, population history and genealogies using transdimensional Markov Chain Monte Carlo.

Genetics. 2011 May;188(1):151-64. doi: 10.1534/genetics.110.125260. Epub 2011 Mar 8.

Approximate bayesian computation without summary statistics: the case of admixture.

Genetics. 2009 Apr;181(4):1507-19. doi: 10.1534/genetics.108.098129. Epub 2009 Feb 2.

A method for accurate inference of population size from serially sampled genealogies distorted by selection.

Mol Biol Evol. 2011 Nov;28(11):3171-81. doi: 10.1093/molbev/msr153. Epub 2011 Jun 16.

Cited by

ABCDP: Approximate Bayesian Computation with Differential Privacy.

Entropy (Basel). 2021 Jul 27;23(8):961. doi: 10.3390/e23080961.

Identifying loci under selection via explicit demographic models.

Mol Ecol Resour. 2021 Nov;21(8):2719-2737. doi: 10.1111/1755-0998.13415. Epub 2021 Jun 3.

Inferring Demography and Selection in Organisms Characterized by Skewed Offspring Distributions.

Genetics. 2019 Mar;211(3):1019-1028. doi: 10.1534/genetics.118.301684. Epub 2019 Jan 16.

Local Adaptation in European Firs Assessed through Extensive Sampling across Altitudinal Gradients in Southern Europe.

PLoS One. 2016 Jul 8;11(7):e0158216. doi: 10.1371/journal.pone.0158216. eCollection 2016.

Model-based analysis supports interglacial refugia over long-dispersal events in the diversification of two South American cactus species.

Heredity (Edinb). 2016 Jun;116(6):550-7. doi: 10.1038/hdy.2016.17. Epub 2016 Apr 13.

Likelihood-Free Inference in High-Dimensional Models.

Genetics. 2016 Jun;203(2):893-904. doi: 10.1534/genetics.116.187567. Epub 2016 Apr 6.

Deep Learning for Population Genetic Inference.

PLoS Comput Biol. 2016 Mar 28;12(3):e1004845. doi: 10.1371/journal.pcbi.1004845. eCollection 2016 Mar.

The aggregate site frequency spectrum for comparative population genomic inference.

Mol Ecol. 2015 Dec;24(24):6223-40. doi: 10.1111/mec.13447. Epub 2015 Dec 12.

Detecting Genomic Signatures of Natural Selection with Principal Component Analysis: Application to the 1000 Genomes Data.

Mol Biol Evol. 2016 Apr;33(4):1082-93. doi: 10.1093/molbev/msv334. Epub 2015 Dec 29.

AFLPsim: an R package to simulate and detect dominant markers under selection in hybridizing populations.

Plant Methods. 2014 Dec 13;10:40. doi: 10.1186/1746-4811-10-40. eCollection 2014.
