Suppr超能文献

使用精确贝塔近似在赖特-费希尔模型下进行推断。

Inference Under a Wright-Fisher Model Using an Accurate Beta Approximation.

作者信息

Tataru Paula, Bataillon Thomas, Hobolth Asger

机构信息

Bioinformatics Research Centre, Aarhus University, Aarhus C 8000, Denmark

Bioinformatics Research Centre, Aarhus University, Aarhus C 8000, Denmark.

出版信息

Genetics. 2015 Nov;201(3):1133-41. doi: 10.1534/genetics.115.179606. Epub 2015 Aug 26.

Abstract

The large amount and high quality of genomic data available today enable, in principle, accurate inference of evolutionary histories of observed populations. The Wright-Fisher model is one of the most widely used models for this purpose. It describes the stochastic behavior in time of allele frequencies and the influence of evolutionary pressures, such as mutation and selection. Despite its simple mathematical formulation, exact results for the distribution of allele frequency (DAF) as a function of time are not available in closed analytical form. Existing approximations build on the computationally intensive diffusion limit or rely on matching moments of the DAF. One of the moment-based approximations relies on the beta distribution, which can accurately describe the DAF when the allele frequency is not close to the boundaries (0 and 1). Nonetheless, under a Wright-Fisher model, the probability of being on the boundary can be positive, corresponding to the allele being either lost or fixed. Here we introduce the beta with spikes, an extension of the beta approximation that explicitly models the loss and fixation probabilities as two spikes at the boundaries. We show that the addition of spikes greatly improves the quality of the approximation. We additionally illustrate, using both simulated and real data, how the beta with spikes can be used for inference of divergence times between populations with comparable performance to an existing state-of-the-art method.

摘要

如今可用的大量高质量基因组数据原则上能够精确推断观察到的种群的进化历史。赖特 - 费希尔模型是为此目的最广泛使用的模型之一。它描述了等位基因频率随时间的随机行为以及进化压力(如突变和选择)的影响。尽管其数学公式简单,但等位基因频率分布(DAF)作为时间函数的精确结果无法以封闭的解析形式获得。现有的近似方法基于计算密集的扩散极限构建,或者依赖于匹配DAF的矩。基于矩的近似方法之一依赖于贝塔分布,当等位基因频率不接近边界(0和1)时,它可以准确描述DAF。然而,在赖特 - 费希尔模型下,处于边界的概率可能为正,对应于等位基因要么丢失要么固定。在这里,我们引入带尖峰的贝塔分布,这是贝塔近似的扩展,它将丢失和固定概率明确建模为边界处的两个尖峰。我们表明添加尖峰极大地提高了近似的质量。我们还使用模拟数据和真实数据说明了带尖峰的贝塔分布如何用于推断种群之间的分化时间,其性能与现有的最先进方法相当。

相似文献

8
Exact simulation of conditioned Wright-Fisher models.条件Wright-Fisher模型的精确模拟。
J Theor Biol. 2014 Dec 21;363:419-26. doi: 10.1016/j.jtbi.2014.08.027. Epub 2014 Aug 28.
10
Wright-Fisher diffusion bridges.赖特-费希尔扩散桥
Theor Popul Biol. 2018 Jul;122:67-77. doi: 10.1016/j.tpb.2017.09.005. Epub 2017 Oct 6.

引用本文的文献

6
A mechanistic model of gossip, reputations, and cooperation.谣言、声誉和合作的机械模型。
Proc Natl Acad Sci U S A. 2024 May 14;121(20):e2400689121. doi: 10.1073/pnas.2400689121. Epub 2024 May 8.
8
Direct detection of natural selection in Bronze Age Britain.直接检测青铜时代英国的自然选择。
Genome Res. 2022 Nov-Dec;32(11-12):2057-2067. doi: 10.1101/gr.276862.122. Epub 2022 Oct 31.

本文引用的文献

4
Large-scale whole-genome sequencing of the Icelandic population.大规模全基因组测序的冰岛人口。
Nat Genet. 2015 May;47(5):435-44. doi: 10.1038/ng.3247. Epub 2015 Mar 25.
10

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验