• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

赖特-费希尔精确求解器(WFES):无需模拟或扩散理论即可对群体遗传模型进行可扩展分析。

Wright-Fisher exact solver (WFES): scalable analysis of population genetic models without simulation or diffusion theory.

作者信息

Krukov Ivan, de Sanctis Bianca, de Koning A P Jason

机构信息

Department of Biochemistry and Molecular Biology, Cumming School of Medicine, University of Calgary, Calgary, Alberta T2N 1N4, Canada.

Doctoral Program in Biochemistry and Molecular Biology, Bioinformatics Stream, University of Calgary, Calgary, Alberta T2N 1N4, Canada.

出版信息

Bioinformatics. 2017 May 1;33(9):1416-1417. doi: 10.1093/bioinformatics/btw802.

DOI:10.1093/bioinformatics/btw802
PMID:28453671
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC5408789/
Abstract

MOTIVATION

The simplifying assumptions that are used widely in theoretical population genetics may not always be appropriate for empirical population genetics. General computational approaches that do not require the assumptions of classical theory are therefore quite desirable. One such general approach is provided by the theory of absorbing Markov chains, which can be used to obtain exact results by directly analyzing population genetic Markov models, such as the classic bi-allelic Wright-Fisher model. Although these approaches are sometimes used, they are usually forgone in favor of simulation methods, due to the perception that they are too computationally burdensome. Here we show that, surprisingly, direct analysis of virtually any Markov chain model in population genetics can be made quite efficient by exploiting transition matrix sparsity and by solving restricted systems of linear equations, allowing a wide variety of exact calculations (within machine precision) to be easily and rapidly made on modern workstation computers.

RESULTS

We introduce Wright-Fisher Exact Solver (WFES), a fast and scalable method for direct analysis of Markov chain models in population genetics. WFES can rapidly solve for both long-term and transient behaviours including fixation and extinction probabilities, expected times to fixation or extinction, sojourn times, expected allele age and variance, and others. Our implementation requires only seconds to minutes of runtime on modern workstations and scales to biological population sizes ranging from humans to model organisms.

AVAILABILITY AND IMPLEMENTATION

The code is available at https://github.com/dekoning-lab/wfes.

CONTACT

jason.dekoning@ucalgary.ca.

SUPPLEMENTARY INFORMATION

Supplementary data are available at Bioinformatics online.

摘要

动机

理论群体遗传学中广泛使用的简化假设可能并不总是适用于实证群体遗传学。因此,非常需要不依赖经典理论假设的通用计算方法。吸收马尔可夫链理论提供了这样一种通用方法,它可通过直接分析群体遗传马尔可夫模型(如经典的双等位基因赖特 - 费希尔模型)来获得精确结果。尽管有时会使用这些方法,但由于人们认为它们计算量太大,所以通常会放弃而采用模拟方法。在这里我们表明,令人惊讶的是,通过利用转移矩阵的稀疏性并求解受限线性方程组,可以使群体遗传学中几乎任何马尔可夫链模型的直接分析变得非常高效,从而能够在现代工作站计算机上轻松快速地进行各种精确计算(在机器精度范围内)。

结果

我们引入了赖特 - 费希尔精确求解器(WFES),这是一种用于直接分析群体遗传学中马尔可夫链模型的快速且可扩展的方法。WFES 可以快速求解长期和瞬态行为,包括固定和灭绝概率、固定或灭绝的预期时间、停留时间、预期等位基因年龄和方差等。我们的实现方法在现代工作站上只需几秒到几分钟的运行时间,并且可扩展到从人类到模式生物的生物群体规模。

可用性和实现

代码可在 https://github.com/dekoning-lab/wfes 上获取。

联系方式

jason.dekoning@ucalgary.ca。

补充信息

补充数据可在《生物信息学》在线获取。

相似文献

1
Wright-Fisher exact solver (WFES): scalable analysis of population genetic models without simulation or diffusion theory.赖特-费希尔精确求解器(WFES):无需模拟或扩散理论即可对群体遗传模型进行可扩展分析。
Bioinformatics. 2017 May 1;33(9):1416-1417. doi: 10.1093/bioinformatics/btw802.
2
Allele Age Under Non-Classical Assumptions is Clarified by an Exact Computational Markov Chain Approach.在非经典假设下,通过精确计算的马尔可夫链方法阐明了等位基因年龄。
Sci Rep. 2017 Sep 19;7(1):11869. doi: 10.1038/s41598-017-12239-0.
3
Exact simulation of conditioned Wright-Fisher models.条件Wright-Fisher模型的精确模拟。
J Theor Biol. 2014 Dec 21;363:419-26. doi: 10.1016/j.jtbi.2014.08.027. Epub 2014 Aug 28.
4
EWF: simulating exact paths of the Wright-Fisher diffusion.EWF:模拟 Wright-Fisher 扩散的确切路径。
Bioinformatics. 2023 Jan 1;39(1). doi: 10.1093/bioinformatics/btad017.
5
The Exact Stochastic Process of the Haploid Multi-Allelic Wright-Fisher Mutation Model.单体多等位基因 Wright-Fisher 突变模型的精确随机过程。
6
Exact Markov chain and approximate diffusion solution for haploid genetic drift with one-way mutation.具有单向突变的单倍体遗传漂变的精确马尔可夫链和近似扩散解
Math Biosci. 2016 Feb;272:100-12. doi: 10.1016/j.mbs.2015.12.006. Epub 2015 Dec 24.
7
Exact Markov chains versus diffusion theory for haploid random mating.精确马尔可夫链与扩散理论在单体随机交配中的比较。
Math Biosci. 2010 May;225(1):18-23. doi: 10.1016/j.mbs.2010.01.005. Epub 2010 Jan 25.
8
Exact coalescent for the Wright-Fisher model.赖特-费希尔模型的精确合并理论
Theor Popul Biol. 2006 Jun;69(4):385-94. doi: 10.1016/j.tpb.2005.11.005. Epub 2006 Jan 19.
9
Inference in population genetics using forward and backward, discrete and continuous time processes.在群体遗传学中使用向前和向后、离散和连续时间过程进行推断。
J Theor Biol. 2018 Feb 14;439:166-180. doi: 10.1016/j.jtbi.2017.12.008. Epub 2017 Dec 9.
10
Inference of Selection from Genetic Time Series Using Various Parametric Approximations to the Wright-Fisher Model.利用各种参数逼近 Wright-Fisher 模型对遗传时间序列进行选择推断。
G3 (Bethesda). 2019 Dec 3;9(12):4073-4086. doi: 10.1534/g3.119.400778.

引用本文的文献

1
Scaling the discrete-time Wright-Fisher model to biobank-scale datasets.将离散时间 Wright-Fisher 模型扩展到生物库规模数据集。
Genetics. 2023 Nov 1;225(3). doi: 10.1093/genetics/iyad168.
2
Scaling the Discrete-time Wright Fisher model to biobank-scale datasets.将离散时间赖特-费希尔模型扩展到生物样本库规模的数据集。
bioRxiv. 2023 May 22:2023.05.19.541517. doi: 10.1101/2023.05.19.541517.
3
Characterizing Amino Acid Substitution with Complete Linkage of Sites on a Lineage.描述在线性分支上的位点完全连锁的氨基酸替换特征。

本文引用的文献

1
Complete numerical solution of the diffusion equation of random genetic drift.随机遗传漂变扩散方程的完全数值解。
Genetics. 2013 Aug;194(4):973-85. doi: 10.1534/genetics.113.152017. Epub 2013 Jun 7.
2
Joint inference of the distribution of fitness effects of deleterious mutations and population demography based on nucleotide polymorphism frequencies.基于核苷酸多态性频率对有害突变的适合度效应分布和群体人口统计学进行联合推断。
Genetics. 2007 Dec;177(4):2251-61. doi: 10.1534/genetics.107.080663.
Genome Biol Evol. 2021 Oct 1;13(10). doi: 10.1093/gbe/evab225.
4
Using the Mutation-Selection Framework to Characterize Selection on Protein Sequences.使用突变-选择框架来表征蛋白质序列上的选择。
Genes (Basel). 2018 Aug 13;9(8):409. doi: 10.3390/genes9080409.
5
Inferring Fitness Effects from Time-Resolved Sequence Data with a Delay-Deterministic Model.从具有时滞确定性模型的时间分辨序列数据推断适合度效应。
Genetics. 2018 May;209(1):255-264. doi: 10.1534/genetics.118.300790. Epub 2018 Mar 2.
6
Allele Age Under Non-Classical Assumptions is Clarified by an Exact Computational Markov Chain Approach.在非经典假设下,通过精确计算的马尔可夫链方法阐明了等位基因年龄。
Sci Rep. 2017 Sep 19;7(1):11869. doi: 10.1038/s41598-017-12239-0.