• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

利用位点频率数据估计尺度化突变率和突变偏差。

Estimating the scaled mutation rate and mutation bias with site frequency data.

作者信息

Vogl Claus

机构信息

Institute of Animal Breeding and Genetics, Veterinärmedizinische Universität Wien, Veterinärplatz 1, A-1210 Vienna, Austria.

出版信息

Theor Popul Biol. 2014 Dec;98:19-27. doi: 10.1016/j.tpb.2014.10.002. Epub 2014 Oct 18.

DOI:10.1016/j.tpb.2014.10.002
PMID:25453604
Abstract

The distribution of allele frequencies of a large number of biallelic sites is known as "allele-frequency spectrum" or "site-frequency spectrum" (SFS). Without selection and in regions of relatively high recombination rates, sites may be assumed to be independently and identically distributed. With a beta equilibrium distribution of allelic proportions and binomial sampling, a beta-binomial compound likelihood for each site results. The likelihood of the data and the posterior distribution of two parameters, scaled mutation rate θ and mutation bias α, is investigated in the general case and for small scaled mutation rates θ. In the general case, an expectation-maximization (EM) algorithm is derived to obtain maximum likelihood estimates of both parameters. With an appropriate prior distribution, a Markov chain Monte Carlo sampler to integrate the posterior distribution is also derived. As far as I am aware, previous maximum likelihood or Bayesian estimators of θ, explicitly or implicitly assume small scaled mutation rates, i.e., θ≪1. For θ≪1, maximum-likelihood estimators are also derived for both parameters using a Taylor series expansion of the beta-binomial distribution. The estimator of θ is a variant of the Ewens-Watterson estimator and of the maximum likelihood estimator derived with the Poisson Random Field approach. With a conjugate prior distribution, marginal and conditional beta posterior distributions are also derived for both parameters.

摘要

大量双等位基因位点的等位基因频率分布被称为“等位基因频率谱”或“位点频率谱”(SFS)。在没有选择且重组率相对较高的区域,可以假定位点是独立同分布的。在等位基因比例呈β平衡分布且进行二项式抽样的情况下,每个位点会产生一个β - 二项式复合似然。在一般情况下以及小尺度突变率θ的情况下,研究了数据的似然性以及两个参数(尺度化突变率θ和突变偏差α)的后验分布。在一般情况下,推导了一种期望最大化(EM)算法以获得两个参数的最大似然估计。在具有适当先验分布的情况下,还推导了一个用于对后验分布进行积分的马尔可夫链蒙特卡罗采样器。据我所知,之前对θ的最大似然或贝叶斯估计器,无论是显式还是隐式地,都假定了小尺度突变率,即θ≪1。对于θ≪1,还使用β - 二项式分布的泰勒级数展开为两个参数推导了最大似然估计器。θ的估计器是Ewens - Watterson估计器以及用泊松随机场方法推导的最大似然估计器的一个变体。在具有共轭先验分布的情况下,还为两个参数推导了边际和条件β后验分布。

相似文献

1
Estimating the scaled mutation rate and mutation bias with site frequency data.利用位点频率数据估计尺度化突变率和突变偏差。
Theor Popul Biol. 2014 Dec;98:19-27. doi: 10.1016/j.tpb.2014.10.002. Epub 2014 Oct 18.
2
Robustness of coalescent estimators to between-lineage mutation rate variation.溯祖估计量对谱系间突变率变化的稳健性。
Mol Biol Evol. 2006 Dec;23(12):2355-60. doi: 10.1093/molbev/msl106. Epub 2006 Sep 6.
3
A gradient Markov chain Monte Carlo algorithm for computing multivariate maximum likelihood estimates and posterior distributions: mixture dose-response assessment.用于计算多元极大似然估计值和后验分布的梯度马尔可夫链蒙特卡罗算法:混合剂量反应评估。
Risk Anal. 2012 Feb;32(2):345-59. doi: 10.1111/j.1539-6924.2011.01672.x. Epub 2011 Sep 11.
4
Sufficiency of the number of segregating sites in the limit under finite-sites mutation.有限位点突变下极限情况下分离位点数量的充分性。
Theor Popul Biol. 2010 Sep;78(2):118-22. doi: 10.1016/j.tpb.2010.05.003. Epub 2010 Jun 2.
5
Bayesian and maximum likelihood estimation of genetic maps.遗传图谱的贝叶斯估计和最大似然估计
Genet Res. 2005 Apr;85(2):159-68. doi: 10.1017/S0016672305007494.
6
[The use of the expectation-maximization (EM) algorithm for maximum likelihood estimation of gametic frequencies of multilocus polymorphic codominant systems based on sampled population data].[基于抽样群体数据,使用期望最大化(EM)算法对多位点共显性系统的配子频率进行最大似然估计]
Genetika. 2002 Mar;38(3):407-18.
7
Empirical Bayes models of Poisson clinical trials and sample size determination.泊松临床试验的经验贝叶斯模型与样本量确定
Pharm Stat. 2010 Apr-Jun;9(2):133-41. doi: 10.1002/pst.383.
8
Inference of directional selection and mutation parameters assuming equilibrium.在假设达到平衡的情况下对定向选择和突变参数进行推断。
Theor Popul Biol. 2015 Dec;106:71-82. doi: 10.1016/j.tpb.2015.10.003. Epub 2015 Oct 24.
9
Bayesian estimation of scaled mutation rate under the coalescent: a sequential Monte Carlo approach.在溯祖模型下缩放突变率的贝叶斯估计:一种序贯蒙特卡罗方法。
BMC Bioinformatics. 2017 Dec 8;18(1):541. doi: 10.1186/s12859-017-1948-6.
10
Maximum likelihood and Bayesian methods for estimating the distribution of selective effects among classes of mutations using DNA polymorphism data.利用DNA多态性数据估计各类突变间选择效应分布的最大似然法和贝叶斯方法。
Theor Popul Biol. 2003 Mar;63(2):91-103. doi: 10.1016/s0040-5809(02)00050-3.

引用本文的文献

1
Different mitogenomic codon usage patterns between damselflies and dragonflies and nine complete mitogenomes for odonates.不同的蜻蜓目和蜻蜓科之间的线粒体基因组密码子使用模式和 9 个完整的蜻蜓目线粒体基因组。
Sci Rep. 2019 Jan 24;9(1):678. doi: 10.1038/s41598-018-35760-2.
2
The stationary distribution of a sample from the Wright-Fisher diffusion model with general small mutation rates.具有一般小突变率的赖特 - 费希尔扩散模型样本的平稳分布。
J Math Biol. 2019 Mar;78(4):1211-1224. doi: 10.1007/s00285-018-1306-y. Epub 2018 Nov 13.