• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

探索概率系统发育分析的快速计算策略。

Exploring fast computational strategies for probabilistic phylogenetic analysis.

作者信息

Rodrigue Nicolas, Philippe Hervé, Lartillot Nicolas

机构信息

Canadian Institute for Advanced Research, Département de Biochimie, Université de Montréal, Québec, Canada.

出版信息

Syst Biol. 2007 Oct;56(5):711-26. doi: 10.1080/10635150701611258.

DOI:10.1080/10635150701611258
PMID:17849326
Abstract

In recent years, the advent of Markov chain Monte Carlo (MCMC) techniques, coupled with modern computational capabilities, has enabled the study of evolutionary models without a closed form solution of the likelihood function. However, current Bayesian MCMC applications can incur significant computational costs, as they are based on a full sampling from the posterior probability distribution of the parameters of interest. Here, we draw attention as to how MCMC techniques can be embedded within normal approximation strategies for more economical statistical computation. The overall procedure is based on an estimate of the first and second moments of the likelihood function, as well as a maximum likelihood estimate. Through examples, we review several MCMC-based methods used in the statistical literature for such estimation, applying the approaches to constructing posterior distributions under non-analytical evolutionary models relaxing the assumptions of rate homogeneity, and of independence between sites. Finally, we use the procedures for conducting Bayesian model selection, based on Laplace approximations of Bayes factors, which we find to be accurate and computationally advantageous. Altogether, the methods we expound here, as well as other related approaches from the statistical literature, should prove useful when investigating increasingly complex descriptions of molecular evolution, alleviating some of the difficulties associated with nonanalytical models.

摘要

近年来,马尔可夫链蒙特卡罗(MCMC)技术的出现,再加上现代计算能力,使得对没有似然函数闭式解的进化模型进行研究成为可能。然而,当前的贝叶斯MCMC应用可能会产生巨大的计算成本,因为它们基于对感兴趣参数的后验概率分布进行全采样。在此,我们关注如何将MCMC技术嵌入到正态近似策略中,以实现更经济的统计计算。整个过程基于似然函数一阶矩和二阶矩的估计以及最大似然估计。通过实例,我们回顾了统计文献中用于此类估计的几种基于MCMC的方法,并将这些方法应用于在放宽速率齐性假设和位点间独立性假设的非解析进化模型下构建后验分布。最后,我们使用基于贝叶斯因子拉普拉斯近似的程序进行贝叶斯模型选择,发现其准确且在计算上具有优势。总之,我们在此阐述的方法以及统计文献中的其他相关方法,在研究日益复杂的分子进化描述时应会很有用,可缓解与非解析模型相关的一些困难。

相似文献

1
Exploring fast computational strategies for probabilistic phylogenetic analysis.探索概率系统发育分析的快速计算策略。
Syst Biol. 2007 Oct;56(5):711-26. doi: 10.1080/10635150701611258.
2
Identifiability of parameters in MCMC Bayesian inference of phylogeny.系统发育的MCMC贝叶斯推断中参数的可识别性。
Syst Biol. 2002 Oct;51(5):754-60. doi: 10.1080/10635150290102429.
3
Very fast algorithms for evaluating the stability of ML and Bayesian phylogenetic trees from sequence data.用于从序列数据评估最大似然法和贝叶斯系统发育树稳定性的超快速算法。
Genome Inform. 2002;13:82-92.
4
Approximate likelihood calculation on a phylogeny for Bayesian estimation of divergence times.基于贝叶斯估计分歧时间的系统发育近似似然计算。
Mol Biol Evol. 2011 Jul;28(7):2161-72. doi: 10.1093/molbev/msr045. Epub 2011 Feb 10.
5
Exploring heterogeneity in tumour data using Markov chain Monte Carlo.使用马尔可夫链蒙特卡罗方法探索肿瘤数据中的异质性。
Stat Med. 2003 May 30;22(10):1691-707. doi: 10.1002/sim.1441.
6
Bayesian inference of phylogeny and its impact on evolutionary biology.系统发育的贝叶斯推断及其对进化生物学的影响。
Science. 2001 Dec 14;294(5550):2310-4. doi: 10.1126/science.1065889.
7
Data cloning: easy maximum likelihood estimation for complex ecological models using Bayesian Markov chain Monte Carlo methods.数据克隆:使用贝叶斯马尔可夫链蒙特卡罗方法对复杂生态模型进行简便的最大似然估计。
Ecol Lett. 2007 Jul;10(7):551-63. doi: 10.1111/j.1461-0248.2007.01047.x.
8
Phylogenetic MCMC algorithms are misleading on mixtures of trees.系统发育马尔可夫链蒙特卡罗算法在树的混合模型上具有误导性。
Science. 2005 Sep 30;309(5744):2207-9. doi: 10.1126/science.1115493.
9
Searching for convergence in phylogenetic Markov chain Monte Carlo.在系统发育马尔可夫链蒙特卡罗方法中寻找收敛性。
Syst Biol. 2006 Aug;55(4):553-65. doi: 10.1080/10635150600812544.
10
Computational methods for evaluating phylogenetic models of coding sequence evolution with dependence between codons.用于评估密码子间存在依赖性的编码序列进化系统发育模型的计算方法。
Mol Biol Evol. 2009 Jul;26(7):1663-76. doi: 10.1093/molbev/msp078. Epub 2009 Apr 21.

引用本文的文献

1
Stochastic Character Mapping: An Under-Exploited Approach to the Study of Molecular Evolution.随机特征映射:一种尚未充分利用的分子进化研究方法。
J Mol Evol. 2025 Aug;93(4):465-473. doi: 10.1007/s00239-025-10257-5. Epub 2025 Jul 8.
2
Robustness of Phylogenetic Inference to Model Misspecification Caused by Pairwise Epistasis.由成对上位性引起的模型误设定对系统发育推断稳健性的影响
Mol Biol Evol. 2021 Sep 27;38(10):4603-4615. doi: 10.1093/molbev/msab163.
3
Detecting amino acid preference shifts with codon-level mutation-selection mixture models.
检测氨基酸偏好转变的密码子水平突变-选择混合模型。
BMC Evol Biol. 2019 Feb 26;19(1):62. doi: 10.1186/s12862-019-1358-7.
4
Relaxing the Molecular Clock to Different Degrees for Different Substitution Types.针对不同的替换类型,以不同程度放宽分子钟。
Mol Biol Evol. 2015 Aug;32(8):1948-61. doi: 10.1093/molbev/msv099. Epub 2015 Apr 29.
5
On the statistical interpretation of site-specific variables in phylogeny-based substitution models.基于系统发育替换模型的特定位点变量的统计解释。
Genetics. 2013 Feb;193(2):557-64. doi: 10.1534/genetics.112.145722. Epub 2012 Dec 5.
6
Rapid likelihood analysis on large phylogenies using partial sampling of substitution histories.利用替代历史的部分抽样对大型系统发育树进行快速似然分析。
Mol Biol Evol. 2010 Feb;27(2):249-65. doi: 10.1093/molbev/msp228. Epub 2009 Sep 25.
7
Bayesian comparisons of codon substitution models.密码子替换模型的贝叶斯比较。
Genetics. 2008 Nov;180(3):1579-91. doi: 10.1534/genetics.108.092254. Epub 2008 Sep 14.