• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

许多实证比对中马尔可夫链蒙特卡罗性能的特性。

Properties of Markov Chain Monte Carlo Performance across Many Empirical Alignments.

机构信息

School of Life Sciences, University of Hawai'i, Honolulu, HI.

出版信息

Mol Biol Evol. 2021 Apr 13;38(4):1627-1640. doi: 10.1093/molbev/msaa295.

DOI:10.1093/molbev/msaa295
PMID:33185685
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC8042746/
Abstract

Nearly all current Bayesian phylogenetic applications rely on Markov chain Monte Carlo (MCMC) methods to approximate the posterior distribution for trees and other parameters of the model. These approximations are only reliable if Markov chains adequately converge and sample from the joint posterior distribution. Although several studies of phylogenetic MCMC convergence exist, these have focused on simulated data sets or select empirical examples. Therefore, much that is considered common knowledge about MCMC in empirical systems derives from a relatively small family of analyses under ideal conditions. To address this, we present an overview of commonly applied phylogenetic MCMC diagnostics and an assessment of patterns of these diagnostics across more than 18,000 empirical analyses. Many analyses appeared to perform well and failures in convergence were most likely to be detected using the average standard deviation of split frequencies, a diagnostic that compares topologies among independent chains. Different diagnostics yielded different information about failed convergence, demonstrating that multiple diagnostics must be employed to reliably detect problems. The number of taxa and average branch lengths in analyses have clear impacts on MCMC performance, with more taxa and shorter branches leading to more difficult convergence. We show that the usage of models that include both Γ-distributed among-site rate variation and a proportion of invariable sites is not broadly problematic for MCMC convergence but is also unnecessary. Changes to heating and the usage of model-averaged substitution models can both offer improved convergence in some cases, but neither are a panacea.

摘要

几乎所有当前的贝叶斯系统发育应用都依赖于马尔可夫链蒙特卡罗 (MCMC) 方法来近似树和模型其他参数的后验分布。只有当马尔可夫链充分收敛并从联合后验分布中采样时,这些近似才是可靠的。尽管已经有一些关于系统发育 MCMC 收敛性的研究,但这些研究都集中在模拟数据集或选择的经验示例上。因此,在经验系统中被认为是 MCMC 常识的许多内容都源于在理想条件下进行的相对较小的分析家族。为了解决这个问题,我们概述了常用的系统发育 MCMC 诊断方法,并评估了这些诊断方法在超过 18000 个经验分析中的模式。许多分析似乎表现良好,并且最有可能通过比较独立链之间的拓扑结构的平均分歧频率标准偏差来检测到收敛失败的情况,这是一种诊断方法。不同的诊断方法提供了关于收敛失败的不同信息,这表明必须采用多种诊断方法才能可靠地检测到问题。分析中的分类单元数量和平均分支长度对 MCMC 性能有明显影响,分类单元数量越多,分支越短,收敛就越困难。我们表明,包含Γ分布的站点间速率变化和不变位点比例的模型的使用对于 MCMC 收敛性并不是广泛存在问题的,但也是不必要的。加热的改变和使用模型平均替代模型在某些情况下都可以提供更好的收敛性,但都不是万能的。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cc59/8042746/c1eeed6300f3/msaa295f6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cc59/8042746/41967c5517fa/msaa295f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cc59/8042746/da77fddabbe0/msaa295f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cc59/8042746/03bf54004df3/msaa295f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cc59/8042746/c3d73d198541/msaa295f4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cc59/8042746/7dd8d70a4782/msaa295f5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cc59/8042746/c1eeed6300f3/msaa295f6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cc59/8042746/41967c5517fa/msaa295f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cc59/8042746/da77fddabbe0/msaa295f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cc59/8042746/03bf54004df3/msaa295f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cc59/8042746/c3d73d198541/msaa295f4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cc59/8042746/7dd8d70a4782/msaa295f5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cc59/8042746/c1eeed6300f3/msaa295f6.jpg

相似文献

1
Properties of Markov Chain Monte Carlo Performance across Many Empirical Alignments.许多实证比对中马尔可夫链蒙特卡罗性能的特性。
Mol Biol Evol. 2021 Apr 13;38(4):1627-1640. doi: 10.1093/molbev/msaa295.
2
Searching for convergence in phylogenetic Markov chain Monte Carlo.在系统发育马尔可夫链蒙特卡罗方法中寻找收敛性。
Syst Biol. 2006 Aug;55(4):553-65. doi: 10.1080/10635150600812544.
3
AWTY (are we there yet?): a system for graphical exploration of MCMC convergence in Bayesian phylogenetics.AWTY(我们到了吗?):一种用于贝叶斯系统发育学中马尔可夫链蒙特卡罗收敛性图形化探索的系统。
Bioinformatics. 2008 Feb 15;24(4):581-3. doi: 10.1093/bioinformatics/btm388. Epub 2007 Aug 30.
4
The Behavior of Metropolis-Coupled Markov Chains When Sampling Rugged Phylogenetic Distributions.当采样粗糙的系统发生分布时,Metropolis-coupled Markov 链的行为。
Syst Biol. 2018 Jul 1;67(4):729-734. doi: 10.1093/sysbio/syy008.
5
Phylogenetic MCMC algorithms are misleading on mixtures of trees.系统发育马尔可夫链蒙特卡罗算法在树的混合模型上具有误导性。
Science. 2005 Sep 30;309(5744):2207-9. doi: 10.1126/science.1115493.
6
An Efficient Independence Sampler for Updating Branches in Bayesian Markov chain Monte Carlo Sampling of Phylogenetic Trees.一种用于在系统发育树的贝叶斯马尔可夫链蒙特卡罗采样中更新分支的高效独立采样器。
Syst Biol. 2016 Jan;65(1):161-76. doi: 10.1093/sysbio/syv051. Epub 2015 Jul 30.
7
Quantifying MCMC exploration of phylogenetic tree space.量化马尔可夫链蒙特卡罗方法对系统发育树空间的探索。
Syst Biol. 2015 May;64(3):472-91. doi: 10.1093/sysbio/syv006. Epub 2015 Jan 27.
8
Geometric ergodicity of a hybrid sampler for Bayesian inference of phylogenetic branch lengths.用于系统发育分支长度贝叶斯推断的混合采样器的几何遍历性
Math Biosci. 2015 Oct;268:9-21. doi: 10.1016/j.mbs.2015.07.002. Epub 2015 Aug 7.
9
Assessing the convergence of Markov Chain Monte Carlo methods: an example from evaluation of diagnostic tests in absence of a gold standard.评估马尔可夫链蒙特卡罗方法的收敛性:来自无金标准情况下诊断试验评估的一个例子。
Prev Vet Med. 2007 May 16;79(2-4):244-56. doi: 10.1016/j.prevetmed.2007.01.003. Epub 2007 Feb 9.
10
Comment on "Phylogenetic MCMC algorithms are misleading on mixtures of trees".对《系统发育MCMC算法在树的混合问题上具有误导性》的评论
Science. 2006 Apr 21;312(5772):367; author reply 367. doi: 10.1126/science.1123622.

引用本文的文献

1
Treatment for peripheral nerve injury: a protocol for a systematic review and Bayesian network meta-analysis.周围神经损伤的治疗:系统评价和贝叶斯网络Meta分析方案
BMJ Open. 2024 Dec 27;14(12):e090497. doi: 10.1136/bmjopen-2024-090497.
2
Lung cancer incidence and mortality in trend and prediction between 2012-2030 in Shandong Province, using a Bayesian age-period-cohort model.利用贝叶斯年龄-时期-队列模型对山东省2012年至2030年肺癌发病率和死亡率的趋势及预测
Front Oncol. 2024 Dec 4;14:1451589. doi: 10.3389/fonc.2024.1451589. eCollection 2024.
3
Phylogenetic Tree Instability After Taxon Addition: Empirical Frequency, Predictability, and Consequences For Online Inference.

本文引用的文献

1
Adaptive Tree Proposals for Bayesian Phylogenetic Inference.自适应树提议用于贝叶斯系统发育推断。
Syst Biol. 2021 Aug 11;70(5):1015-1032. doi: 10.1093/sysbio/syab004.
2
Using Parsimony-Guided Tree Proposals to Accelerate Convergence in Bayesian Phylogenetic Inference.基于简约引导的树提议加速贝叶斯系统发育推断的收敛。
Syst Biol. 2020 Sep 1;69(5):1016-1032. doi: 10.1093/sysbio/syaa002.
3
Interrogating Genomic-Scale Data for Squamata (Lizards, Snakes, and Amphisbaenians) Shows no Support for Key Traditional Morphological Relationships.
分类群添加后的系统发育树不稳定性:在线推断的经验频率、可预测性及后果
Syst Biol. 2025 Feb 10;74(1):101-111. doi: 10.1093/sysbio/syae059.
4
Data integration in Bayesian phylogenetics.贝叶斯系统发育学中的数据整合。
Annu Rev Stat Appl. 2023;10:353-377. doi: 10.1146/annurev-statistics-033021-112532. Epub 2022 Sep 28.
5
Global burden and prediction study of cutaneous squamous cell carcinoma from 1990 to 2030: A systematic analysis and comparison with China.全球 1990 年至 2030 年皮肤鳞状细胞癌的负担和预测研究:与中国的系统分析和比较。
J Glob Health. 2024 May 3;14:04093. doi: 10.7189/jogh.14.04093.
6
How Trustworthy Is Your Tree? Bayesian Phylogenetic Effective Sample Size Through the Lens of Monte Carlo Error.你的树有多可靠?从蒙特卡罗误差角度看贝叶斯系统发育有效样本量。
Bayesian Anal. 2024 Jun;19(2):565-593. doi: 10.1214/22-ba1339. Epub 2024 Apr 9.
7
The Impact of Phenological Gaps on Leaf Characteristics and Foliage Dynamics of an Understory Dwarf Bamboo, .物候间隔对一种林下矮竹的叶片特征和叶动态的影响
Plants (Basel). 2024 Mar 4;13(5):719. doi: 10.3390/plants13050719.
8
Rediscovering the unusual, solitary bryozoan Monobryozoon ambulans Remane, 1936: first molecular and new morphological data clarify its phylogenetic position.重新发现不寻常的单生苔藓虫类游走单苔藓虫Remane,1936年:首个分子数据和新形态学数据阐明其系统发育位置。
Front Zool. 2024 Mar 5;21(1):5. doi: 10.1186/s12983-024-00527-1.
9
Plasticity in the Morphology of Growing Bamboo: A Bayesian Analysis of Exogenous Treatment Effects on Plant Height, Internode Length, and Internode Numbers.生长竹子形态的可塑性:外源处理对株高、节间长度和节数影响的贝叶斯分析
Plants (Basel). 2023 Apr 20;12(8):1713. doi: 10.3390/plants12081713.
10
Fidelity of hyperbolic space for Bayesian phylogenetic inference.双曲空间对贝叶斯系统发育推断的保真度。
PLoS Comput Biol. 2023 Apr 26;19(4):e1011084. doi: 10.1371/journal.pcbi.1011084. eCollection 2023 Apr.
对蜥蜴目(蜥蜴、蛇和蚓蜥)的基因组规模数据进行分析,结果不支持关键的传统形态学关系。
Syst Biol. 2020 May 1;69(3):502-520. doi: 10.1093/sysbio/syz062.
4
Posterior Summarization in Bayesian Phylogenetics Using Tracer 1.7.贝叶斯系统发生学中使用 Tracer 1.7 进行的后验总结
Syst Biol. 2018 Sep 1;67(5):901-904. doi: 10.1093/sysbio/syy032.
5
Variation Across Mitochondrial Gene Trees Provides Evidence for Systematic Error: How Much Gene Tree Variation Is Biological?线粒体基因树的变异性为系统误差提供了证据:基因树的变异性有多少是生物学上的?
Syst Biol. 2018 Sep 1;67(5):847-860. doi: 10.1093/sysbio/syy013.
6
The Behavior of Metropolis-Coupled Markov Chains When Sampling Rugged Phylogenetic Distributions.当采样粗糙的系统发生分布时,Metropolis-coupled Markov 链的行为。
Syst Biol. 2018 Jul 1;67(4):729-734. doi: 10.1093/sysbio/syy008.
7
RWTY (R We There Yet): An R Package for Examining Convergence of Bayesian Phylogenetic Analyses.RWTY(我们到了吗):一个用于检查贝叶斯系统发育分析收敛性的 R 包。
Mol Biol Evol. 2017 Apr 1;34(4):1016-1020. doi: 10.1093/molbev/msw279.
8
Bayes Factors Unmask Highly Variable Information Content, Bias, and Extreme Influence in Phylogenomic Analyses.贝叶斯因子揭示了系统发育基因组分析中高度可变的信息内容、偏差和极端影响。
Syst Biol. 2017 Jul 1;66(4):517-530. doi: 10.1093/sysbio/syw101.
9
Estimating the Effective Sample Size of Tree Topologies from Bayesian Phylogenetic Analyses.从贝叶斯系统发育分析估计树形拓扑结构的有效样本量
Genome Biol Evol. 2016 Aug 16;8(8):2319-32. doi: 10.1093/gbe/evw171.
10
Estimating Bayesian Phylogenetic Information Content.估计贝叶斯系统发育信息含量。
Syst Biol. 2016 Nov;65(6):1009-1023. doi: 10.1093/sysbio/syw042. Epub 2016 May 6.