• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

ImOSM:系统发育方法的间歇性进化和稳健性。

ImOSM: intermittent evolution and robustness of phylogenetic methods.

机构信息

Center for Integrative Bioinformatics Vienna, Max F. Perutz Laboratories, University of Vienna, Medical University of Vienna, University of Veterinary Medicine Vienna, Vienna, Austria.

出版信息

Mol Biol Evol. 2012 Feb;29(2):663-73. doi: 10.1093/molbev/msr220. Epub 2011 Sep 22.

DOI:10.1093/molbev/msr220
PMID:21940641
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC3258038/
Abstract

Among the criteria to evaluate the performance of a phylogenetic method, robustness to model violation is of particular practical importance as complete a priori knowledge of evolutionary processes is typically unavailable. For studies of robustness in phylogenetic inference, a utility to add well-defined model violations to the simulated data would be helpful. We therefore introduce ImOSM, a tool to imbed intermittent evolution as model violation into an alignment. Intermittent evolution refers to extra substitutions occurring randomly on branches of a tree, thus changing alignment site patterns. This means that the extra substitutions are placed on the tree after the typical process of sequence evolution is completed. We then study the robustness of widely used phylogenetic methods: maximum likelihood (ML), maximum parsimony (MP), and a distance-based method (BIONJ) to various scenarios of model violation. Violation of rates across sites (RaS) heterogeneity and simultaneous violation of RaS and the transition/transversion ratio on two nonadjacent external branches hinder all the methods recovery of the true topology for a four-taxon tree. For an eight-taxon balanced tree, the violations cause each of the three methods to infer a different topology. Both ML and MP fail, whereas BIONJ, which calculates the distances based on the ML estimated parameters, reconstructs the true tree. Finally, we report that a test of model homogeneity and goodness of fit tests have enough power to detect such model violations. The outcome of the tests can help to actually gain confidence in the inferred trees. Therefore, we recommend using these tests in practical phylogenetic analyses.

摘要

在评估系统发育方法性能的标准中,对模型违反的稳健性具有特别实际的重要性,因为通常无法完全了解进化过程的先验知识。对于系统发育推断中稳健性的研究,一种将明确定义的模型违反添加到模拟数据中的实用程序将是有帮助的。因此,我们引入了 ImOSM,这是一种将间歇性进化作为模型违反嵌入到比对中的工具。间歇性进化是指在树的分支上随机发生的额外替换,从而改变比对位点模式。这意味着额外的替换是在典型的序列进化过程完成后放置在树上的。然后,我们研究了广泛使用的系统发育方法的稳健性:最大似然(ML)、最大简约(MP)和基于距离的方法(BIONJ),以应对各种模型违反情况。跨位点速率(RaS)异质性的违反以及同时违反 RaS 和两个非相邻外部分支的转换/颠换比,会阻碍所有方法恢复具有四个分类单元的树的真实拓扑结构。对于具有八个分类单元的平衡树,违反情况会导致这三种方法中的每一种都推断出不同的拓扑结构。ML 和 MP 都失败了,而 BIONJ 则根据 ML 估计的参数计算距离,从而重建了真实的树。最后,我们报告说,模型同质性检验和拟合优度检验具有足够的能力来检测这种模型违反。这些测试的结果可以帮助实际对推断出的树有信心。因此,我们建议在实际的系统发育分析中使用这些测试。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fb11/3258038/a852f6028472/molbiolevolmsr220f06_ht.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fb11/3258038/bf48a5cae31f/molbiolevolmsr220f01_ht.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fb11/3258038/0e05ccf6bdfa/molbiolevolmsr220f02_ht.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fb11/3258038/97efe02d6789/molbiolevolmsr220f03_lw.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fb11/3258038/06960e7733dc/molbiolevolmsr220f04_ht.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fb11/3258038/59f6b5ec136d/molbiolevolmsr220f05_ht.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fb11/3258038/a852f6028472/molbiolevolmsr220f06_ht.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fb11/3258038/bf48a5cae31f/molbiolevolmsr220f01_ht.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fb11/3258038/0e05ccf6bdfa/molbiolevolmsr220f02_ht.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fb11/3258038/97efe02d6789/molbiolevolmsr220f03_lw.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fb11/3258038/06960e7733dc/molbiolevolmsr220f04_ht.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fb11/3258038/59f6b5ec136d/molbiolevolmsr220f05_ht.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fb11/3258038/a852f6028472/molbiolevolmsr220f06_ht.jpg

相似文献

1
ImOSM: intermittent evolution and robustness of phylogenetic methods.ImOSM:系统发育方法的间歇性进化和稳健性。
Mol Biol Evol. 2012 Feb;29(2):663-73. doi: 10.1093/molbev/msr220. Epub 2011 Sep 22.
2
Bayesian and maximum likelihood phylogenetic analyses of protein sequence data under relative branch-length differences and model violation.基于相对分支长度差异和模型违背情况下蛋白质序列数据的贝叶斯和最大似然系统发育分析。
BMC Evol Biol. 2005 Jan 28;5:8. doi: 10.1186/1471-2148-5-8.
3
Relative efficiencies of the maximum-likelihood, neighbor-joining, and maximum-parsimony methods when substitution rate varies with site.当替换率随位点变化时,最大似然法、邻接法和最大简约法的相对效率。
Mol Biol Evol. 1994 Mar;11(2):261-77. doi: 10.1093/oxfordjournals.molbev.a040108.
4
Phylogenetic analysis using parsimony and likelihood methods.使用简约法和似然法进行系统发育分析。
J Mol Evol. 1996 Feb;42(2):294-307. doi: 10.1007/BF02198856.
5
Relative efficiencies of the maximum parsimony and distance-matrix methods in obtaining the correct phylogenetic tree.最大简约法和距离矩阵法在获得正确系统发育树方面的相对效率。
Mol Biol Evol. 1988 May;5(3):298-311. doi: 10.1093/oxfordjournals.molbev.a040497.
6
Improvement of distance-based phylogenetic methods by a local maximum likelihood approach using triplets.使用三元组的局部最大似然方法对基于距离的系统发育方法的改进。
Mol Biol Evol. 2002 Nov;19(11):1952-63. doi: 10.1093/oxfordjournals.molbev.a004019.
7
Evaluating the robustness of phylogenetic methods to among-site variability in substitution processes.评估系统发育方法对替换过程中位点间变异性的稳健性。
Philos Trans R Soc Lond B Biol Sci. 2008 Dec 27;363(1512):4013-21. doi: 10.1098/rstb.2008.0162.
8
Efficiencies of fast algorithms of phylogenetic inference under the criteria of maximum parsimony, minimum evolution, and maximum likelihood when a large number of sequences are used.在使用大量序列时,基于最大简约法、最小进化法和最大似然法标准的系统发育推断快速算法的效率。
Mol Biol Evol. 2000 Aug;17(8):1251-8. doi: 10.1093/oxfordjournals.molbev.a026408.
9
Relative efficiencies of the maximum likelihood, maximum parsimony, and neighbor-joining methods for estimating protein phylogeny.用于估计蛋白质系统发育的最大似然法、最大简约法和邻接法的相对效率。
Mol Phylogenet Evol. 1993 Mar;2(1):1-5. doi: 10.1006/mpev.1993.1001.
10
Inferring Trees.推断树
Methods Mol Biol. 2017;1525:349-377. doi: 10.1007/978-1-4939-6622-6_14.

引用本文的文献

1
A Novel Test for Absolute Fit of Evolutionary Models Provides a Means to Correctly Identify the Substitution Model and the Model Tree.一种新的进化模型绝对拟合检验方法提供了一种正确识别替代模型和模型树的手段。
Genome Biol Evol. 2019 Aug 1;11(8):2403-2419. doi: 10.1093/gbe/evz167.
2
SpartaABC: a web server to simulate sequences with indel parameters inferred using an approximate Bayesian computation algorithm.SpartaABC:一个 Web 服务器,用于模拟使用近似贝叶斯计算算法推断出的插入缺失参数的序列。
Nucleic Acids Res. 2017 Jul 3;45(W1):W453-W457. doi: 10.1093/nar/gkx322.
3
Inferring Indel Parameters using a Simulation-based Approach.

本文引用的文献

1
MISFITS: evaluating the goodness of fit between a phylogenetic model and an alignment.MISFITS:评估系统发育模型与比对之间的适配度。
Mol Biol Evol. 2011 Jan;28(1):143-52. doi: 10.1093/molbev/msq180. Epub 2010 Jul 19.
2
Phylogenetic tree reconstruction accuracy and model fit when proportions of variable sites change across the tree.系统发育树重建准确性和模型拟合度随树中变位点比例的变化而变化。
Syst Biol. 2010 May;59(3):288-97. doi: 10.1093/sysbio/syq003. Epub 2010 Mar 1.
3
Long-branch attraction bias and inconsistency in Bayesian phylogenetics.
使用基于模拟的方法推断插入缺失参数。
Genome Biol Evol. 2015 Nov 3;7(12):3226-38. doi: 10.1093/gbe/evv212.
4
Ultrafast approximation for phylogenetic bootstrap.快速近似的系统发育自举法。
Mol Biol Evol. 2013 May;30(5):1188-95. doi: 10.1093/molbev/mst024. Epub 2013 Feb 15.
5
On the group theoretical background of assigning stepwise mutations onto phylogenies.关于在系统发育树上分配逐步突变的群论背景。
Algorithms Mol Biol. 2012 Dec 15;7(1):36. doi: 10.1186/1748-7188-7-36.
贝叶斯系统发育学中的长枝吸引偏差和不一致性。
PLoS One. 2009 Dec 9;4(12):e7891. doi: 10.1371/journal.pone.0007891.
4
General heterotachy and distance method adjustments.一般异速和距离法调整。
Mol Biol Evol. 2009 Dec;26(12):2689-97. doi: 10.1093/molbev/msp184. Epub 2009 Aug 17.
5
INDELible: a flexible simulator of biological sequence evolution.INDELible:一款灵活的生物序列进化模拟器。
Mol Biol Evol. 2009 Aug;26(8):1879-88. doi: 10.1093/molbev/msp098. Epub 2009 May 7.
6
LineageSpecificSeqgen: generating sequence data with lineage-specific variation in the proportion of variable sites.谱系特异性序列生成器:生成可变位点比例具有谱系特异性变异的序列数据。
BMC Evol Biol. 2008 Nov 21;8:317. doi: 10.1186/1471-2148-8-317.
7
Accommodating the effect of ancient DNA damage on inferences of demographic histories.适应古代DNA损伤对人口历史推断的影响。
Mol Biol Evol. 2009 Feb;26(2):245-8. doi: 10.1093/molbev/msn256. Epub 2008 Nov 11.
8
A rapid bootstrap algorithm for the RAxML Web servers.一种用于RAxML网络服务器的快速自引导算法。
Syst Biol. 2008 Oct;57(5):758-71. doi: 10.1080/10635150802429642.
9
The impact of single substitutions on multiple sequence alignments.单个替换对多序列比对的影响。
Philos Trans R Soc Lond B Biol Sci. 2008 Dec 27;363(1512):4041-7. doi: 10.1098/rstb.2008.0140.
10
Modelling heterotachy in phylogenetic inference by reversible-jump Markov chain Monte Carlo.通过可逆跳跃马尔可夫链蒙特卡罗方法在系统发育推断中对异速进行建模。
Philos Trans R Soc Lond B Biol Sci. 2008 Dec 27;363(1512):3955-64. doi: 10.1098/rstb.2008.0178.