• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

系统发育树重建准确性和模型拟合度随树中变位点比例的变化而变化。

Phylogenetic tree reconstruction accuracy and model fit when proportions of variable sites change across the tree.

机构信息

Institut für Botanik III, Heinrich-Heine Universität, Universitätstrasse 1, Düsseldorf, Germany.

出版信息

Syst Biol. 2010 May;59(3):288-97. doi: 10.1093/sysbio/syq003. Epub 2010 Mar 1.

DOI:10.1093/sysbio/syq003
PMID:20525636
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC2850392/
Abstract

Commonly used phylogenetic models assume a homogeneous process through time in all parts of the tree. However, it is known that these models can be too simplistic as they do not account for nonhomogeneous lineage-specific properties. In particular, it is now widely recognized that as constraints on sequences evolve, the proportion and positions of variable sites can vary between lineages causing heterotachy. The extent to which this model misspecification affects tree reconstruction is still unknown. Here, we evaluate the effect of changes in the proportions and positions of variable sites on model fit and tree estimation. We consider 5 current models of nucleotide sequence evolution in a Bayesian Markov chain Monte Carlo framework as well as maximum parsimony (MP). We show that for a tree with 4 lineages where 2 nonsister taxa undergo a change in the proportion of variable sites tree reconstruction under the best-fitting model, which is chosen using a relative test, often results in the wrong tree. In this case, we found that an absolute test of model fit is a better predictor of tree estimation accuracy. We also found further evidence that MP is not immune to heterotachy. In addition, we show that increased sampling of taxa that have undergone a change in proportion and positions of variable sites is critical for accurate tree reconstruction.

摘要

常用的系统发育模型假设树的所有部分在整个时间内都具有均匀的进化过程。然而,已知这些模型可能过于简单化,因为它们没有考虑到非均匀的谱系特异性属性。特别是,现在已经广泛认识到,随着序列约束的进化,变异位点的比例和位置在谱系之间可能会发生变化,导致异速进化。这种模型指定不当对树重建的影响程度尚不清楚。在这里,我们评估了变异位点比例和位置变化对模型拟合和树估计的影响。我们在贝叶斯马尔可夫链蒙特卡罗框架中考虑了 5 种当前的核苷酸序列进化模型,以及最大简约法 (MP)。我们表明,对于一个有 4 个谱系的树,其中 2 个非姐妹分类群的变异位点比例发生变化,使用相对测试选择最佳拟合模型进行树重建,通常会导致错误的树。在这种情况下,我们发现模型拟合的绝对测试是树估计准确性的更好预测指标。我们还进一步发现证据表明,MP 不能免受异速进化的影响。此外,我们表明,增加对比例和位置发生变化的分类群的采样对于准确的树重建至关重要。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3eec/2850392/d148f8fd4242/sysbiosyq003f09_3c.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3eec/2850392/a46683281bc8/sysbiosyq003f01_3c.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3eec/2850392/c9843dd32059/sysbiosyq003f02_lw.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3eec/2850392/e085a99fdb4c/sysbiosyq003f03_3c.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3eec/2850392/cae92e283923/sysbiosyq003f04_3c.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3eec/2850392/c84b26a27053/sysbiosyq003f05_3c.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3eec/2850392/0cb604bc77f6/sysbiosyq003f06_3c.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3eec/2850392/d4d5da1f1b79/sysbiosyq003f07_3c.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3eec/2850392/988fbf2b3545/sysbiosyq003f08_3c.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3eec/2850392/d148f8fd4242/sysbiosyq003f09_3c.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3eec/2850392/a46683281bc8/sysbiosyq003f01_3c.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3eec/2850392/c9843dd32059/sysbiosyq003f02_lw.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3eec/2850392/e085a99fdb4c/sysbiosyq003f03_3c.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3eec/2850392/cae92e283923/sysbiosyq003f04_3c.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3eec/2850392/c84b26a27053/sysbiosyq003f05_3c.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3eec/2850392/0cb604bc77f6/sysbiosyq003f06_3c.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3eec/2850392/d4d5da1f1b79/sysbiosyq003f07_3c.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3eec/2850392/988fbf2b3545/sysbiosyq003f08_3c.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3eec/2850392/d148f8fd4242/sysbiosyq003f09_3c.jpg

相似文献

1
Phylogenetic tree reconstruction accuracy and model fit when proportions of variable sites change across the tree.系统发育树重建准确性和模型拟合度随树中变位点比例的变化而变化。
Syst Biol. 2010 May;59(3):288-97. doi: 10.1093/sysbio/syq003. Epub 2010 Mar 1.
2
Modelling heterotachy in phylogenetic inference by reversible-jump Markov chain Monte Carlo.通过可逆跳跃马尔可夫链蒙特卡罗方法在系统发育推断中对异速进行建模。
Philos Trans R Soc Lond B Biol Sci. 2008 Dec 27;363(1512):3955-64. doi: 10.1098/rstb.2008.0178.
3
Modeling compositional heterogeneity.对成分异质性进行建模。
Syst Biol. 2004 Jun;53(3):485-95. doi: 10.1080/10635150490445779.
4
LineageSpecificSeqgen: generating sequence data with lineage-specific variation in the proportion of variable sites.谱系特异性序列生成器:生成可变位点比例具有谱系特异性变异的序列数据。
BMC Evol Biol. 2008 Nov 21;8:317. doi: 10.1186/1471-2148-8-317.
5
Bayesian coestimation of phylogeny and sequence alignment.系统发育与序列比对的贝叶斯联合估计
BMC Bioinformatics. 2005 Apr 1;6:83. doi: 10.1186/1471-2105-6-83.
6
Modelling the ancestral sequence distribution and model frequencies in context-dependent models for primate non-coding sequences.对灵长类非编码序列的上下文相关模型中的祖先序列分布和模型频率进行建模。
BMC Evol Biol. 2010 Aug 10;10:244. doi: 10.1186/1471-2148-10-244.
7
Heterotachy and tree building: a case study with plastids and eubacteria.异速进化与系统发育树构建:以质体和真细菌为例的研究
Mol Biol Evol. 2006 Jan;23(1):40-5. doi: 10.1093/molbev/msj005. Epub 2005 Sep 8.
8
Markov-Modulated Continuous-Time Markov Chains to Identify Site- and Branch-Specific Evolutionary Variation in BEAST.马科夫调制连续时间马科夫链在 BEAST 中识别位点和分支特异性进化变化。
Syst Biol. 2021 Jan 1;70(1):181-189. doi: 10.1093/sysbio/syaa037.
9
Bayes or bootstrap? A simulation study comparing the performance of Bayesian Markov chain Monte Carlo sampling and bootstrapping in assessing phylogenetic confidence.贝叶斯法还是自助法?一项比较贝叶斯马尔可夫链蒙特卡罗抽样和自助法在评估系统发育置信度时性能的模拟研究。
Mol Biol Evol. 2003 Feb;20(2):255-66. doi: 10.1093/molbev/msg028.
10
Guided tree topology proposals for Bayesian phylogenetic inference.贝叶斯系统发育推断的引导树拓扑提议。
Syst Biol. 2012 Jan;61(1):1-11. doi: 10.1093/sysbio/syr074. Epub 2011 Aug 9.

引用本文的文献

1
Multi-sample long-read nanopore sequencing of Agabus bipustulatus (Coleoptera: Dytiscidae: Agabinae) mitogenome produces effectively reliable data for downstream analyses.对二斑沼梭(鞘翅目:龙虱科:沼梭亚科)线粒体基因组进行多样本长读长纳米孔测序,可为下游分析有效产出可靠数据。
J Insect Sci. 2025 Jan 20;25(1). doi: 10.1093/jisesa/ieaf009.
2
Comparing Partitioned Models to Mixture Models: Do Information Criteria Apply?比较分区模型与混合模型:信息准则是否适用?
Syst Biol. 2022 Oct 12;71(6):1541-1548. doi: 10.1093/sysbio/syac003.
3
An Analysis of Methylome Evolution in Primates.

本文引用的文献

1
LineageSpecificSeqgen: generating sequence data with lineage-specific variation in the proportion of variable sites.谱系特异性序列生成器:生成可变位点比例具有谱系特异性变异的序列数据。
BMC Evol Biol. 2008 Nov 21;8:317. doi: 10.1186/1471-2148-8-317.
2
Modelling heterotachy in phylogenetic inference by reversible-jump Markov chain Monte Carlo.通过可逆跳跃马尔可夫链蒙特卡罗方法在系统发育推断中对异速进行建模。
Philos Trans R Soc Lond B Biol Sci. 2008 Dec 27;363(1512):3955-64. doi: 10.1098/rstb.2008.0178.
3
Difficulties in testing for covarion-like properties of sequences under the confounding influence of changing proportions of variable sites.
灵长类动物甲基组进化分析。
Mol Biol Evol. 2021 Oct 27;38(11):4700-4714. doi: 10.1093/molbev/msab189.
4
Genome-wide mapping using new AFLP markers to explore intraspecific variation among pathogenic Sporothrix species.利用新的 AFLP 标记进行全基因组图谱绘制,以探索致病性申克孢子丝菌种内变异。
PLoS Negl Trop Dis. 2020 Jul 1;14(7):e0008330. doi: 10.1371/journal.pntd.0008330. eCollection 2020 Jul.
5
A Novel Test for Absolute Fit of Evolutionary Models Provides a Means to Correctly Identify the Substitution Model and the Model Tree.一种新的进化模型绝对拟合检验方法提供了一种正确识别替代模型和模型树的手段。
Genome Biol Evol. 2019 Aug 1;11(8):2403-2419. doi: 10.1093/gbe/evz167.
6
Evaluating the Evolutionary Origins of Unexpected Character Distributions within the Bacterial Planctomycetes-Verrucomicrobia-Chlamydiae Superphylum.评估细菌浮霉菌门-疣微菌门-衣原体超门中意想不到的特征分布的进化起源。
Front Microbiol. 2012 Nov 23;3:401. doi: 10.3389/fmicb.2012.00401. eCollection 2012.
7
ImOSM: intermittent evolution and robustness of phylogenetic methods.ImOSM:系统发育方法的间歇性进化和稳健性。
Mol Biol Evol. 2012 Feb;29(2):663-73. doi: 10.1093/molbev/msr220. Epub 2011 Sep 22.
在可变位点比例变化的混杂影响下,对序列的类共变性质进行检测时所面临的困难。
Mol Biol Evol. 2008 Jul;25(7):1512-20. doi: 10.1093/molbev/msn098. Epub 2008 Apr 18.
4
A mixed branch length model of heterotachy improves phylogenetic accuracy.一种异速进化的混合分支长度模型提高了系统发育准确性。
Mol Biol Evol. 2008 Jun;25(6):1054-66. doi: 10.1093/molbev/msn042. Epub 2008 Mar 3.
5
Taxon sampling affects inferences of macroevolutionary processes from phylogenetic trees.分类群抽样会影响从系统发育树推断宏观进化过程。
Syst Biol. 2008 Feb;57(1):160-6. doi: 10.1080/10635150701884640.
6
The problem of rooting rapid radiations.快速辐射的根源问题。
Mol Biol Evol. 2007 Nov;24(11):2400-11. doi: 10.1093/molbev/msm178. Epub 2007 Aug 23.
7
Artifactual phylogenies caused by correlated distribution of substitution rates among sites and lineages: the good, the bad, and the ugly.由位点和谱系间替换率的相关分布导致的人为系统发育树:好的、坏的和丑陋的。
Syst Biol. 2007 Feb;56(1):68-82. doi: 10.1080/10635150601175578.
8
Accounting for variation of substitution rates through time in Bayesian phylogeny reconstruction of Sapotoideae (Sapotaceae).在山榄科(Sapotaceae)山榄亚科(Sapotoideae)的贝叶斯系统发育重建中考虑替换率随时间的变化。
Mol Phylogenet Evol. 2006 Jun;39(3):706-21. doi: 10.1016/j.ympev.2006.01.018. Epub 2006 Feb 21.
9
Affiliation of Cochlosoma to trichomonads confirmed by phylogenetic analysis of the small-subunit rRNA gene and a new family concept of the order Trichomonadida.
Int J Syst Evol Microbiol. 2006 Jan;56(Pt 1):305-12. doi: 10.1099/ijs.0.63754-0.
10
A tale of two processes.两个过程的故事。
Syst Biol. 2005 Dec;54(6):948-51. doi: 10.1080/10635150500234682.