• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

由位点和谱系间替换率的相关分布导致的人为系统发育树:好的、坏的和丑陋的。

Artifactual phylogenies caused by correlated distribution of substitution rates among sites and lineages: the good, the bad, and the ugly.

作者信息

Ruano-Rubio Valentin, Fares Mario A

机构信息

Molecular Evolution and Bioinformatics Laboratory, Department of Biology, National University of Ireland, Maynooth, Ireland.

出版信息

Syst Biol. 2007 Feb;56(1):68-82. doi: 10.1080/10635150601175578.

DOI:10.1080/10635150601175578
PMID:17366138
Abstract

Despite the advances in understanding molecular evolution, current phylogenetic methods barely take account of a fraction of the complexity of evolution. We are chiefly constrained by our incomplete knowledge of molecular evolutionary processes and the limits of computational power. These limitations lead to the establishment of either biologically simplistic models that rarely account for a fraction of the complexity involved or overfitting models that add little resolution to the problem. Such oversimplified models may lead us to assign high confidence to an incorrect tree (inconsistency). Rate-across-site (RAS) models are commonly used evolutionary models in phylogenetic studies. These account for heterogeneity in the evolutionary rates among sites but do not account for changing within-site rates across lineages (heterotachy). If heterotachy is common, using RAS models may lead to systematic errors in tree inference. In this work we show possible misleading effects in tree inference when the assumption of constant within-site rates across lineages is violated using maximum likelihood. Using a simulation study, we explore the ways in which gamma stationary models can lead to wrong topology or to deceptive bootstrap support values when the within-site rates change across lineages. More precisely, we show that different degrees of heterotachy mislead phylogenetic inference when the model assumed is stationary. Finally, we propose a geometry-based approach to visualize and to test for the possible existence of bias due to heterotachy.

摘要

尽管在理解分子进化方面取得了进展,但当前的系统发育方法几乎没有考虑到进化复杂性的一小部分。我们主要受到对分子进化过程的不完全了解以及计算能力限制的约束。这些限制导致建立要么是生物学上过于简单的模型,很少考虑到所涉及复杂性的一小部分,要么是过度拟合的模型,对问题几乎没有增加分辨率。这种过于简化的模型可能会导致我们对错误的树赋予高置信度(不一致性)。位点间速率(RAS)模型是系统发育研究中常用的进化模型。这些模型考虑了位点间进化速率的异质性,但没有考虑跨谱系的位点内速率变化(异速进化)。如果异速进化很常见,使用RAS模型可能会导致树推断中的系统误差。在这项工作中,我们展示了在违反跨谱系位点内速率恒定的假设时,使用最大似然法进行树推断可能产生的误导性影响。通过模拟研究,我们探讨了在位点内速率跨谱系变化时,伽马平稳模型可能导致错误拓扑或欺骗性自展支持值的方式。更确切地说,我们表明当假设的模型是平稳的时,不同程度的异速进化会误导系统发育推断。最后,我们提出一种基于几何的方法来可视化并测试由于异速进化可能存在的偏差。

相似文献

1
Artifactual phylogenies caused by correlated distribution of substitution rates among sites and lineages: the good, the bad, and the ugly.由位点和谱系间替换率的相关分布导致的人为系统发育树:好的、坏的和丑陋的。
Syst Biol. 2007 Feb;56(1):68-82. doi: 10.1080/10635150601175578.
2
Maximum likelihood outperforms maximum parsimony even when evolutionary rates are heterotachous.即使进化速率是异速的,最大似然法也优于最大简约法。
Mol Biol Evol. 2005 Nov;22(11):2139-41. doi: 10.1093/molbev/msi212. Epub 2005 Jul 13.
3
On inconsistency of the neighbor-joining, least squares, and minimum evolution estimation when substitution processes are incorrectly modeled.当替代过程建模错误时邻接法、最小二乘法和最小进化估计的不一致性
Mol Biol Evol. 2004 Sep;21(9):1629-42. doi: 10.1093/molbev/msh159. Epub 2004 May 21.
4
General heterotachy and distance method adjustments.一般异速和距离法调整。
Mol Biol Evol. 2009 Dec;26(12):2689-97. doi: 10.1093/molbev/msp184. Epub 2009 Aug 17.
5
Likelihood, parsimony, and heterogeneous evolution.似然性、简约性与异质性进化
Mol Biol Evol. 2005 May;22(5):1161-4. doi: 10.1093/molbev/msi123. Epub 2005 Mar 2.
6
Heterotachy and tree building: a case study with plastids and eubacteria.异速进化与系统发育树构建:以质体和真细菌为例的研究
Mol Biol Evol. 2006 Jan;23(1):40-5. doi: 10.1093/molbev/msj005. Epub 2005 Sep 8.
7
Pitfalls of heterogeneous processes for phylogenetic reconstruction.系统发育重建异质过程的陷阱。
Syst Biol. 2007 Feb;56(1):113-24. doi: 10.1080/10635150701245388.
8
The effect of heterotachy in multigene analysis using the neighbor joining method.使用邻接法进行多基因分析时异速现象的影响。
Mol Phylogenet Evol. 2009 Sep;52(3):846-51. doi: 10.1016/j.ympev.2009.05.025. Epub 2009 May 29.
9
An empirical assessment of long-branch attraction artefacts in deep eukaryotic phylogenomics.对深层真核生物系统发育基因组学中长枝吸引假象的实证评估。
Syst Biol. 2005 Oct;54(5):743-57. doi: 10.1080/10635150500234609.
10
A test for heterotachy using multiple pairs of sequences.使用多对序列进行异序测试。
Mol Biol Evol. 2011 May;28(5):1661-73. doi: 10.1093/molbev/msq346. Epub 2010 Dec 24.

引用本文的文献

1
Cladogenesis and Genomic Streamlining in Extracellular Endosymbionts of Tropical Stink Bugs.细胞外共生体热带臭虫的分支进化和基因组简化。
Genome Biol Evol. 2018 Feb 1;10(2):680-693. doi: 10.1093/gbe/evy033.
2
Hsp27 gene in Drosophila ananassae subgroup was split by a recently acquired intron.果蝇类中的Hsp27基因被一个最近获得的内含子分割。
J Genet. 2016 Jun;95(2):257-62. doi: 10.1007/s12041-016-0629-y.
3
Multiple origins of endosymbiosis within the Enterobacteriaceae (γ-Proteobacteria): convergence of complex phylogenetic approaches.
肠杆菌科(γ-变形菌)内共生的多种起源:复杂系统发育方法的趋同。
BMC Biol. 2011 Dec 28;9:87. doi: 10.1186/1741-7007-9-87.
4
Using shifts in amino acid frequency and substitution rate to identify latent structural characters in base-excision repair enzymes.利用氨基酸频率和替换率的变化来鉴定碱基切除修复酶中的潜在结构特征。
PLoS One. 2011;6(10):e25246. doi: 10.1371/journal.pone.0025246. Epub 2011 Oct 6.
5
Phylogenetic tree reconstruction accuracy and model fit when proportions of variable sites change across the tree.系统发育树重建准确性和模型拟合度随树中变位点比例的变化而变化。
Syst Biol. 2010 May;59(3):288-97. doi: 10.1093/sysbio/syq003. Epub 2010 Mar 1.
6
Suprafamilial relationships among Rodentia and the phylogenetic effect of removing fast-evolving nucleotides in mitochondrial, exon and intron fragments.啮齿目动物之间的超科级关系以及去除线粒体、外显子和内含子片段中快速进化核苷酸的系统发育效应。
BMC Evol Biol. 2008 Nov 26;8:321. doi: 10.1186/1471-2148-8-321.
7
LineageSpecificSeqgen: generating sequence data with lineage-specific variation in the proportion of variable sites.谱系特异性序列生成器:生成可变位点比例具有谱系特异性变异的序列数据。
BMC Evol Biol. 2008 Nov 21;8:317. doi: 10.1186/1471-2148-8-317.
8
A mixed branch length model of heterotachy improves phylogenetic accuracy.一种异速进化的混合分支长度模型提高了系统发育准确性。
Mol Biol Evol. 2008 Jun;25(6):1054-66. doi: 10.1093/molbev/msn042. Epub 2008 Mar 3.
9
Topological estimation biases with covarion evolution.具有协变进化的拓扑估计偏差。
J Mol Evol. 2008 Jan;66(1):50-60. doi: 10.1007/s00239-007-9062-4. Epub 2007 Dec 14.