• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

核苷酸序列进化中的时空异质性。

Spatial and temporal heterogeneity in nucleotide sequence evolution.

作者信息

Whelan Simon

机构信息

Faculty of Life Sciences, University of Manchester, Michael Smith Building, Manchester M13 9PT, United Kingdom.

出版信息

Mol Biol Evol. 2008 Aug;25(8):1683-94. doi: 10.1093/molbev/msn119. Epub 2008 May 22.

DOI:10.1093/molbev/msn119
PMID:18502771
Abstract

Models of nucleotide substitution make many simplifying assumptions about the evolutionary process, including that the same process acts on all sites in an alignment and on all branches on the phylogenetic tree. Many studies have shown that in reality the substitution process is heterogeneous and that this variability can introduce systematic errors into many forms of phylogenetic analyses. I propose a new rigorous approach for describing heterogeneity called a temporal hidden Markov model (THMM), which can distinguish between among site (spatial) heterogeneity and among lineage (temporal) heterogeneity. Several versions of the THMM are applied to 16 sets of aligned sequences to quantitatively assess the different forms of heterogeneity acting within them. The most general THMM provides the best fit in all the data sets examined, providing strong evidence of pervasive heterogeneity during evolution. Investigating individual forms of heterogeneity provides further insights. In agreement with previous studies, spatial rate heterogeneity (rates across sites [RAS]) is inferred to be the single most prevalent form of heterogeneity. Interestingly, RAS appears so dominant that failure to independently include it in the THMM masks other forms of heterogeneity, particularly temporal heterogeneity. Incorporating RAS into the THMM reveals substantial temporal and spatial heterogeneity in nucleotide composition and bias toward transition substitution in all alignments examined, although the relative importance of different forms of heterogeneity varies between data sets. Furthermore, the improvements in model fit observed by adding complexity to the model suggest that the THMMs used in this study do not capture all the evolutionary heterogeneity occurring in the data. These observations all indicate that current tests may consistently underestimate the degree of temporal heterogeneity occurring in data. Finally, there is a weak link between the amount of heterogeneity detected and the level of divergence between the sequences, suggesting that variability in the evolutionary process will be a particular problem for deep phylogeny.

摘要

核苷酸替换模型对进化过程做了许多简化假设,包括同一过程作用于比对中的所有位点以及系统发育树的所有分支。许多研究表明,实际上替换过程是异质性的,这种变异性会给多种形式的系统发育分析引入系统误差。我提出一种新的严格方法来描述异质性,称为时间隐马尔可夫模型(THMM),它可以区分位点间(空间)异质性和谱系间(时间)异质性。将几种版本的THMM应用于16组比对序列,以定量评估其中存在的不同形式的异质性。最通用的THMM在所有检验的数据集中拟合效果最佳,有力证明了进化过程中普遍存在异质性。对个体形式的异质性进行研究能提供更多见解。与先前研究一致,空间速率异质性(位点间速率[RAS])被推断为最普遍的异质性形式。有趣的是,RAS显得如此占主导地位,以至于在THMM中未能独立纳入它会掩盖其他形式的异质性,尤其是时间异质性。将RAS纳入THMM后发现,在所检验的所有比对中,核苷酸组成存在大量时间和空间异质性,且偏向于转换替换,尽管不同形式异质性的相对重要性在不同数据集之间有所不同。此外,通过增加模型复杂度观察到的模型拟合改进表明,本研究中使用的THMM并未捕捉到数据中发生的所有进化异质性。这些观察结果都表明,当前的检验可能会持续低估数据中发生的时间异质性程度。最后,检测到的异质性量与序列间的分歧水平之间存在微弱联系,这表明进化过程中的变异性对于深层次系统发育将是一个特别的问题。

相似文献

1
Spatial and temporal heterogeneity in nucleotide sequence evolution.核苷酸序列进化中的时空异质性。
Mol Biol Evol. 2008 Aug;25(8):1683-94. doi: 10.1093/molbev/msn119. Epub 2008 May 22.
2
Dynamically heterogenous partitions and phylogenetic inference: an evaluation of analytical strategies with cytochrome b and ND6 gene sequences in cranes.动态异质分区与系统发育推断:利用鹤类细胞色素b和ND6基因序列对分析策略的评估
Mol Phylogenet Evol. 1999 Nov;13(2):302-13. doi: 10.1006/mpev.1999.0646.
3
The genetic code can cause systematic bias in simple phylogenetic models.遗传密码会在简单的系统发育模型中导致系统偏差。
Philos Trans R Soc Lond B Biol Sci. 2008 Dec 27;363(1512):4003-11. doi: 10.1098/rstb.2008.0171.
4
Reconstruction of ancestral nucleotide sequences and estimation of substitution frequencies in a star phylogeny.星状系统发育树中祖先核苷酸序列的重建及替换频率的估计。
Gene. 2007 Apr 1;390(1-2):75-83. doi: 10.1016/j.gene.2006.11.022. Epub 2006 Dec 14.
5
Parallel rate heterogeneity in chloroplast and mitochondrial genomes of Brazil nut trees (Lecythidaceae) is consistent with lineage effects.巴西坚果(玉蕊科)叶绿体和线粒体基因组中的平行速率异质性与谱系效应一致。
Mol Biol Evol. 2008 Jul;25(7):1282-96. doi: 10.1093/molbev/msn074. Epub 2008 Apr 2.
6
On the correlation between composition and site-specific evolutionary rate: implications for phylogenetic inference.关于组成与位点特异性进化速率之间的相关性:对系统发育推断的影响。
Mol Biol Evol. 2006 Feb;23(2):352-64. doi: 10.1093/molbev/msj040. Epub 2005 Oct 19.
7
An evolutionary model for protein-coding regions with conserved RNA structure.具有保守RNA结构的蛋白质编码区域的进化模型。
Mol Biol Evol. 2004 Oct;21(10):1913-22. doi: 10.1093/molbev/msh199. Epub 2004 Jun 30.
8
Artifactual phylogenies caused by correlated distribution of substitution rates among sites and lineages: the good, the bad, and the ugly.由位点和谱系间替换率的相关分布导致的人为系统发育树:好的、坏的和丑陋的。
Syst Biol. 2007 Feb;56(1):68-82. doi: 10.1080/10635150601175578.
9
Multidimensional vector space representation for convergent evolution and molecular phylogeny.趋同进化和分子系统发育的多维向量空间表示
Mol Biol Evol. 2005 Mar;22(3):704-15. doi: 10.1093/molbev/msi051. Epub 2004 Nov 17.
10
Accounting for gene rate heterogeneity in phylogenetic inference.在系统发育推断中考虑基因速率异质性。
Syst Biol. 2007 Apr;56(2):194-205. doi: 10.1080/10635150701291804.

引用本文的文献

1
Infinite Mixture Models for Improved Modeling of Across-Site Evolutionary Variation.用于改进跨位点进化变异建模的无限混合模型。
Mol Biol Evol. 2025 Jul 30;42(8). doi: 10.1093/molbev/msaf199.
2
Phylogenomics provides robust support for a two-domains tree of life.系统发生基因组学为二域生命树提供了强有力的支持。
Nat Ecol Evol. 2020 Jan;4(1):138-147. doi: 10.1038/s41559-019-1040-x. Epub 2019 Dec 9.
3
Population Genetics Based Phylogenetics Under Stabilizing Selection for an Optimal Amino Acid Sequence: A Nested Modeling Approach.
基于稳定选择的最优氨基酸序列的种群遗传学系统发育:嵌套建模方法。
Mol Biol Evol. 2019 Apr 1;36(4):834-851. doi: 10.1093/molbev/msy222.
4
The tangled bank of amino acids.错综复杂的氨基酸库
Protein Sci. 2016 Jul;25(7):1354-62. doi: 10.1002/pro.2930. Epub 2016 May 12.
5
Evidence of Statistical Inconsistency of Phylogenetic Methods in the Presence of Multiple Sequence Alignment Uncertainty.在存在多序列比对不确定性的情况下系统发育方法统计不一致性的证据。
Genome Biol Evol. 2015 Jul 1;7(8):2102-16. doi: 10.1093/gbe/evv127.
6
The relationship between dN/dS and scaled selection coefficients.非同义替换率与标度化选择系数之间的关系。
Mol Biol Evol. 2015 Apr;32(4):1097-108. doi: 10.1093/molbev/msv003. Epub 2015 Jan 8.
7
Building megaphylogenies for macroecology: taking up the challenge.构建用于宏观生态学的巨型系统发育树:迎接挑战。
Ecography. 2013 Jan 1;36(1):13-26. doi: 10.1111/j.1600-0587.2012.07773.x.
8
Bayesian selection of nucleotide substitution models and their site assignments.贝叶斯选择核苷酸替换模型及其位点分配。
Mol Biol Evol. 2013 Mar;30(3):669-88. doi: 10.1093/molbev/mss258. Epub 2012 Dec 11.
9
Addressing inter-gene heterogeneity in maximum likelihood phylogenomic analysis: yeasts revisited.解决最大似然系统发生基因组分析中的基因间异质性:以酵母为例。
PLoS One. 2011;6(8):e22783. doi: 10.1371/journal.pone.0022783. Epub 2011 Aug 5.
10
Sources of signal in 62 protein-coding nuclear genes for higher-level phylogenetics of arthropods.用于节肢动物高级系统发育的 62 个蛋白质编码核基因中的信号源。
PLoS One. 2011;6(8):e23408. doi: 10.1371/journal.pone.0023408. Epub 2011 Aug 4.