• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

当序列位点以可变速率进化时重建树。

Reconstructing trees when sequence sites evolve at variable rates.

作者信息

Steel M A, Székely L A, Hendy M D

机构信息

Mathematics and Statistics Department, University of Canterbury, Christchurch, NZ.

出版信息

J Comput Biol. 1994 Summer;1(2):153-63. doi: 10.1089/cmb.1994.1.153.

DOI:10.1089/cmb.1994.1.153
PMID:8790461
Abstract

For a sequence of colors independently evolving on a tree under a simple Markov model, we consider conditions under which the tree can be uniquely recovered from the "sequence spectrum"-the expected frequencies of the various leaf colorations. This is relevant for phylogenetic analysis (where colors represent nucleotides or amino acids; leaves represent extant taxa) as the sequence spectrum is estimated directly from a collection of aligned sequences. Allowing the rate of the evolutionary process to vary across sites is an important extension over most previous studies-we show that, given suitable restrictions on the rate distribution, the true tree (up to the placement of its root) is uniquely identified by its sequence spectrum. However, if the rate distribution is unknown and arbitrary, then, for simple models, it is possible for every tree to produce the same sequence spectrum. Hence there is a logical barrier to accurate, consistent phylogenetic inference for these models when assumptions about the rate distribution are not made. This result exploits a novel theorem on the action of polynomials with non-negative coefficients on sequences.

摘要

对于在简单马尔可夫模型下在树上独立演化的一系列颜色,我们考虑在哪些条件下可以从“序列谱”(即各种叶色的预期频率)唯一地恢复树。这与系统发育分析相关(其中颜色代表核苷酸或氨基酸;叶代表现存分类群),因为序列谱是直接从一组比对序列中估计出来的。允许进化过程的速率在不同位点变化是对大多数先前研究的一个重要扩展——我们表明,在对速率分布有适当限制的情况下,真实的树(直至其根的位置)由其序列谱唯一确定。然而,如果速率分布未知且任意,那么对于简单模型,每棵树都有可能产生相同的序列谱。因此,在不对速率分布做出假设的情况下,这些模型进行准确、一致的系统发育推断存在逻辑障碍。这一结果利用了关于非负系数多项式对序列作用的一个新定理。

相似文献

1
Reconstructing trees when sequence sites evolve at variable rates.当序列位点以可变速率进化时重建树。
J Comput Biol. 1994 Summer;1(2):153-63. doi: 10.1089/cmb.1994.1.153.
2
Bayesian coestimation of phylogeny and sequence alignment.系统发育与序列比对的贝叶斯联合估计
BMC Bioinformatics. 2005 Apr 1;6:83. doi: 10.1186/1471-2105-6-83.
3
Identifiability of the unrooted species tree topology under the coalescent model with time-reversible substitution processes, site-specific rate variation, and invariable sites.在具有时间可逆替换过程、位点特异性速率变异和不变位点的合并模型下无根物种树拓扑结构的可识别性。
J Theor Biol. 2015 Jun 7;374:35-47. doi: 10.1016/j.jtbi.2015.03.006. Epub 2015 Mar 17.
4
Identifying evolutionary trees and substitution parameters for the general Markov model with invariable sites.识别具有不变位点的一般马尔可夫模型的进化树和替换参数。
Math Biosci. 2008 Jan;211(1):18-33. doi: 10.1016/j.mbs.2007.09.001. Epub 2007 Sep 25.
5
Developing a statistically powerful measure for quartet tree inference using phylogenetic identities and Markov invariants.利用系统发育一致性和马尔可夫不变量开发一种用于四重树推断的具有统计学效力的方法。
J Math Biol. 2017 Dec;75(6-7):1619-1654. doi: 10.1007/s00285-017-1129-2. Epub 2017 Apr 22.
6
Do tree split probabilities determine the branch lengths?树分裂概率决定分支长度吗?
J Theor Biol. 2015 Jun 7;374:54-9. doi: 10.1016/j.jtbi.2015.03.024. Epub 2015 Apr 3.
7
Joint Bayesian estimation of alignment and phylogeny.比对与系统发育的联合贝叶斯估计。
Syst Biol. 2005 Jun;54(3):401-18. doi: 10.1080/10635150590947041.
8
What can and what cannot be inferred from pairwise sequence comparisons?从成对序列比较中可以推断出什么,又不能推断出什么?
Math Biosci. 1998 Dec 1;154(1):1-21. doi: 10.1016/s0025-5564(98)10044-5.
9
Consistency of Bayesian inference of resolved phylogenetic trees.解析系统发育树的贝叶斯推断的一致性
J Theor Biol. 2013 Nov 7;336:246-9. doi: 10.1016/j.jtbi.2013.08.012. Epub 2013 Aug 20.
10
Modelling the ancestral sequence distribution and model frequencies in context-dependent models for primate non-coding sequences.对灵长类非编码序列的上下文相关模型中的祖先序列分布和模型频率进行建模。
BMC Evol Biol. 2010 Aug 10;10:244. doi: 10.1186/1471-2148-10-244.

引用本文的文献

1
A quartet-based approach for inferring phylogenetically informative features from genomic and phenomic data.一种基于四重奏的方法,用于从基因组和表型组数据中推断系统发育信息特征。
Comput Struct Biotechnol J. 2025 Aug 22;27:3710-3718. doi: 10.1016/j.csbj.2025.08.015. eCollection 2025.
2
Parsimony and the rank of a flattening matrix.简约与展平矩阵的秩。
J Math Biol. 2023 Feb 9;86(3):44. doi: 10.1007/s00285-023-01875-y.
3
Phylogenetic mixtures and linear invariants for equal input models.等输入模型的系统发育混合与线性不变量
J Math Biol. 2017 Apr;74(5):1107-1138. doi: 10.1007/s00285-016-1055-8. Epub 2016 Sep 7.
4
Identifiability and inference of non-parametric rates-across-sites models on large-scale phylogenies.大规模系统发育树上非参数跨位点速率模型的可识别性与推断
J Math Biol. 2013 Oct;67(4):767-97. doi: 10.1007/s00285-012-0571-4. Epub 2012 Aug 9.
5
Measuring fit of sequence data to phylogenetic model: gain of power using marginal tests.衡量序列数据与系统发育模型的拟合度:使用边缘检验增加功效。
J Mol Evol. 2009 Oct;69(4):289-99. doi: 10.1007/s00239-009-9268-8. Epub 2009 Oct 23.
6
Estimation of evolutionary distances under stationary and nonstationary models of nucleotide substitution.核苷酸替换的平稳和非平稳模型下进化距离的估计。
Proc Natl Acad Sci U S A. 1998 May 26;95(11):5899-905. doi: 10.1073/pnas.95.11.5899.