• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

从过滤数据中估算树木:形态系统发生学模型的可识别性。

Estimating trees from filtered data: identifiability of models for morphological phylogenetics.

机构信息

Department of Mathematics and Statistics, University of Alaska Fairbanks, PO Box 756660, Fairbanks, AK 99775, USA.

出版信息

J Theor Biol. 2010 Mar 7;263(1):108-19. doi: 10.1016/j.jtbi.2009.12.001. Epub 2009 Dec 11.

DOI:10.1016/j.jtbi.2009.12.001
PMID:20004210
Abstract

As an alternative to parsimony analyses, stochastic models have been proposed (Lewis, 2001; Nylander et al., 2004) for morphological characters, so that maximum likelihood or Bayesian analyses may be used for phylogenetic inference. A key feature of these models is that they account for ascertainment bias, in that only varying, or parsimony-informative characters are observed. However, statistical consistency of such model-based inference requires that the model parameters be identifiable from the joint distribution they entail, and this issue has not been addressed. Here we prove that parameters for several such models, with finite state spaces of arbitrary size, are identifiable, provided the tree has at least eight leaves. If the tree topology is already known, then seven leaves suffice for identifiability of the numerical parameters. The method of proof involves first inferring a full distribution of both parsimony-informative and non-informative pattern joint probabilities from the parsimony-informative ones, using phylogenetic invariants. The failure of identifiability of the tree parameter for four-taxon trees is also investigated.

摘要

作为简约分析的替代方法,已经提出了用于形态特征的随机模型(Lewis,2001;Nylander 等人,2004),以便可以对系统发育进行最大似然或贝叶斯分析。这些模型的一个关键特征是它们考虑了鉴定偏差,即仅观察到变化或简约信息丰富的特征。然而,基于模型的推断的统计一致性要求模型参数可以从它们所涉及的联合分布中识别出来,而这个问题尚未得到解决。在这里,我们证明了具有任意大小有限状态空间的几个这样的模型的参数是可识别的,前提是树至少有八个叶子。如果树拓扑结构已知,则七个叶子就足以识别数值参数。证明方法涉及首先使用系统发育不变量从简约信息丰富的特征推断出简约信息丰富和非信息丰富模式联合概率的完整分布。还研究了四分类树的树参数不可识别的情况。

相似文献

1
Estimating trees from filtered data: identifiability of models for morphological phylogenetics.从过滤数据中估算树木:形态系统发生学模型的可识别性。
J Theor Biol. 2010 Mar 7;263(1):108-19. doi: 10.1016/j.jtbi.2009.12.001. Epub 2009 Dec 11.
2
Fair-balance paradox, star-tree paradox, and Bayesian phylogenetics.公平平衡悖论、星树悖论与贝叶斯系统发育学
Mol Biol Evol. 2007 Aug;24(8):1639-55. doi: 10.1093/molbev/msm081. Epub 2007 May 7.
3
Fundamental differences between the methods of maximum likelihood and maximum posterior probability in phylogenetics.系统发育学中最大似然法与最大后验概率法的根本差异。
Syst Biol. 2006 Feb;55(1):116-21. doi: 10.1080/10635150500481648.
4
Identifiability of parameters in MCMC Bayesian inference of phylogeny.系统发育的MCMC贝叶斯推断中参数的可识别性。
Syst Biol. 2002 Oct;51(5):754-60. doi: 10.1080/10635150290102429.
5
Very fast algorithms for evaluating the stability of ML and Bayesian phylogenetic trees from sequence data.用于从序列数据评估最大似然法和贝叶斯系统发育树稳定性的超快速算法。
Genome Inform. 2002;13:82-92.
6
Performance of maximum parsimony and likelihood phylogenetics when evolution is heterogeneous.当进化具有异质性时最大简约法和似然法系统发育分析的性能
Nature. 2004 Oct 21;431(7011):980-4. doi: 10.1038/nature02917.
7
An examination of the monophyly of morning glory taxa using Bayesian phylogenetic inference.使用贝叶斯系统发育推断对旋花科分类群的单系性进行研究。
Syst Biol. 2002 Oct;51(5):740-53. doi: 10.1080/10635150290102401.
8
Failed refutations: further comments on parsimony and likelihood methods and their relationship to Popper's degree of corroboration.失败的反驳:关于简约法和似然法及其与波普尔确证度关系的进一步评论
Syst Biol. 2003 Jun;52(3):352-67.
9
Phylogenetic MCMC algorithms are misleading on mixtures of trees.系统发育马尔可夫链蒙特卡罗算法在树的混合模型上具有误导性。
Science. 2005 Sep 30;309(5744):2207-9. doi: 10.1126/science.1115493.
10
Species trees from gene trees: reconstructing Bayesian posterior distributions of a species phylogeny using estimated gene tree distributions.从基因树构建物种树:利用估计的基因树分布重建物种系统发育的贝叶斯后验分布。
Syst Biol. 2007 Jun;56(3):504-14. doi: 10.1080/10635150701429982.

引用本文的文献

1
An early-diverging iguanodontian (Dinosauria: Rhabdodontomorpha) from the Late Cretaceous of North America.早白垩世北美的一种早期分化的禽龙类(恐龙:坚尾龙类)。
PLoS One. 2023 Jun 7;18(6):e0286042. doi: 10.1371/journal.pone.0286042. eCollection 2023.
2
Parameter Identifiability for a Profile Mixture Model of Protein Evolution.蛋白质进化轨迹混合模型的参数可识别性
J Comput Biol. 2021 Jun;28(6):570-586. doi: 10.1089/cmb.2020.0315. Epub 2021 May 6.
3
Including autapomorphies is important for paleontological tip-dating with clocklike data, but not with non-clock data.
对于使用类似时钟的数据进行古生物学末端定年而言,纳入自近裔性状很重要,但对于非时钟数据则不然。
PeerJ. 2018 Apr 6;6:e4553. doi: 10.7717/peerj.4553. eCollection 2018.
4
Combined Analysis of Extant Rhynchonellida (Brachiopoda) using Morphological and Molecular Data.利用形态学和分子数据对现存小嘴贝目(腕足动物门)进行综合分析
Syst Biol. 2018 Jan 1;67(1):32-48. doi: 10.1093/sysbio/syx049.
5
Utility of characters evolving at diverse rates of evolution to resolve quartet trees with unequal branch lengths: analytical predictions of long-branch effects.以不同进化速率演变的性状在解析具有不等分支长度的四重树时的效用:长分支效应的分析预测
BMC Evol Biol. 2015 May 14;15:86. doi: 10.1186/s12862-015-0364-7.
6
Bayesian analysis using a simple likelihood model outperforms parsimony for estimation of phylogeny from discrete morphological data.使用简单似然模型的贝叶斯分析在从离散形态数据估计系统发育方面比简约法表现更优。
PLoS One. 2014 Oct 3;9(10):e109210. doi: 10.1371/journal.pone.0109210. eCollection 2014.
7
Identifying the rooted species tree from the distribution of unrooted gene trees under the coalescent.在溯祖理论下,根据无根基因树的分布确定有根物种树。
J Math Biol. 2011 Jun;62(6):833-62. doi: 10.1007/s00285-010-0355-7. Epub 2010 Jul 23.