• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

存在缺失数据时基于似然法的系统发育分析的误导性结果。

Misleading results of likelihood-based phylogenetic analyses in the presence of missing data.

作者信息

Simmons Mark P

出版信息

Cladistics. 2012 Apr;28(2):208-222. doi: 10.1111/j.1096-0031.2011.00375.x. Epub 2011 Oct 3.

DOI:10.1111/j.1096-0031.2011.00375.x
PMID:34872185
Abstract

The amount of missing data in many contemporary phylogenetic analyses has substantially increased relative to previous norms, particularly in supermatrix studies that compile characters from multiple previous analyses. In such cases the missing data are non-randomly distributed and usually present in all partitions (i.e. groups of characters) sampled. Parametric methods often provide greater resolution and support than parsimony in such cases, yet this may be caused by extrapolation of branch lengths from one partition to another. In this study I use contrived and simulated examples to demonstrate that likelihood, even when applied to simple matrices with little or no homoplasy, homogeneous evolution across groups of characters, perfect model fit, and hundreds or thousands of variable characters, can provide strong support for incorrect topologies when the matrices have non-random distributions of missing data distributed across all partitions. I do so using a systematic exploration of alternative seven-taxon tree topologies and distributions of missing data in two partitions to demonstrate that these likelihood-based artefacts may occur frequently and are not shared by parsimony. I also demonstrate that Bayesian Markov chain Monte Carlo analysis is more robust to these artefacts than is likelihood. © The Willi Hennig Society 2011.

摘要

与之前的标准相比,许多当代系统发育分析中的缺失数据量大幅增加,尤其是在从多个先前分析中汇编性状的超级矩阵研究中。在这种情况下,缺失数据并非随机分布,通常存在于所有抽样的分区(即性状组)中。在这种情况下,参数方法通常比简约法提供更高的分辨率和支持度,但这可能是由于将分支长度从一个分区外推到另一个分区所致。在本研究中,我使用人为设定和模拟的例子来证明,即使将似然法应用于几乎没有或没有同塑性、性状组间进化均匀、模型拟合完美且有数百或数千个可变性状的简单矩阵时,当矩阵在所有分区中具有非随机分布的缺失数据时,似然法也可能为错误的拓扑结构提供有力支持。我通过系统探索七分类单元树拓扑结构的替代方案以及两个分区中缺失数据的分布来做到这一点,以证明这些基于似然法的假象可能经常出现,且简约法不会出现这种情况。我还证明,贝叶斯马尔可夫链蒙特卡罗分析比似然法对这些假象更具稳健性。© 威利·亨尼希协会 2011 年。

相似文献

1
Misleading results of likelihood-based phylogenetic analyses in the presence of missing data.存在缺失数据时基于似然法的系统发育分析的误导性结果。
Cladistics. 2012 Apr;28(2):208-222. doi: 10.1111/j.1096-0031.2011.00375.x. Epub 2011 Oct 3.
2
Radical instability and spurious branch support by likelihood when applied to matrices with non-random distributions of missing data.当应用于具有非随机缺失数据分布的矩阵时,似然法会导致激进的不稳定性和虚假的分支支持。
Mol Phylogenet Evol. 2012 Jan;62(1):472-84. doi: 10.1016/j.ympev.2011.10.017. Epub 2011 Oct 31.
3
Limitations of locally sampled characters in phylogenetic analyses of sparse supermatrices.在稀疏超级矩阵的系统发育分析中,局部分布特征的局限性。
Mol Phylogenet Evol. 2014 May;74:1-14. doi: 10.1016/j.ympev.2014.01.030. Epub 2014 Feb 14.
4
A confounding effect of missing data on character conflict in maximum likelihood and Bayesian MCMC phylogenetic analyses.缺失数据对最大似然法和贝叶斯MCMC系统发育分析中特征冲突的混杂效应。
Mol Phylogenet Evol. 2014 Nov;80:267-80. doi: 10.1016/j.ympev.2014.08.021. Epub 2014 Aug 27.
5
Quantification and relative severity of inflated branch-support values generated by alternative methods: an empirical example.替代方法生成的膨胀分支支持值的量化和相对严重程度:一个实证例子。
Mol Phylogenet Evol. 2013 Apr;67(1):277-96. doi: 10.1016/j.ympev.2013.01.020. Epub 2013 Feb 9.
6
The devil in the details: interactions between the branch-length prior and likelihood model affect node support and branch lengths in the phylogeny of the Psoraceae.细节中的魔鬼:分支长度先验和似然模型之间的相互作用影响了 Psoraceae 系统发育中的节点支持和分支长度。
Syst Biol. 2011 Jul;60(4):541-61. doi: 10.1093/sysbio/syr022. Epub 2011 Mar 24.
7
Effects of data incompleteness on the relative performance of parsimony and Bayesian approaches in a supermatrix phylogenetic reconstruction of Mustelidae and Procyonidae (Carnivora).数据不完整性对鼬科和浣熊科(食肉目)超矩阵系统发育重建中简约法和贝叶斯法相对性能的影响。
Cladistics. 2010 Apr;26(2):168-194. doi: 10.1111/j.1096-0031.2009.00281.x. Epub 2009 Sep 1.
8
Phylogenetic analysis and intraspecific variation: performance of parsimony, likelihood, and distance methods.系统发育分析与种内变异:简约法、似然法和距离法的性能
Syst Biol. 1998 Jun;47(2):228-53. doi: 10.1080/106351598260897.
9
The relative performance of Bayesian and parsimony approaches when sampling characters evolving under homogeneous and heterogeneous sets of parameters.在对在同质和异质参数集下进化的性状进行抽样时,贝叶斯方法和简约法的相对性能。
Cladistics. 2006 Apr;22(2):171-185. doi: 10.1111/j.1096-0031.2006.00098.x.
10
Disparate parametric branch-support values from ambiguous characters.来自模糊特征的不同参数分支支持值。
Mol Phylogenet Evol. 2014 Sep;78:66-86. doi: 10.1016/j.ympev.2014.04.029. Epub 2014 May 10.

引用本文的文献

1
Gentrius: Generating Trees Compatible With a Set of Unrooted Subtrees and its Application to Phylogenetic Terraces.金特里乌斯:生成与一组无根子树兼容的树及其在系统发育阶地中的应用。
Mol Biol Evol. 2024 Nov 1;41(11). doi: 10.1093/molbev/msae219.
2
A Guide to Phylogenomic Inference.系统发育基因组推断指南。
Methods Mol Biol. 2024;2802:267-345. doi: 10.1007/978-1-0716-3838-5_11.
3
Fossilization can mislead analyses of phenotypic disparity.化石可能会误导对表型差异的分析。
Proc Biol Sci. 2023 Aug 9;290(2004):20230522. doi: 10.1098/rspb.2023.0522.
4
Blumea chishangensis sp. nov. (Asteraceae: Inuleae) from Taiwan and new insights into the phylogeny of Blumea.台湾红凤菜新种(菊科:旋覆花族)及红凤菜系统发育新见解
Bot Stud. 2022 Jul 12;63(1):21. doi: 10.1186/s40529-022-00350-z.
5
DNA Barcodes Combined with Multilocus Data of Representative Taxa Can Generate Reliable Higher-Level Phylogenies.DNA 条码与具代表性类群的多位点数据相结合可产生可靠的高级阶系统发育。
Syst Biol. 2022 Feb 10;71(2):382-395. doi: 10.1093/sysbio/syab038.
6
High-throughput methods for efficiently building massive phylogenies from natural history collections.利用自然历史标本馆高效构建大规模系统发育树的高通量方法。
Appl Plant Sci. 2021 Feb 27;9(2):e11410. doi: 10.1002/aps3.11410. eCollection 2021 Feb.
7
An integrative phylogenomic approach to elucidate the evolutionary history and divergence times of Neuropterida (Insecta: Holometabola).神经翅目(昆虫:全变态)的进化历史和分化时间的综合系统基因组学方法。
BMC Evol Biol. 2020 Jun 3;20(1):64. doi: 10.1186/s12862-020-01631-6.
8
Phylogeny of Paleozoic limbed vertebrates reassessed through revision and expansion of the largest published relevant data matrix.通过修订和扩充已发表的最大相关数据矩阵对古生代有肢脊椎动物的系统发育进行重新评估。
PeerJ. 2019 Jan 4;6:e5565. doi: 10.7717/peerj.5565. eCollection 2019.
9
The prevalence of terraced treescapes in analyses of phylogenetic data sets.分析系统发育数据集时,阶地式树景的出现频率。
BMC Evol Biol. 2018 Apr 4;18(1):46. doi: 10.1186/s12862-018-1162-9.
10
Differences between hard and soft phylogenetic data.硬数据和软数据之间的差异。
Proc Biol Sci. 2017 Dec 20;284(1869). doi: 10.1098/rspb.2017.2150.