The effect of ambiguous data on phylogenetic estimates obtained by maximum likelihood and Bayesian inference.

Affiliation

Section of Integrative Biology, University of Texas at Austin, 1 University Station C0930, Austin, TX 78712, USA.

Publication information

Syst Biol. 2009 Feb;58(1):130-45. doi: 10.1093/sysbio/syp017. Epub 2009 May 22.


DOI: 10.1093/sysbio/syp017
PMID: 20525573
Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC7539334/
Abstract

Although an increasing number of phylogenetic data sets are incomplete, the effect of ambiguous data on phylogenetic accuracy is not well understood. We use 4-taxon simulations to study the effects of ambiguous data (i.e., missing characters or gaps) in maximum likelihood (ML) and Bayesian frameworks. By introducing ambiguous data in a way that removes confounding factors, we provide the first clear understanding of 1 mechanism by which ambiguous data can mislead phylogenetic analyses. We find that in both ML and Bayesian frameworks, among-site rate variation can interact with ambiguous data to produce misleading estimates of topology and branch lengths. Furthermore, within a Bayesian framework, priors on branch lengths and rate heterogeneity parameters can exacerbate the effects of ambiguous data, resulting in strongly misleading bipartition posterior probabilities. The magnitude and direction of the ambiguous data bias are a function of the number and taxonomic distribution of ambiguous characters, the strength of topological support, and whether or not the model is correctly specified. The results of this study have major implications for all analyses that rely on accurate estimates of topology or branch lengths, including divergence time estimation, ancestral state reconstruction, tree-dependent comparative methods, rate variation analysis, phylogenetic hypothesis testing, and phylogeographic analysis.
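
The abstract does not say how ambiguous characters enter the likelihood. As a minimal sketch, assuming the standard convention in which a missing character or gap is treated as compatible with every state, the site likelihood on a 4-taxon tree can be computed as below. The JC69 model, the function names, and the branch lengths are illustrative assumptions and are not taken from the paper. The sketch also omits among-site rate variation, which is the factor the study reports as interacting with ambiguous data.

```python
import itertools
import math

STATES = "ACGT"

def jc69_prob(same, t):
    """JC69 transition probability over a branch of length t (expected substitutions per site)."""
    e = math.exp(-4.0 * t / 3.0)
    return 0.25 + 0.75 * e if same else 0.25 - 0.25 * e

def tip_vector(char):
    """Partial likelihoods at a tip; '?' or '-' is compatible with every state (assumed convention)."""
    if char in "?-":
        return [1.0] * 4
    return [1.0 if s == char else 0.0 for s in STATES]

def branch_message(partial, t):
    """Likelihood of the data below a branch, for each state at the branch's upper end."""
    return [sum(jc69_prob(i == j, t) * partial[j] for j in range(4)) for i in range(4)]

def site_likelihood(site, tip_lengths, internal_length):
    """Likelihood of one site on the unrooted 4-taxon tree ((A,B),(C,D))."""
    a, b, c, d = (branch_message(tip_vector(ch), t) for ch, t in zip(site, tip_lengths))
    left = [a[i] * b[i] for i in range(4)]    # data at A and B, given the state at their shared node
    right = [c[i] * d[i] for i in range(4)]   # data at C and D, given the state at their shared node
    across = branch_message(right, internal_length)
    return sum(0.25 * left[i] * across[i] for i in range(4))  # uniform JC69 root frequencies

# Hypothetical branch lengths, chosen only for illustration.
tips = [0.1, 0.2, 0.3, 0.4]
ambiguous = site_likelihood("AC?G", tips, 0.05)
resolved = sum(site_likelihood("AC" + x + "G", tips, 0.05) for x in STATES)
print(ambiguous, resolved)
```

Under this convention the likelihood of a site with an ambiguous tip equals the sum of the likelihoods over every possible resolution of that tip, so the ambiguous character contributes no direct evidence about its own state.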

Similar articles

[1]
The effect of ambiguous data on phylogenetic estimates obtained by maximum likelihood and Bayesian inference.

Syst Biol. 2009-5-22

[2]
Branch length estimation and divergence dating: estimates of error in Bayesian and maximum likelihood frameworks.

BMC Evol Biol. 2010-1-11

[3]
Bayesian and maximum likelihood phylogenetic analyses of protein sequence data under relative branch-length differences and model violation.

BMC Evol Biol. 2005-1-28

[4]
The devil in the details: interactions between the branch-length prior and likelihood model affect node support and branch lengths in the phylogeny of the Psoraceae.

Syst Biol. 2011-3-24

[5]
Robustness of compound Dirichlet priors for Bayesian inference of branch lengths.

Syst Biol. 2012-2-10

[6]
A confounding effect of missing data on character conflict in maximum likelihood and Bayesian MCMC phylogenetic analyses.

Mol Phylogenet Evol. 2014-11

[7]
Tail paradox, partial identifiability, and influential priors in Bayesian branch length inference.

Mol Biol Evol. 2011-9-2

[8]
Assessment of substitution model adequacy using frequentist and Bayesian methods.

Mol Biol Evol. 2010-7-8

[9]
Missing data in phylogenetic analysis: reconciling results from simulations and empirical data.

Syst Biol. 2011-10

[10]
Impact of missing data on phylogenies inferred from empirical phylogenomic data sets.

Mol Biol Evol. 2012-8-28

Cited by

[1]
Teasing apart the sources of phylogenetic tree discordance across three genomes in the oak family (Fagaceae).

BMC Plant Biol. 2025-7-17

[2]
Metagenomic Identification of Fusarium solani Strain as Cause of US Fungal Meningitis Outbreak Associated with Surgical Procedures in Mexico, 2023.

Emerg Infect Dis. 2025-5

[3]
Evolutionary and epidemic dynamics of COVID-19 in Germany exemplified by three Bayesian phylodynamic case studies.

Bioinform Biol Insights. 2025-3-12

[4]
Exploring SNP filtering strategies: the influence of strict vs soft core.

Microb Genom. 2025-1

[5]
Data-driven guidelines for phylogenomic analyses using SNP data.

Appl Plant Sci. 2024-8-9

[6]
16S rRNA phylogeny and clustering is not a reliable proxy for genome-based taxonomy in .

Microb Genom. 2024-9

[7]
Using de novo transcriptomes to decipher the relationships in cutthroat trout subspecies ().

Evol Appl. 2024-7-11

[8]
A Guide to Phylogenomic Inference.

Methods Mol Biol. 2024

[9]
Evolutionary history of arbuscular mycorrhizal fungi and genomic signatures of obligate symbiosis.

BMC Genomics. 2024-5-29

[10]
Central African dwarf crocodiles found in syntopy are comparably divergent to South American dwarf caimans.

Biol Lett. 2024-5
