• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

贝叶斯因子揭示了系统发育基因组分析中高度可变的信息内容、偏差和极端影响。

Bayes Factors Unmask Highly Variable Information Content, Bias, and Extreme Influence in Phylogenomic Analyses.

作者信息

Brown Jeremy M, Thomson Robert C

机构信息

Department of Biological Sciences and Museum of Natural Science, Louisiana State University, 202 Life Science Building, Baton Rouge, LA 70803, USA.

Department of Biology, University of Hawaíi at Manoa, 2538 McCarthy Mall, Edmondson Hall Rm 216, Honolulu, HI 96822, USA.

出版信息

Syst Biol. 2017 Jul 1;66(4):517-530. doi: 10.1093/sysbio/syw101.

DOI:10.1093/sysbio/syw101
PMID:28003531
Abstract

As the application of genomic data in phylogenetics has become routine, a number of cases have arisen where alternative data sets strongly support conflicting conclusions. This sensitivity to analytical decisions has prevented firm resolution of some of the most recalcitrant nodes in the tree of life. To better understand the causes and nature of this sensitivity, we analyzed several phylogenomic data sets using an alternative measure of topological support (the Bayes factor) that both demonstrates and averts several limitations of more frequently employed support measures (such as Markov chain Monte Carlo estimates of posterior probabilities). Bayes factors reveal important, previously hidden, differences across six "phylogenomic" data sets collected to resolve the phylogenetic placement of turtles within Amniota. These data sets vary substantially in their support for well-established amniote relationships, particularly in the proportion of genes that contain extreme amounts of information as well as the proportion that strongly reject these uncontroversial relationships. All six data sets contain little information to resolve the phylogenetic placement of turtles relative to other amniotes. Bayes factors also reveal that a very small number of extremely influential genes (less than 1% of genes in a data set) can fundamentally change significant phylogenetic conclusions. In one example, these genes are shown to contain previously unrecognized paralogs. This study demonstrates both that the resolution of difficult phylogenomic problems remains sensitive to seemingly minor analysis details and that Bayes factors are a valuable tool for identifying and solving these challenges.

摘要

随着基因组数据在系统发育学中的应用已成为常规操作,出现了一些情况,即不同的数据集强烈支持相互矛盾的结论。这种对分析决策的敏感性阻碍了生命之树中一些最棘手节点的确定解决。为了更好地理解这种敏感性的原因和本质,我们使用了一种拓扑支持的替代度量(贝叶斯因子)来分析几个系统发育基因组数据集,该度量既展示了又避免了更常用支持度量的一些局限性(例如后验概率的马尔可夫链蒙特卡罗估计)。贝叶斯因子揭示了为解决龟在羊膜动物中的系统发育位置而收集的六个“系统发育基因组”数据集之间重要的、以前未被发现的差异。这些数据集在对既定的羊膜动物关系的支持上有很大差异,特别是在包含大量信息的基因比例以及强烈拒绝这些无争议关系的基因比例方面。所有六个数据集几乎没有信息来解决龟相对于其他羊膜动物的系统发育位置。贝叶斯因子还表明,极少数极具影响力的基因(占数据集中基因的不到1%)可以从根本上改变重要的系统发育结论。在一个例子中,这些基因被证明包含以前未被识别的旁系同源物。这项研究表明,困难的系统发育基因组问题的解决仍然对看似微小的分析细节敏感,并且贝叶斯因子是识别和解决这些挑战的宝贵工具。

相似文献

1
Bayes Factors Unmask Highly Variable Information Content, Bias, and Extreme Influence in Phylogenomic Analyses.贝叶斯因子揭示了系统发育基因组分析中高度可变的信息内容、偏差和极端影响。
Syst Biol. 2017 Jul 1;66(4):517-530. doi: 10.1093/sysbio/syw101.
2
From phylogenetics to phylogenomics: the evolutionary relationships of insect endosymbiotic gamma-Proteobacteria as a test case.从系统发育学到系统基因组学:以昆虫内共生γ-变形菌的进化关系为例
Syst Biol. 2007 Feb;56(1):1-16. doi: 10.1080/10635150601109759.
3
Model Choice, Missing Data, and Taxon Sampling Impact Phylogenomic Inference of Deep Basidiomycota Relationships.模型选择、缺失数据和分类群采样对深担子菌系统发育关系的基因组推断的影响。
Syst Biol. 2020 Jan 1;69(1):17-37. doi: 10.1093/sysbio/syz029.
4
Bayes or bootstrap? A simulation study comparing the performance of Bayesian Markov chain Monte Carlo sampling and bootstrapping in assessing phylogenetic confidence.贝叶斯法还是自助法?一项比较贝叶斯马尔可夫链蒙特卡罗抽样和自助法在评估系统发育置信度时性能的模拟研究。
Mol Biol Evol. 2003 Feb;20(2):255-66. doi: 10.1093/molbev/msg028.
5
Broad phylogenomic sampling improves resolution of the animal tree of life.广泛的系统发育基因组采样提高了动物生命树的分辨率。
Nature. 2008 Apr 10;452(7188):745-9. doi: 10.1038/nature06614. Epub 2008 Mar 5.
6
Searching for convergence in phylogenetic Markov chain Monte Carlo.在系统发育马尔可夫链蒙特卡罗方法中寻找收敛性。
Syst Biol. 2006 Aug;55(4):553-65. doi: 10.1080/10635150600812544.
7
Integration of morphological data sets for phylogenetic analysis of Amniota: the importance of integumentary characters and increased taxonomic sampling.用于羊膜动物系统发育分析的形态数据集整合:皮肤特征的重要性及分类群抽样的增加
Syst Biol. 2005 Aug;54(4):530-47. doi: 10.1080/10635150590950326.
8
An Efficient Independence Sampler for Updating Branches in Bayesian Markov chain Monte Carlo Sampling of Phylogenetic Trees.一种用于在系统发育树的贝叶斯马尔可夫链蒙特卡罗采样中更新分支的高效独立采样器。
Syst Biol. 2016 Jan;65(1):161-76. doi: 10.1093/sysbio/syv051. Epub 2015 Jul 30.
9
The impact of GC bias on phylogenetic accuracy using targeted enrichment phylogenomic data.使用靶向富集系统发育基因组数据时,GC偏差对系统发育准确性的影响。
Mol Phylogenet Evol. 2017 Jun;111:149-157. doi: 10.1016/j.ympev.2017.03.022. Epub 2017 Apr 5.
10
Guided tree topology proposals for Bayesian phylogenetic inference.贝叶斯系统发育推断的引导树拓扑提议。
Syst Biol. 2012 Jan;61(1):1-11. doi: 10.1093/sysbio/syr074. Epub 2011 Aug 9.

引用本文的文献

1
Gene characterization of Willd. and L. accessions: unmasking genetic diversity.野生种和栽培种的基因特征分析:揭示遗传多样性
3 Biotech. 2025 Jan;15(1):9. doi: 10.1007/s13205-024-04173-6. Epub 2024 Dec 15.
2
Phylogenetic Tree Instability After Taxon Addition: Empirical Frequency, Predictability, and Consequences For Online Inference.分类群添加后的系统发育树不稳定性:在线推断的经验频率、可预测性及后果
Syst Biol. 2025 Feb 10;74(1):101-111. doi: 10.1093/sysbio/syae059.
3
Discovering Fragile Clades and Causal Sequences in Phylogenomics by Evolutionary Sparse Learning.
通过进化稀疏学习在系统基因组学中发现脆弱的进化枝和因果序列。
Mol Biol Evol. 2024 Jul 3;41(7). doi: 10.1093/molbev/msae131.
4
Tip dating and Bayes factors provide insight into the divergences of crown bird clades across the end-Cretaceous mass extinction.支序年代测定和贝叶斯因子为研究白垩纪末大灭绝事件期间冠群鸟类支系的分歧提供了线索。
Proc Biol Sci. 2024 Feb 14;291(2016):20232618. doi: 10.1098/rspb.2023.2618.
5
Online tree expansion could help solve the problem of scalability in Bayesian phylogenetics.在线树扩展可以帮助解决贝叶斯系统发生学中的可扩展性问题。
Syst Biol. 2023 Nov 1;72(5):1199-1206. doi: 10.1093/sysbio/syad045.
6
Effect of Different Types of Sequence Data on Palaeognath Phylogeny.不同类型序列数据对古颌鸟类系统发育的影响。
Genome Biol Evol. 2023 Jun 1;15(6). doi: 10.1093/gbe/evad092.
7
Identifying the Best Approximating Model in Bayesian Phylogenetics: Bayes Factors, Cross-Validation or wAIC?贝叶斯系统发生学中最佳逼近模型的识别:贝叶斯因子、交叉验证还是 wAIC?
Syst Biol. 2023 Jun 17;72(3):616-638. doi: 10.1093/sysbio/syad004.
8
Phylogenomics of trans-Andean tetras of the genus Hyphessobrycon Durbin 1908 (Stethaprioninae: Characidae) and colonization patterns of Middle America.跨安第斯山脉 Hyphessobrycon Durbin 1908 属四齿鲀的系统基因组学(Stethaprioninae:Characidae)与中美洲的殖民模式。
PLoS One. 2023 Jan 20;18(1):e0279924. doi: 10.1371/journal.pone.0279924. eCollection 2023.
9
Improving Orthologous Signal and Model Fit in Datasets Addressing the Root of the Animal Phylogeny.提高解决动物系统发育根源问题的数据集的直系同源信号和模型拟合度。
Mol Biol Evol. 2023 Jan 4;40(1). doi: 10.1093/molbev/msac276.
10
Target capture data resolve recalcitrant relationships in the coffee family (Rubioideae, Rubiaceae).靶向捕获数据解析了茜草科咖啡亚科中难以解决的系统发育关系。
Front Plant Sci. 2022 Sep 8;13:967456. doi: 10.3389/fpls.2022.967456. eCollection 2022.