Phylogenetic Tree Instability After Taxon Addition: Empirical Frequency, Predictability, and Consequences For Online Inference.

Authors

Collienne Lena, Barker Mary, Suchard Marc A, Matsen Frederick A

Affiliations

Computational Biology Program, Fred Hutchinson Cancer Research Center, 1100 Fairview Ave N, Seattle, WA 98109, USA.

Howard Hughes Medical Institute, Fred Hutchinson Cancer Research Center, 1100 Fairview Ave N, Seattle, WA 98109, USA.

Publication information

Syst Biol. 2025 Feb 10;74(1):101-111. doi: 10.1093/sysbio/syae059.

DOI:10.1093/sysbio/syae059
PMID:39453463
Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC11809580/
Abstract

Online phylogenetic inference methods add sequentially arriving sequences to an inferred phylogeny without the need to recompute the entire tree from scratch. Some online method implementations exist already, but there remains concern that additional sequences may change the topological relationship among the original set of taxa. We call such a change in tree topology a lack of stability for the inferred tree. In this article, we analyze the stability of single taxon addition in a Maximum Likelihood framework across 1000 empirical datasets. We find that instability occurs in almost 90% of our examples, although observed topological differences do not always reach significance under the approximately unbiased (AU) test. Changes in tree topology after addition of a taxon rarely occur close to its attachment location, and are more frequently observed in more distant tree locations carrying low bootstrap support. To investigate whether instability is predictable, we hypothesize sources of instability and design summary statistics addressing these hypotheses. Using these summary statistics as input features for machine learning under random forests, we are able to predict instability and can identify the most influential features. In summary, it does not appear that a strict insertion-only online inference method will deliver globally optimal trees, although relaxing insertion strictness by allowing for a small number of final tree rearrangements or accepting slightly suboptimal solutions appears feasible.
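
The notion of stability defined in the abstract can be illustrated with a small sketch. The Python code below is not the authors' pipeline, which relies on Maximum Likelihood inference and the AU test. It only shows, under simplifying assumptions, how one might check whether the topology on the original taxa changed after a taxon was added. Trees are represented as nested tuples of leaf names, and the function names (`leaves`, `prune`, `splits`, `is_unstable`) are hypothetical choices made for this illustration.

```python
from typing import FrozenSet, Optional, Set, Tuple, Union

# Assumption: a tree is either a leaf name (str) or a tuple of subtrees.
Tree = Union[str, Tuple["Tree", ...]]


def leaves(tree: Tree) -> FrozenSet[str]:
    """Return the set of leaf names below (and including) this node."""
    if isinstance(tree, str):
        return frozenset([tree])
    return frozenset().union(*(leaves(child) for child in tree))


def prune(tree: Tree, taxon: str) -> Optional[Tree]:
    """Remove one taxon and suppress any node left with a single child."""
    if isinstance(tree, str):
        return None if tree == taxon else tree
    kept = [p for p in (prune(child, taxon) for child in tree) if p is not None]
    if not kept:
        return None
    if len(kept) == 1:
        return kept[0]
    return tuple(kept)


def splits(tree: Tree) -> Set[FrozenSet[str]]:
    """Non-trivial bipartitions of the tree viewed as unrooted.

    Each bipartition is stored as the side that does NOT contain a fixed
    reference taxon, so the result is independent of the rooting.
    """
    all_taxa = leaves(tree)
    ref = min(all_taxa)
    found: Set[FrozenSet[str]] = set()

    def visit(node: Tree) -> FrozenSet[str]:
        if isinstance(node, str):
            return frozenset([node])
        below = frozenset().union(*(visit(child) for child in node))
        side = all_taxa - below if ref in below else below
        if 1 < len(side) < len(all_taxa) - 1:
            found.add(side)
        return below

    visit(tree)
    return found


def is_unstable(original: Tree, extended: Tree, new_taxon: str) -> bool:
    """True if removing new_taxon from `extended` does not recover `original`."""
    restricted = prune(extended, new_taxon)
    return splits(original) != splits(restricted)


# Toy example: four original taxa, taxon "E" is added afterwards.
original = (("A", "B"), ("C", "D"))
stable_extension = (("A", ("B", "E")), ("C", "D"))
unstable_extension = ((("A", "C"), "E"), ("B", "D"))
```

Here `is_unstable(original, stable_extension, "E")` compares the original tree against the extended tree with `E` removed. A mismatch in the split sets corresponds to a nonzero Robinson-Foulds distance between the two trees, which is one common way to quantify a topological change. Whether such a change is statistically significant is a separate question that this sketch does not address.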

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/dfeb/11809580/0ec528c4d4ec/syae059_fig1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/dfeb/11809580/c769d64a1c88/syae059_fig2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/dfeb/11809580/4e22b4cedf53/syae059_fig3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/dfeb/11809580/e8c9100ee6da/syae059_fig4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/dfeb/11809580/3484ddbd47a8/syae059_fig5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/dfeb/11809580/9be828bdb7d3/syae059_fig6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/dfeb/11809580/c4893bb6148c/syae059_fig7.jpg
