Robust Analysis of Phylogenetic Tree Space.

Affiliation

Department of Earth Sciences, Durham University, Durham, UK.

Publication information

Syst Biol. 2022 Aug 10;71(5):1255-1270. doi: 10.1093/sysbio/syab100.

DOI:10.1093/sysbio/syab100
PMID:34963003
Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC9366458/
Abstract

Phylogenetic analyses often produce large numbers of trees. Mapping trees' distribution in "tree space" can illuminate the behavior and performance of search strategies, reveal distinct clusters of optimal trees, and expose differences between different data sources or phylogenetic methods, but the high-dimensional spaces defined by metric distances are necessarily distorted when represented in fewer dimensions. Here, I explore the consequences of this transformation in phylogenetic search results from 128 morphological data sets, using stratigraphic congruence (a complementary aspect of tree similarity) to evaluate the utility of low-dimensional mappings. I find that phylogenetic similarities between cladograms are most accurately depicted in tree spaces derived from information-theoretic tree distances or the quartet distance. Robinson-Foulds tree spaces exhibit prominent distortions and often fail to group trees according to phylogenetic similarity, whereas the strong influence of tree shape on the Kendall-Colijn distance makes its tree space unsuitable for many purposes. Distances mapped into two or even three dimensions often display little correspondence with true distances, which can lead to profound misrepresentation of clustering structure. Without explicit testing, one cannot be confident that a tree space mapping faithfully represents the true distribution of trees, nor that visually evident structure is valid. My recommendations for tree space validation and visualization are implemented in a new graphical user interface in the "TreeDist" R package.

Keywords: multidimensional scaling; phylogenetic software; tree distance metrics; treespace projections.
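The abstract's warning that mapped distances may show little correspondence with true distances can be illustrated with a small sketch. This is not the paper's validation procedure and does not use the TreeDist API; it is a minimal Python example under stated assumptions. The toy trees, the coordinates, and the function names `rf_distance`, `pearson`, and `mapping_correlation` are all hypothetical. The example computes Robinson-Foulds distances between three five-leaf trees, then correlates those distances with the Euclidean distances in two candidate one-dimensional mappings.

```python
from itertools import combinations
from math import dist, sqrt


def rf_distance(splits_a, splits_b):
    """Robinson-Foulds distance: count of splits found in exactly one tree."""
    return len(splits_a ^ splits_b)


def pearson(xs, ys):
    """Pearson correlation coefficient of two equal-length sequences."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    return cov / sqrt(var_x * var_y)


def mapping_correlation(trees, coords):
    """Correlate original tree distances with distances in a mapping."""
    pairs = list(combinations(range(len(trees)), 2))
    original = [rf_distance(trees[i], trees[j]) for i, j in pairs]
    mapped = [dist(coords[i], coords[j]) for i, j in pairs]
    return pearson(original, mapped)


# Toy data (assumption): unrooted trees on taxa a-e, each stored as the set
# of its non-trivial splits, with every split recorded by its two-taxon side.
trees = [
    {frozenset("ab"), frozenset("de")},
    {frozenset("ab"), frozenset("cd")},
    {frozenset("ac"), frozenset("de")},
]

# Two hypothetical one-dimensional mappings of the same three trees.
faithful = [(0.0,), (2.0,), (-2.0,)]  # preserves all pairwise distances
collapsed = [(0.0,), (2.0,), (2.5,)]  # places the two most distant trees close

print("faithful mapping:", mapping_correlation(trees, faithful))
print("collapsed mapping:", mapping_correlation(trees, collapsed))
```

A correlation near 1 suggests that the mapping preserves the relative ordering and scale of the tree distances; a low or negative value signals that apparent proximity in the plot should not be trusted. A single correlation coefficient is a coarse summary, so it is best read as a first check rather than a full assessment of mapping quality.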

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c225/9366458/136a25368293/syab100f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c225/9366458/6d5722e2d051/syab100f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c225/9366458/77762c419055/syab100f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c225/9366458/6c023f55c693/syab100f4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c225/9366458/eb673b7edb40/syab100f5.jpg
