
A vectorial tree distance measure.

Affiliations

The Mina & Everard Goodman Faculty of Life Sciences, Bar Ilan University, Ramat Gan, Israel.

Faculty of Engineering, Tel-Aviv University, Tel Aviv, Israel.

Publication information

Sci Rep. 2022 Mar 28;12(1):5256. doi: 10.1038/s41598-022-08360-4.

DOI:10.1038/s41598-022-08360-4
PMID:35347186
Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC8960910/
Abstract

A vectorial distance measure for trees is presented. Given two trees, we define a Tree-Alignment (T-Alignment). We T-align the trees from their centers outwards, starting from the root-branches, to make the next level as similar as possible. The algorithm is recursive: conditioned on the T-alignment of the root-branches, we T-align the sub-branches, and thereafter each T-alignment is conditioned on the previous one. We define a minimal T-alignment under a lexicographic order, which follows the intuition that the differences between the two trees constitute a vector. Given such a minimal T-alignment, the difference in the number of branches calculated at any level defines the entry of the distance vector at that level. We compare our algorithm to other well-known tree distance measures in the task of clustering sets of phylogenetic trees. We use the TreeSimGM simulator to generate stochastic phylogenetic trees. The vectorial tree distance (VTD) can successfully separate symmetric from asymmetric trees, and hierarchical from non-hierarchical trees. We also test the algorithm as a classifier of phylogenetic trees extracted from two members of the fungi kingdom, mushrooms and mildews, thus showing that the algorithm can separate real-world phylogenetic trees. The Matlab code is available at https://gitlab.com/avner.priel/vectorial-tree-distance.
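To make the idea of a level-wise distance vector concrete, here is a toy Python sketch. It is not the authors' Matlab implementation, and the abstract does not give enough detail to reproduce their exact definition. The sketch assumes that trees are nested tuples, that an alignment is a pairing of the branches at each node (with missing branches padded by empty subtrees), that the entry at each level is the sum of absolute differences in branch counts over paired nodes, and that the minimum is found by brute force over all pairings. The names `toy_distance_vector`, `add_vectors` and `strip_zeros` are hypothetical.

```python
from itertools import permutations

# Assumed representation: a tree is a tuple of child subtrees; a leaf is ().
EMPTY = ()


def add_vectors(u, v):
    """Add two level vectors entrywise, padding the shorter one with zeros."""
    n = max(len(u), len(v))
    u = u + (0,) * (n - len(u))
    v = v + (0,) * (n - len(v))
    return tuple(a + b for a, b in zip(u, v))


def strip_zeros(v):
    """Drop trailing zeros so tuple comparison matches zero-padded lexicographic order."""
    while v and v[-1] == 0:
        v = v[:-1]
    return v


def toy_distance_vector(t1, t2):
    """Lexicographically smallest vector of per-level branch-count differences.

    Entry k is the summed absolute difference in the number of branches
    between paired nodes at depth k (the root is depth 0). Brute force over
    all pairings, so only suitable for very small trees.
    """
    head = (abs(len(t1) - len(t2)),)
    n = max(len(t1), len(t2))
    # Pad the tree with fewer branches using empty subtrees (an assumption).
    a = list(t1) + [EMPTY] * (n - len(t1))
    b = list(t2) + [EMPTY] * (n - len(t2))
    best = None
    for perm in permutations(b):
        tail = ()
        for x, y in zip(a, perm):
            tail = add_vectors(tail, toy_distance_vector(x, y))
        candidate = strip_zeros(head + tail)
        if best is None or candidate < best:
            best = candidate
    return best


star = ((), (), ())                       # root with three leaves
chain = (((),),)                          # root -> one branch -> one leaf
balanced = (((), ()), ((), ()))           # symmetric tree with four leaves
caterpillar = ((), ((), ((), ())))        # asymmetric tree with four leaves

print(toy_distance_vector(star, star))             # ()
print(toy_distance_vector(star, chain))            # (2, 1)
print(toy_distance_vector(balanced, caterpillar))  # (0, 2, 2)
```

In this toy version the symmetric and asymmetric four-leaf trees agree at the root and differ only at deeper levels, which is the kind of level-by-level information a single scalar distance would collapse.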

Figures

Fig. 1: https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4a94/8960910/2a9a7306295c/41598_2022_8360_Fig1_HTML.jpg
Fig. 2: https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4a94/8960910/8a3a4cd7c682/41598_2022_8360_Fig2_HTML.jpg
Fig. 3: https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4a94/8960910/6ee242c46f27/41598_2022_8360_Fig3_HTML.jpg
Fig. 6: https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4a94/8960910/3aa5eb8d831f/41598_2022_8360_Fig6_HTML.jpg
Fig. 7: https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4a94/8960910/9cadbac450f6/41598_2022_8360_Fig7_HTML.jpg
Fig. 8: https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4a94/8960910/cd8661bc7acb/41598_2022_8360_Fig8_HTML.jpg
