• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

带标签树的广义罗宾逊-福尔兹距离。

A generalized Robinson-Foulds distance for labeled trees.

作者信息

Briand Samuel, Dessimoz Christophe, El-Mabrouk Nadia, Lafond Manuel, Lobinska Gabriela

机构信息

Computer Science Department, Université de Montréal, Montreal, Canada.

Department of Computational Biology, University of Lausanne, Lausanne, Switzerland.

出版信息

BMC Genomics. 2020 Nov 18;21(Suppl 10):779. doi: 10.1186/s12864-020-07011-0.

DOI:10.1186/s12864-020-07011-0
PMID:33208096
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC7677779/
Abstract

BACKGROUND

The Robinson-Foulds (RF) distance is a well-established measure between phylogenetic trees. Despite a lack of biological justification, it has the advantages of being a proper metric and being computable in linear time. For phylogenetic applications involving genes, however, a crucial aspect of the trees ignored by the RF metric is the type of the branching event (e.g. speciation, duplication, transfer, etc).

RESULTS

We extend RF to trees with labeled internal nodes by including a node flip operation, alongside edge contractions and extensions. We explore properties of this extended RF distance in the case of a binary labeling. In particular, we show that contrary to the unlabeled case, an optimal edit path may require contracting "good" edges, i.e. edges shared between the two trees.

CONCLUSIONS

We provide a 2-approximation algorithm which is shown to perform well empirically. Looking ahead, computing distances between labeled trees opens up a variety of new algorithmic directions.Implementation and simulations available at https://github.com/DessimozLab/pylabeledrf .

摘要

背景

罗宾逊 - 福尔兹(RF)距离是系统发育树之间一种成熟的度量方法。尽管缺乏生物学依据,但它具有是一种恰当度量且可在线性时间内计算的优点。然而,对于涉及基因的系统发育应用,RF度量忽略的系统发育树的一个关键方面是分支事件的类型(例如物种形成、基因复制、基因转移等)。

结果

我们通过纳入节点翻转操作以及边收缩和边扩展,将RF扩展到具有标记内部节点的树。我们在二元标记的情况下探索这种扩展的RF距离的性质。特别地,我们表明与未标记的情况相反,最优编辑路径可能需要收缩“好”边,即两棵树之间共享的边。

结论

我们提供了一种2近似算法,经实证表明其性能良好。展望未来,计算标记树之间的距离开辟了各种新的算法方向。实现和模拟可在https://github.com/DessimozLab/pylabeledrf获取。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a911/7677779/f58c88dfae67/12864_2020_7011_Fig7_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a911/7677779/f565283814f2/12864_2020_7011_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a911/7677779/399855af399a/12864_2020_7011_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a911/7677779/7bf5ff4735bc/12864_2020_7011_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a911/7677779/d431c7ff9ab8/12864_2020_7011_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a911/7677779/1781a7a865d8/12864_2020_7011_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a911/7677779/f71a46c99e2b/12864_2020_7011_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a911/7677779/f58c88dfae67/12864_2020_7011_Fig7_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a911/7677779/f565283814f2/12864_2020_7011_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a911/7677779/399855af399a/12864_2020_7011_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a911/7677779/7bf5ff4735bc/12864_2020_7011_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a911/7677779/d431c7ff9ab8/12864_2020_7011_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a911/7677779/1781a7a865d8/12864_2020_7011_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a911/7677779/f71a46c99e2b/12864_2020_7011_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a911/7677779/f58c88dfae67/12864_2020_7011_Fig7_HTML.jpg

相似文献

1
A generalized Robinson-Foulds distance for labeled trees.带标签树的广义罗宾逊-福尔兹距离。
BMC Genomics. 2020 Nov 18;21(Suppl 10):779. doi: 10.1186/s12864-020-07011-0.
2
A Linear Time Solution to the Labeled Robinson-Foulds Distance Problem.线性时间解决带标签的罗宾逊-福尔德斯距离问题。
Syst Biol. 2022 Oct 12;71(6):1391-1403. doi: 10.1093/sysbio/syac028.
3
The -Robinson-Foulds Dissimilarity Measures for Comparison of Labeled Trees.用于比较带标签树的罗宾逊 - 福尔兹差异度量
J Comput Biol. 2024 Apr;31(4):328-344. doi: 10.1089/cmb.2023.0312. Epub 2024 Jan 25.
4
Invariant transformers of Robinson and Foulds distance matrices for Convolutional Neural Network.不变的 Robinson 和 Foulds 距离矩阵变换用于卷积神经网络。
J Bioinform Comput Biol. 2022 Aug;20(4):2250012. doi: 10.1142/S0219720022500123. Epub 2022 Jul 6.
5
The Generalized Robinson-Foulds Distance for Phylogenetic Trees.系统发育树的广义 Robinson-Foulds 距离。
J Comput Biol. 2021 Dec;28(12):1181-1195. doi: 10.1089/cmb.2021.0342. Epub 2021 Oct 29.
6
The Connection of the Generalized Robinson-Foulds Metric with Partial Wiener Indices.广义 Robinson-Foulds 度量与部分 Wiener 指标的联系。
Acta Biotheor. 2023 Jan 25;71(1):5. doi: 10.1007/s10441-023-09457-7.
7
An exact algorithm for the geodesic distance between phylogenetic trees.一种用于计算系统发育树之间测地距离的精确算法。
J Comput Biol. 2008 Jul-Aug;15(6):577-91. doi: 10.1089/cmb.2008.0068.
8
Linear-time algorithms for phylogenetic tree completion under Robinson-Foulds distance.基于罗宾逊-福尔兹距离的系统发育树补全的线性时间算法。
Algorithms Mol Biol. 2020 Apr 13;15:6. doi: 10.1186/s13015-020-00166-1. eCollection 2020.
9
Efficiently computing the Robinson-Foulds metric.高效计算罗宾逊-福尔兹度量。
J Comput Biol. 2007 Jul-Aug;14(6):724-35. doi: 10.1089/cmb.2007.R012.
10
An efficient algorithm for approximating geodesic distances in tree space.一种用于逼近树空间测地距离的有效算法。
IEEE/ACM Trans Comput Biol Bioinform. 2011 Sep-Oct;8(5):1196-207. doi: 10.1109/TCBB.2010.121.

引用本文的文献

1
The path-label reconciliation (PLR) dissimilarity measure for gene trees.用于基因树的路径标签协调(PLR)差异度量。
Algorithms Mol Biol. 2025 Aug 19;20(1):16. doi: 10.1186/s13015-025-00284-8.
2
Systematic analysis of some Astereae (Asteraceae) species by Integrating pollen morphology and molecular evidence.通过整合花粉形态学和分子证据对一些紫菀族(菊科)物种进行系统分析。
Front Plant Sci. 2025 Apr 8;16:1558995. doi: 10.3389/fpls.2025.1558995. eCollection 2025.
3
Oncotree2vec - a method for embedding and clustering of tumor mutation trees.

本文引用的文献

1
A Metric on Phylogenetic Tree Shapes.系统发育树形状的一种度量。
Syst Biol. 2018 Jan 1;67(1):113-126. doi: 10.1093/sysbio/syx046.
2
Standardized benchmarking in the quest for orthologs.寻找直系同源基因过程中的标准化基准测试。
Nat Methods. 2016 May;13(5):425-30. doi: 10.1038/nmeth.3830. Epub 2016 Apr 4.
3
Orthology and paralogy constraints: satisfiability and consistency.直系同源和旁系同源约束:可满足性和一致性。
Oncotree2vec - 一种用于肿瘤突变树嵌入和聚类的方法。
Bioinformatics. 2024 Jun 28;40(Suppl 1):i180-i188. doi: 10.1093/bioinformatics/btae214.
4
AGO, a Framework for the Reconstruction of Ancestral Syntenies and Gene Orders.AGO,用于重建祖先同线性和基因顺序的框架。
Methods Mol Biol. 2024;2802:247-265. doi: 10.1007/978-1-0716-3838-5_10.
5
The -Robinson-Foulds Dissimilarity Measures for Comparison of Labeled Trees.用于比较带标签树的罗宾逊 - 福尔兹差异度量
J Comput Biol. 2024 Apr;31(4):328-344. doi: 10.1089/cmb.2023.0312. Epub 2024 Jan 25.
6
Mining for a New Class of Fungal Natural Products: The Evolution, Diversity, and Distribution of Isocyanide Synthase Biosynthetic Gene Clusters.挖掘新型真菌天然产物:异氰酸酯合酶生物合成基因簇的进化、多样性及分布
bioRxiv. 2023 Apr 18:2023.04.17.537281. doi: 10.1101/2023.04.17.537281.
7
The evolution of morphological development is congruent with the species phylogeny in the genus .形态发育的进化与该属中的物种系统发育一致。
Front Microbiol. 2023 Mar 29;14:1102250. doi: 10.3389/fmicb.2023.1102250. eCollection 2023.
8
Unsuspected Clonal Spread of Methicillin-Resistant Staphylococcus aureus Causing Bloodstream Infections in Hospitalized Adults Detected Using Whole Genome Sequencing.全基因组测序检测到耐甲氧西林金黄色葡萄球菌引起的住院成年患者血流感染的未被怀疑的克隆传播。
Clin Infect Dis. 2022 Dec 19;75(12):2104-2112. doi: 10.1093/cid/ciac339.
9
A Linear Time Solution to the Labeled Robinson-Foulds Distance Problem.线性时间解决带标签的罗宾逊-福尔德斯距离问题。
Syst Biol. 2022 Oct 12;71(6):1391-1403. doi: 10.1093/sysbio/syac028.
10
The Generalized Robinson-Foulds Distance for Phylogenetic Trees.系统发育树的广义 Robinson-Foulds 距离。
J Comput Biol. 2021 Dec;28(12):1181-1195. doi: 10.1089/cmb.2021.0342. Epub 2021 Oct 29.
BMC Genomics. 2014;15 Suppl 6(Suppl 6):S12. doi: 10.1186/1471-2164-15-S6-S12. Epub 2014 Oct 17.
4
TreeFam v9: a new website, more species and orthology-on-the-fly.TreeFam v9:一个新网站,更多的物种和即时同源性。
Nucleic Acids Res. 2014 Jan;42(Database issue):D922-5. doi: 10.1093/nar/gkt1055. Epub 2013 Nov 4.
5
Fast local search for unrooted Robinson-Foulds supertrees.无根 Robinson-Foulds 超级树的快速局部搜索。
IEEE/ACM Trans Comput Biol Bioinform. 2012 Jul-Aug;9(4):1004-13. doi: 10.1109/TCBB.2012.47.
6
A metric for phylogenetic trees based on matching.基于匹配的系统发育树度量标准。
IEEE/ACM Trans Comput Biol Bioinform. 2012 Jul-Aug;9(4):1014-22. doi: 10.1109/TCBB.2011.157.
7
Nodal distances for rooted phylogenetic trees.有根系统发育树的节点距离。
J Math Biol. 2010 Aug;61(2):253-276. doi: 10.1007/s00285-009-0295-2. Epub 2009 Sep 16.
8
Computing the distribution of a tree metric.计算树度量的分布。
IEEE/ACM Trans Comput Biol Bioinform. 2009 Jul-Sep;6(3):420-6. doi: 10.1109/TCBB.2009.32.
9
SPR distance computation for unrooted trees.无根树的 SPR 距离计算。
Evol Bioinform Online. 2008 Feb 9;4:17-27. doi: 10.4137/ebo.s419.
10
EnsemblCompara GeneTrees: Complete, duplication-aware phylogenetic trees in vertebrates.EnsemblCompara基因树:脊椎动物中完整的、可识别基因复制的系统发育树。
Genome Res. 2009 Feb;19(2):327-35. doi: 10.1101/gr.073585.107. Epub 2008 Nov 24.