Weighted quartets phylogenetics.

Authors

Avni Eliran, Cohen Reuven, Snir Sagi

Affiliations

Department of Evolutionary Biology, University of Haifa, Haifa 31905, Israel; School of Engineering, Kinneret College, 15132, Israel; and Department of Evolutionary Biology, University of Haifa, Haifa 31905, Israel.

Publication information

Syst Biol. 2015 Mar;64(2):233-42. doi: 10.1093/sysbio/syu087. Epub 2014 Nov 19.

Abstract

Despite impressive technical and theoretical developments, reconstruction of phylogenetic trees for enormous quantities of molecular data is still a challenging task. A key tool in analyses of large data sets has been the construction of separate trees for subsets (e.g., quartets) of sequences, and subsequent combination of these subtrees into a single tree for the full set (i.e., supertree analysis). Unfortunately, even amalgamating quartets into a supertree remains a computationally daunting task. Assigning weights to quartets to indicate importance or reliability was proposed more than a decade ago, but handling weighted quartets is even more challenging and has scarcely been attempted in the past. In this work, we focus on weighted quartet-based approaches. We propose a scheme to assign weights to quartets coming from weighted trees and devise a tree similarity measure for weighted trees based on weighted quartets. We also extend the quartet MaxCut (QMC) algorithm to handle weighted quartets. We evaluate these tools on simulated and real data. Our simulated data analysis highlights the additional information that is conveyed when using the new weighted tree similarity measure, and shows that extending QMC to a weighted setting improves the quality of tree reconstruction. Our analyses of a cyanobacterial data set with weighted QMC reinforce previous results achieved with other tools.
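
To make the notion of a weighted quartet concrete, the following is a minimal Python sketch of how a quartet topology and a weight could be read off a tree with edge lengths. The abstract does not state the weighting scheme the authors propose, so the weight used here (the length of the internal path separating the two leaf pairs) is an illustrative assumption. The function names `path_distances` and `weighted_quartet` are hypothetical, and the topology is recovered with the standard four-point condition on path lengths.

```python
def path_distances(edges):
    """All-pairs path lengths in a tree given as (u, v, length) edges."""
    adj = {}
    for u, v, w in edges:
        adj.setdefault(u, []).append((v, w))
        adj.setdefault(v, []).append((u, w))
    dist = {}
    for src in adj:
        dist[src] = {src: 0.0}
        stack = [src]
        while stack:  # depth-first walk; a tree has one path per node pair
            node = stack.pop()
            for nxt, w in adj[node]:
                if nxt not in dist[src]:
                    dist[src][nxt] = dist[src][node] + w
                    stack.append(nxt)
    return dist


def weighted_quartet(dist, a, b, c, d):
    """Return (topology, weight) for leaves a, b, c, d.

    Four-point condition: of the three pairings, the one with the
    smallest distance sum is the induced split. The weight is an
    ASSUMED illustrative choice: the internal path length, which
    equals half the gap between the smallest and the next sum.
    """
    splits = [
        (((a, b), (c, d)), dist[a][b] + dist[c][d]),
        (((a, c), (b, d)), dist[a][c] + dist[b][d]),
        (((a, d), (b, c)), dist[a][d] + dist[b][c]),
    ]
    splits.sort(key=lambda s: s[1])
    topology, smallest = splits[0]
    weight = (splits[1][1] - smallest) / 2.0
    return topology, weight


# Toy tree: leaves A, B hang off internal node x; C, D hang off y.
edges = [("A", "x", 1), ("B", "x", 2), ("x", "y", 5),
         ("C", "y", 3), ("D", "y", 4)]
dist = path_distances(edges)
topology, weight = weighted_quartet(dist, "A", "B", "C", "D")
print(topology, weight)
```

Under this assumed scheme, an unresolved (star-like) quartet gets weight 0 and its reported topology is arbitrary, which is one reason a weight can serve as a reliability signal when quartets are later amalgamated.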
