• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

使用 GTM 的非混合不相交树合并可提高物种树估计的准确性。

Unblended disjoint tree merging using GTM improves species tree estimation.

机构信息

Department of Computer Science, University of Illinois at Urbana-Champaign, 201 N Goodwin Ave, Urbana, 61801, IL, US.

出版信息

BMC Genomics. 2020 Apr 16;21(Suppl 2):235. doi: 10.1186/s12864-020-6605-1.

DOI:10.1186/s12864-020-6605-1
PMID:32299343
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC7161100/
Abstract

BACKGROUND

Phylogeny estimation is an important part of much biological research, but large-scale tree estimation is infeasible using standard methods due to computational issues. Recently, an approach to large-scale phylogeny has been proposed that divides a set of species into disjoint subsets, computes trees on the subsets, and then merges the trees together using a computed matrix of pairwise distances between the species. The novel component of these approaches is the last step: Disjoint Tree Merger (DTM) methods.

RESULTS

We present GTM (Guide Tree Merger), a polynomial time DTM method that adds edges to connect the subset trees, so as to provably minimize the topological distance to a computed guide tree. Thus, GTM performs unblended mergers, unlike the previous DTM methods. Yet, despite the potential limitation, our study shows that GTM has excellent accuracy, generally matching or improving on two previous DTMs, and is much faster than both.

CONCLUSIONS

The proposed GTM approach to the DTM problem is a useful new tool for large-scale phylogenomic analysis, and shows the surprising potential for unblended DTM methods.

摘要

背景

系统发育估计是许多生物学研究的重要组成部分,但由于计算问题,使用标准方法进行大规模树估计是不可行的。最近,提出了一种大规模系统发育的方法,该方法将一组物种划分为不相交的子集,在子集中计算树,然后使用物种之间计算的成对距离矩阵将树合并在一起。这些方法的新颖之处在于最后一步:不相交树合并(DTM)方法。

结果

我们提出了 GTM(引导树合并),这是一种多项式时间的 DTM 方法,它添加边来连接子集树,从而可证明地最小化到计算出的引导树的拓扑距离。因此,GTM 执行未混合的合并,与以前的 DTM 方法不同。然而,尽管存在潜在的局限性,但我们的研究表明,GTM 具有出色的准确性,通常与两个以前的 DTM 匹配或改进,并且比两者都快得多。

结论

提出的 DTM 问题的 GTM 方法是大规模基因组分析的有用新工具,并显示出未混合 DTM 方法的惊人潜力。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0f7c/7161100/aae36a34b3b5/12864_2020_6605_Fig9_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0f7c/7161100/4103c529da95/12864_2020_6605_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0f7c/7161100/1bff4b3af4ca/12864_2020_6605_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0f7c/7161100/2ddfc04ea23a/12864_2020_6605_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0f7c/7161100/1a460222a983/12864_2020_6605_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0f7c/7161100/17890c95f911/12864_2020_6605_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0f7c/7161100/7c99f927a176/12864_2020_6605_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0f7c/7161100/51e2feef6bcc/12864_2020_6605_Fig7_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0f7c/7161100/35e79378b249/12864_2020_6605_Fig8_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0f7c/7161100/aae36a34b3b5/12864_2020_6605_Fig9_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0f7c/7161100/4103c529da95/12864_2020_6605_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0f7c/7161100/1bff4b3af4ca/12864_2020_6605_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0f7c/7161100/2ddfc04ea23a/12864_2020_6605_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0f7c/7161100/1a460222a983/12864_2020_6605_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0f7c/7161100/17890c95f911/12864_2020_6605_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0f7c/7161100/7c99f927a176/12864_2020_6605_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0f7c/7161100/51e2feef6bcc/12864_2020_6605_Fig7_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0f7c/7161100/35e79378b249/12864_2020_6605_Fig8_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0f7c/7161100/aae36a34b3b5/12864_2020_6605_Fig9_HTML.jpg

相似文献

1
Unblended disjoint tree merging using GTM improves species tree estimation.使用 GTM 的非混合不相交树合并可提高物种树估计的准确性。
BMC Genomics. 2020 Apr 16;21(Suppl 2):235. doi: 10.1186/s12864-020-6605-1.
2
Using Constrained-INC for Large-Scale Gene Tree and Species Tree Estimation.使用约束增量法进行大规模基因树和物种树估计。
IEEE/ACM Trans Comput Biol Bioinform. 2021 Jan-Feb;18(1):2-15. doi: 10.1109/TCBB.2020.2990867. Epub 2021 Feb 3.
3
Statistically consistent divide-and-conquer pipelines for phylogeny estimation using NJMerge.使用NJMerge进行系统发育估计的统计上一致的分治管道。
Algorithms Mol Biol. 2019 Jul 19;14:14. doi: 10.1186/s13015-019-0151-x. eCollection 2019.
4
SVDquest: Improving SVDquartets species tree estimation using exact optimization within a constrained search space.SVDquest:在约束搜索空间内使用精确优化提高 SVDquartets 种系树估计。
Mol Phylogenet Evol. 2018 Jul;124:122-136. doi: 10.1016/j.ympev.2018.03.006. Epub 2018 Mar 9.
5
Fragmentary Gene Sequences Negatively Impact Gene Tree and Species Tree Reconstruction.片段基因序列对基因树和种系发生树的重建有负面影响。
Mol Biol Evol. 2017 Dec 1;34(12):3279-3291. doi: 10.1093/molbev/msx261.
6
BBCA: Improving the scalability of *BEAST using random binning.BBCA:使用随机装箱提高BEAST的可扩展性。
BMC Genomics. 2014;15 Suppl 6(Suppl 6):S11. doi: 10.1186/1471-2164-15-S6-S11. Epub 2014 Oct 17.
7
STELAR: a statistically consistent coalescent-based species tree estimation method by maximizing triplet consistency.STELAR:一种基于最大三重一致性的统计一致的合并物种树估计方法。
BMC Genomics. 2020 Feb 10;21(1):136. doi: 10.1186/s12864-020-6519-y.
8
SIESTA: enhancing searches for optimal supertrees and species trees.SIESTA:增强最优超级树和物种树搜索。
BMC Genomics. 2018 May 8;19(Suppl 5):252. doi: 10.1186/s12864-018-4621-1.
9
To Include or Not to Include: The Impact of Gene Filtering on Species Tree Estimation Methods.包含还是不包含:基因过滤对物种树估计方法的影响。
Syst Biol. 2018 Mar 1;67(2):285-303. doi: 10.1093/sysbio/syx077.
10
Forcing external constraints on tree inference using ASTRAL.使用 ASTRAL 强制对树推断施加外部约束。
BMC Genomics. 2020 Apr 16;21(Suppl 2):218. doi: 10.1186/s12864-020-6607-z.

引用本文的文献

1
Sparse Neighbor Joining: rapid phylogenetic inference using a sparse distance matrix.稀疏邻接法:使用稀疏距离矩阵进行快速系统发育推断。
Bioinformatics. 2024 Nov 28;40(12). doi: 10.1093/bioinformatics/btae701.
2
Efficient phylogenetic tree inference for massive taxonomic datasets: harnessing the power of a server to analyze 1 million taxa.针对海量分类数据集的高效系统发育树推断:利用服务器的能力分析100万个分类单元。
Gigascience. 2024 Jan 2;13. doi: 10.1093/gigascience/giae055.
3
Recent progress on methods for estimating and updating large phylogenies.

本文引用的文献

1
TreeMerge: a new method for improving the scalability of species tree estimation methods.TreeMerge:一种提高物种树估计方法可扩展性的新方法。
Bioinformatics. 2019 Jul 15;35(14):i417-i426. doi: 10.1093/bioinformatics/btz344.
2
Statistically consistent divide-and-conquer pipelines for phylogeny estimation using NJMerge.使用NJMerge进行系统发育估计的统计上一致的分治管道。
Algorithms Mol Biol. 2019 Jul 19;14:14. doi: 10.1186/s13015-019-0151-x. eCollection 2019.
3
Benchmarking of alignment-free sequence comparison methods.无比对信息的序列比较方法的基准测试。
关于估计和更新大型系统发育树的方法的最新进展。
Philos Trans R Soc Lond B Biol Sci. 2022 Oct 10;377(1861):20210244. doi: 10.1098/rstb.2021.0244. Epub 2022 Aug 22.
4
Phylogeny Estimation Given Sequence Length Heterogeneity.给定序列长度异质性的系统发育估计。
Syst Biol. 2021 Feb 10;70(2):268-282. doi: 10.1093/sysbio/syaa058.
Genome Biol. 2019 Jul 25;20(1):144. doi: 10.1186/s13059-019-1755-7.
4
Constrained incremental tree building: new absolute fast converging phylogeny estimation methods with improved scalability and accuracy.约束增量树构建:具有改进的可扩展性和准确性的新型绝对快速收敛系统发育估计方法。
Algorithms Mol Biol. 2019 Feb 6;14:2. doi: 10.1186/s13015-019-0136-9. eCollection 2019.
5
Long-Branch Attraction in Species Tree Estimation: Inconsistency of Partitioned Likelihood and Topology-Based Summary Methods.种系树估计中的长枝吸引:分区似然和基于拓扑的总结方法的不一致性。
Syst Biol. 2019 Mar 1;68(2):281-297. doi: 10.1093/sysbio/syy061.
6
ASTRAL-III: polynomial time species tree reconstruction from partially resolved gene trees.ASTRAL-III:从部分解析的基因树重建多项式时间种系发生树。
BMC Bioinformatics. 2018 May 8;19(Suppl 6):153. doi: 10.1186/s12859-018-2129-y.
7
Fragmentary Gene Sequences Negatively Impact Gene Tree and Species Tree Reconstruction.片段基因序列对基因树和种系发生树的重建有负面影响。
Mol Biol Evol. 2017 Dec 1;34(12):3279-3291. doi: 10.1093/molbev/msx261.
8
A greedy alignment-free distance estimator for phylogenetic inference.一种用于系统发育推断的贪婪无比对距离估计器。
BMC Bioinformatics. 2017 Jun 7;18(Suppl 8):238. doi: 10.1186/s12859-017-1658-0.
9
ASTRID: Accurate Species TRees from Internode Distances.ASTRID:基于节间距离的精确物种树
BMC Genomics. 2015;16 Suppl 10(Suppl 10):S3. doi: 10.1186/1471-2164-16-S10-S3. Epub 2015 Oct 2.
10
Ultra-large alignments using phylogeny-aware profiles.使用系统发育感知概况的超大比对。
Genome Biol. 2015 Jun 16;16(1):124. doi: 10.1186/s13059-015-0688-z.