• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

重新探讨 Fiduccia-Mattheyses 四重奏在更大的系统发育研究中的应用。

Quartet Fiduccia-Mattheyses revisited for larger phylogenetic studies.

机构信息

Department of Computer Science and Engineering, Bangladesh University of Engineering and Technology, Dhaka 1205, Bangladesh.

出版信息

Bioinformatics. 2023 Jun 1;39(6). doi: 10.1093/bioinformatics/btad332.

DOI:10.1093/bioinformatics/btad332
PMID:37285316
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC10260390/
Abstract

MOTIVATION

With the recent breakthroughs in sequencing technology, phylogeny estimation at a larger scale has become a huge opportunity. For accurate estimation of large-scale phylogeny, substantial endeavor is being devoted in introducing new algorithms or upgrading current approaches. In this work, we endeavor to improve the Quartet Fiduccia and Mattheyses (QFM) algorithm to resolve phylogenetic trees of better quality with better running time. QFM was already being appreciated by researchers for its good tree quality, but fell short in larger phylogenomic studies due to its excessively slow running time.

RESULTS

We have re-designed QFM so that it can amalgamate millions of quartets over thousands of taxa into a species tree with a great level of accuracy within a short amount of time. Named "QFM Fast and Improved (QFM-FI)", our version is 20 000× faster than the previous version and 400× faster than the widely used variant of QFM implemented in PAUP* on larger datasets. We have also provided a theoretical analysis of the running time and memory requirements of QFM-FI. We have conducted a comparative study of QFM-FI with other state-of-the-art phylogeny reconstruction methods, such as QFM, QMC, wQMC, wQFM, and ASTRAL, on simulated as well as real biological datasets. Our results show that QFM-FI improves on the running time and tree quality of QFM and produces trees that are comparable with state-of-the-art methods.

AVAILABILITY AND IMPLEMENTATION

QFM-FI is open source and available at https://github.com/sharmin-mim/qfm_java.

摘要

动机

随着测序技术的最新突破,更大规模的系统发育估计成为了一个巨大的机会。为了准确估计大规模的系统发育,人们正在努力引入新的算法或升级现有的方法。在这项工作中,我们努力改进四分体 Fiduccia 和 Mattheyses(QFM)算法,以提高质量并缩短运行时间。QFM 因其良好的树质量而受到研究人员的赞赏,但由于运行时间过长,在更大的系统基因组学研究中表现不佳。

结果

我们重新设计了 QFM,以便能够在短时间内将数百万个四分体合并到数千个分类单元的物种树中,具有很高的准确性。我们的版本名为“QFM Fast and Improved (QFM-FI)”,比以前的版本快 20000 倍,比 PAUP*中广泛使用的 QFM 变体在更大的数据集上快 400 倍。我们还对 QFM-FI 的运行时间和内存需求进行了理论分析。我们对 QFM-FI 与其他最先进的系统发育重建方法,如 QFM、QMC、wQMC、wQFM 和 ASTRAL 进行了比较研究,包括模拟和真实生物数据集。我们的结果表明,QFM-FI 提高了 QFM 的运行时间和树质量,并产生了与最先进方法相当的树。

可用性和实现

QFM-FI 是开源的,可在 https://github.com/sharmin-mim/qfm_java 上获得。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6707/10260390/405e5e7cc287/btad332f5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6707/10260390/643e36c79f19/btad332f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6707/10260390/b470f91d304e/btad332f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6707/10260390/4545f1511655/btad332f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6707/10260390/c19d239b7aa4/btad332f4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6707/10260390/405e5e7cc287/btad332f5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6707/10260390/643e36c79f19/btad332f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6707/10260390/b470f91d304e/btad332f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6707/10260390/4545f1511655/btad332f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6707/10260390/c19d239b7aa4/btad332f4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6707/10260390/405e5e7cc287/btad332f5.jpg

相似文献

1
Quartet Fiduccia-Mattheyses revisited for larger phylogenetic studies.重新探讨 Fiduccia-Mattheyses 四重奏在更大的系统发育研究中的应用。
Bioinformatics. 2023 Jun 1;39(6). doi: 10.1093/bioinformatics/btad332.
2
wQFM: highly accurate genome-scale species tree estimation from weighted quartets.wQFM:基于加权四重奏的高精度基因组规模物种树估计
Bioinformatics. 2021 Nov 5;37(21):3734-3743. doi: 10.1093/bioinformatics/btab428.
3
Improving quartet graph construction for scalable and accurate species tree estimation from gene trees.改进四重图构建,以实现从基因树到可扩展和准确的种系发生树估计。
Genome Res. 2023 Jul;33(7):1042-1052. doi: 10.1101/gr.277629.122. Epub 2023 May 17.
4
ASTRAL-II: coalescent-based species tree estimation with many hundreds of taxa and thousands of genes.ASTRAL-II:基于合并的数百个分类群和数千个基因的种系发生树估计。
Bioinformatics. 2015 Jun 15;31(12):i44-52. doi: 10.1093/bioinformatics/btv234.
5
SVDquest: Improving SVDquartets species tree estimation using exact optimization within a constrained search space.SVDquest:在约束搜索空间内使用精确优化提高 SVDquartets 种系树估计。
Mol Phylogenet Evol. 2018 Jul;124:122-136. doi: 10.1016/j.ympev.2018.03.006. Epub 2018 Mar 9.
6
ASTRAL-III: polynomial time species tree reconstruction from partially resolved gene trees.ASTRAL-III:从部分解析的基因树重建多项式时间种系发生树。
BMC Bioinformatics. 2018 May 8;19(Suppl 6):153. doi: 10.1186/s12859-018-2129-y.
7
The Performance of Two Supertree Schemes Compared Using Synthetic and Real Data Quartet Input.两种超级树构建方案在使用合成和真实数据四元组输入时的性能比较。
J Mol Evol. 2018 Feb;86(2):150-165. doi: 10.1007/s00239-018-9833-0. Epub 2018 Feb 19.
8
Weighted quartets phylogenetics.加权四重奏系统发育学
Syst Biol. 2015 Mar;64(2):233-42. doi: 10.1093/sysbio/syu087. Epub 2014 Nov 19.
9
QuCo: quartet-based co-estimation of species trees and gene trees.QuCo:基于四重奏的物种树和基因树的联合估计。
Bioinformatics. 2022 Jun 24;38(Suppl 1):i413-i421. doi: 10.1093/bioinformatics/btac265.
10
TreeMerge: a new method for improving the scalability of species tree estimation methods.TreeMerge:一种提高物种树估计方法可扩展性的新方法。
Bioinformatics. 2019 Jul 15;35(14):i417-i426. doi: 10.1093/bioinformatics/btz344.

引用本文的文献

1
Leveraging Weighted Quartet Distributions for Enhanced Species Tree Inference from Genome-Wide Data.利用加权四重奏分布从全基因组数据中增强物种树推断
Genome Biol Evol. 2025 Sep 2;17(9). doi: 10.1093/gbe/evaf159.
2
wQFM-TREE: highly accurate and scalable quartet-based species tree inference from gene trees.wQFM-TREE:基于四重奏从基因树中进行高精度且可扩展的物种树推断。
Bioinform Adv. 2025 Mar 13;5(1):vbaf053. doi: 10.1093/bioadv/vbaf053. eCollection 2025.
3
wQFM-DISCO: DISCO-enabled wQFM improves phylogenomic analyses despite the presence of paralogs.

本文引用的文献

1
ASTRAL-Pro 2: ultrafast species tree reconstruction from multi-copy gene family trees.ASTRAL-Pro 2:从多拷贝基因家族树重建超快种系发生树。
Bioinformatics. 2022 Oct 31;38(21):4949-4950. doi: 10.1093/bioinformatics/btac620.
2
Quartet Based Gene Tree Imputation Using Deep Learning Improves Phylogenomic Analyses Despite Missing Data.基于四重奏的深度学习基因树推断在存在缺失数据的情况下仍能改进系统发育基因组分析。
J Comput Biol. 2022 Nov;29(11):1156-1172. doi: 10.1089/cmb.2022.0212. Epub 2022 Sep 1.
3
A New Pipeline for Removing Paralogs in Target Enrichment Data.
wQFM-DISCO:尽管存在旁系同源物,但启用DISCO的wQFM改善了系统发育基因组分析。
Bioinform Adv. 2024 Nov 27;4(1):vbae189. doi: 10.1093/bioadv/vbae189. eCollection 2024.
一种用于去除目标富集数据中旁系同源基因的新方法。
Syst Biol. 2022 Feb 10;71(2):410-425. doi: 10.1093/sysbio/syab044.
4
wQFM: highly accurate genome-scale species tree estimation from weighted quartets.wQFM:基于加权四重奏的高精度基因组规模物种树估计
Bioinformatics. 2021 Nov 5;37(21):3734-3743. doi: 10.1093/bioinformatics/btab428.
5
ASTRAL-Pro: Quartet-Based Species-Tree Inference despite Paralogy.ASTRAL-Pro:基于四重奏的系统发生树推断,即便存在基因重复。
Mol Biol Evol. 2020 Nov 1;37(11):3292-3307. doi: 10.1093/molbev/msaa139.
6
No evidence for distinct types in the evolution of SARS-CoV-2.没有证据表明新冠病毒(SARS-CoV-2)在进化过程中存在不同类型。
Virus Evol. 2020 May 14;6(1):veaa034. doi: 10.1093/ve/veaa034. eCollection 2020 Jan.
7
An Unbiased Molecular Approach Using 3'-UTRs Resolves the Avian Family-Level Tree of Life.一种基于 3'UTR 的无偏分子方法解决了鸟类科水平的生命之树。
Mol Biol Evol. 2021 Jan 4;38(1):108-127. doi: 10.1093/molbev/msaa191.
8
Using Constrained-INC for Large-Scale Gene Tree and Species Tree Estimation.使用约束增量法进行大规模基因树和物种树估计。
IEEE/ACM Trans Comput Biol Bioinform. 2021 Jan-Feb;18(1):2-15. doi: 10.1109/TCBB.2020.2990867. Epub 2021 Feb 3.
9
Quartet-based inference of cell differentiation trees from ChIP-Seq histone modification data.基于 quartet 的 ChIP-Seq 组蛋白修饰数据推断细胞分化树。
PLoS One. 2019 Sep 26;14(9):e0221270. doi: 10.1371/journal.pone.0221270. eCollection 2019.
10
Statistically consistent divide-and-conquer pipelines for phylogeny estimation using NJMerge.使用NJMerge进行系统发育估计的统计上一致的分治管道。
Algorithms Mol Biol. 2019 Jul 19;14:14. doi: 10.1186/s13015-019-0151-x. eCollection 2019.