• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

当数据在谱系间存在异质性时,为基于四重奏的方法设计权重。

Designing Weights for Quartet-Based Methods When Data are Heterogeneous Across Lineages.

机构信息

Institut de Matematiques de la UPC-BarcelonaTech (IMTech), Universitat Politècnica de Catalunya and Centre de Recerca Matemàtica, Av. Diagonal 647, 08028, Barcelona, Spain.

University of British Columbia, Vancouver, Canada.

出版信息

Bull Math Biol. 2023 Jun 13;85(7):68. doi: 10.1007/s11538-023-01167-y.

DOI:10.1007/s11538-023-01167-y
PMID:37310552
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC10264505/
Abstract

Homogeneity across lineages is a general assumption in phylogenetics according to which nucleotide substitution rates are common to all lineages. Many phylogenetic methods relax this hypothesis but keep a simple enough model to make the process of sequence evolution more tractable. On the other hand, dealing successfully with the general case (heterogeneity of rates across lineages) is one of the key features of phylogenetic reconstruction methods based on algebraic tools. The goal of this paper is twofold. First, we present a new weighting system for quartets (ASAQ) based on algebraic and semi-algebraic tools, thus especially indicated to deal with data evolving under heterogeneous rates. This method combines the weights of two previous methods by means of a test based on the positivity of the branch lengths estimated with the paralinear distance. ASAQ is statistically consistent when applied to data generated under the general Markov model, considers rate and base composition heterogeneity among lineages and does not assume stationarity nor time-reversibility. Second, we test and compare the performance of several quartet-based methods for phylogenetic tree reconstruction (namely QFM, wQFM, quartet puzzling, weight optimization and Willson's method) in combination with several systems of weights, including ASAQ weights and other weights based on algebraic and semi-algebraic methods or on the paralinear distance. These tests are applied to both simulated and real data and support weight optimization with ASAQ weights as a reliable and successful reconstruction method that improves upon the accuracy of global methods (such as neighbor-joining or maximum likelihood) in the presence of long branches or on mixtures of distributions on trees.

摘要

系统发育学中的一个普遍假设是,所有谱系的核苷酸替换率都是相同的。许多系统发育方法放松了这一假设,但保持了足够简单的模型,以使序列进化过程更易于处理。另一方面,成功处理一般情况(谱系间的速率异质性)是基于代数工具的系统发育重建方法的关键特征之一。本文的目的有两个。首先,我们提出了一种新的基于代数和半代数工具的四分体加权系统(ASAQ),因此特别适合处理在异速率下进化的数据。该方法通过基于平行距离估计的分支长度的正性测试,将两种先前方法的权重结合起来。ASAQ 在应用于一般马尔可夫模型生成的数据时具有统计一致性,考虑了谱系之间的速率和碱基组成异质性,并且不假设平稳性或时间可逆性。其次,我们测试并比较了几种基于四分体的系统发育树重建方法(即 QFM、wQFM、四分体拼图、权重优化和 Willson 方法)与几种权重系统(包括 ASAQ 权重和基于代数和半代数方法或平行距离的其他权重)的性能。这些测试应用于模拟和真实数据,并支持使用 ASAQ 权重进行权重优化,这是一种可靠且成功的重建方法,在存在长分支或在树上的分布混合时,可提高全局方法(如邻接法或最大似然法)的准确性。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ac1f/10264505/7fd5eaa5c790/11538_2023_1167_Fig17_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ac1f/10264505/70a12bed1373/11538_2023_1167_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ac1f/10264505/a2c10ef9812a/11538_2023_1167_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ac1f/10264505/113530077942/11538_2023_1167_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ac1f/10264505/d3694c991272/11538_2023_1167_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ac1f/10264505/212c3fc018ce/11538_2023_1167_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ac1f/10264505/4fc192b3293d/11538_2023_1167_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ac1f/10264505/ef12eb1e095d/11538_2023_1167_Fig7_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ac1f/10264505/479ff507e2ae/11538_2023_1167_Fig8_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ac1f/10264505/4daf02e92531/11538_2023_1167_Fig9_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ac1f/10264505/89ebb428b3cc/11538_2023_1167_Fig10_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ac1f/10264505/901f9f2b357b/11538_2023_1167_Fig11_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ac1f/10264505/38852c00e35d/11538_2023_1167_Fig12_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ac1f/10264505/b1e6f3136b0a/11538_2023_1167_Fig13_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ac1f/10264505/c50736b643ea/11538_2023_1167_Fig14_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ac1f/10264505/30112802427b/11538_2023_1167_Fig15_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ac1f/10264505/83a77a6d19fe/11538_2023_1167_Fig16_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ac1f/10264505/7fd5eaa5c790/11538_2023_1167_Fig17_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ac1f/10264505/70a12bed1373/11538_2023_1167_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ac1f/10264505/a2c10ef9812a/11538_2023_1167_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ac1f/10264505/113530077942/11538_2023_1167_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ac1f/10264505/d3694c991272/11538_2023_1167_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ac1f/10264505/212c3fc018ce/11538_2023_1167_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ac1f/10264505/4fc192b3293d/11538_2023_1167_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ac1f/10264505/ef12eb1e095d/11538_2023_1167_Fig7_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ac1f/10264505/479ff507e2ae/11538_2023_1167_Fig8_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ac1f/10264505/4daf02e92531/11538_2023_1167_Fig9_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ac1f/10264505/89ebb428b3cc/11538_2023_1167_Fig10_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ac1f/10264505/901f9f2b357b/11538_2023_1167_Fig11_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ac1f/10264505/38852c00e35d/11538_2023_1167_Fig12_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ac1f/10264505/b1e6f3136b0a/11538_2023_1167_Fig13_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ac1f/10264505/c50736b643ea/11538_2023_1167_Fig14_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ac1f/10264505/30112802427b/11538_2023_1167_Fig15_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ac1f/10264505/83a77a6d19fe/11538_2023_1167_Fig16_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ac1f/10264505/7fd5eaa5c790/11538_2023_1167_Fig17_HTML.jpg

相似文献

1
Designing Weights for Quartet-Based Methods When Data are Heterogeneous Across Lineages.当数据在谱系间存在异质性时,为基于四重奏的方法设计权重。
Bull Math Biol. 2023 Jun 13;85(7):68. doi: 10.1007/s11538-023-01167-y.
2
Invariant Versus Classical Quartet Inference When Evolution is Heterogeneous Across Sites and Lineages.当进化在不同位点和谱系间存在异质性时,不变量与经典四重奏推断法
Syst Biol. 2016 Mar;65(2):280-91. doi: 10.1093/sysbio/syv086. Epub 2015 Nov 11.
3
SAQ: Semi-Algebraic Quartet Reconstruction.问题集:半代数四重态重建。
IEEE/ACM Trans Comput Biol Bioinform. 2021 Nov-Dec;18(6):2855-2861. doi: 10.1109/TCBB.2021.3101278. Epub 2021 Dec 8.
4
EM for phylogenetic topology reconstruction on nonhomogeneous data.EM 算法在非同源数据的系统发育拓扑重建中的应用。
BMC Evol Biol. 2014 Jun 17;14:132. doi: 10.1186/1471-2148-14-132.
5
Weighted quartets phylogenetics.加权四重奏系统发育学
Syst Biol. 2015 Mar;64(2):233-42. doi: 10.1093/sysbio/syu087. Epub 2014 Nov 19.
6
wQFM: highly accurate genome-scale species tree estimation from weighted quartets.wQFM:基于加权四重奏的高精度基因组规模物种树估计
Bioinformatics. 2021 Nov 5;37(21):3734-3743. doi: 10.1093/bioinformatics/btab428.
7
Quartet-based phylogenetic inference: improvements and limits.基于四重奏的系统发育推断:改进与局限
Mol Biol Evol. 2001 Jun;18(6):1103-16. doi: 10.1093/oxfordjournals.molbev.a003881.
8
Heterogeneity of nucleotide frequencies among evolutionary lineages and phylogenetic inference.进化谱系间核苷酸频率的异质性与系统发育推断。
Mol Biol Evol. 2003 Apr;20(4):610-21. doi: 10.1093/molbev/msg067. Epub 2003 Apr 2.
9
Short quartet puzzling: a new quartet-based phylogeny reconstruction algorithm.短四重奏难题:一种基于四重奏的新系统发育重建算法。
J Comput Biol. 2008 Jan-Feb;15(1):91-103. doi: 10.1089/cmb.2007.0103.
10
Genetic distance for a general non-stationary markov substitution process.一般非平稳马尔可夫替换过程的遗传距离。
Syst Biol. 2015 Mar;64(2):281-93. doi: 10.1093/sysbio/syu106. Epub 2014 Dec 9.

引用本文的文献

1
Leveraging Weighted Quartet Distributions for Enhanced Species Tree Inference from Genome-Wide Data.利用加权四重奏分布从全基因组数据中增强物种树推断
Genome Biol Evol. 2025 Sep 2;17(9). doi: 10.1093/gbe/evaf159.
2
Ultrafast learning of four-node hybridization cycles in phylogenetic networks using algebraic invariants.利用代数不变量在系统发育网络中对四节点杂交循环进行超快学习。
Bioinform Adv. 2024 Feb 8;4(1):vbae014. doi: 10.1093/bioadv/vbae014. eCollection 2024.