• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于四元组的系统发育分支长度估计

Phylogenomic branch length estimation using quartets.

机构信息

Department of Computer Science, University of Illinois at Urbana-Champaign, Urbana, IL 61801, United States.

Department of Integrative Biology, University of California at Berkeley, Berkeley, CA 94720, United States.

出版信息

Bioinformatics. 2023 Jun 30;39(39 Suppl 1):i185-i193. doi: 10.1093/bioinformatics/btad221.

DOI:10.1093/bioinformatics/btad221
PMID:37387151
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC10311336/
Abstract

MOTIVATION

Branch lengths and topology of a species tree are essential in most downstream analyses, including estimation of diversification dates, characterization of selection, understanding adaptation, and comparative genomics. Modern phylogenomic analyses often use methods that account for the heterogeneity of evolutionary histories across the genome due to processes such as incomplete lineage sorting. However, these methods typically do not generate branch lengths in units that are usable by downstream applications, forcing phylogenomic analyses to resort to alternative shortcuts such as estimating branch lengths by concatenating gene alignments into a supermatrix. Yet, concatenation and other available approaches for estimating branch lengths fail to address heterogeneity across the genome.

RESULTS

In this article, we derive expected values of gene tree branch lengths in substitution units under an extension of the multispecies coalescent (MSC) model that allows substitutions with varying rates across the species tree. We present CASTLES, a new technique for estimating branch lengths on the species tree from estimated gene trees that uses these expected values, and our study shows that CASTLES improves on the most accurate prior methods with respect to both speed and accuracy.

AVAILABILITY AND IMPLEMENTATION

CASTLES is available at https://github.com/ytabatabaee/CASTLES.

摘要

动机

种系树的分支长度和拓扑结构是大多数下游分析的基础,包括多样化日期的估计、选择特征的描述、适应性的理解和比较基因组学。由于不完全谱系分选等过程,现代系统基因组学分析通常使用考虑基因组中进化历史异质性的方法。然而,这些方法通常不会生成可用于下游应用的单位的分支长度,迫使系统基因组学分析采用替代方法,如通过将基因比对串联成超级矩阵来估计分支长度。然而,串联和其他可用的估计分支长度的方法无法解决基因组中的异质性。

结果

在本文中,我们推导出了在允许跨种系树的替换率变化的多物种合并(MSC)模型的扩展下,替换单位中基因树分支长度的期望值。我们提出了 CASTLES,这是一种从估计的基因树估计种系树上分支长度的新技术,它使用这些期望值,我们的研究表明,CASTLES 在速度和准确性方面都优于最准确的先验方法。

可用性和实现

CASTLES 可在 https://github.com/ytabatabaee/CASTLES 获得。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6969/10311336/80e3d703fee5/btad221f4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6969/10311336/c1da42b0d00c/btad221f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6969/10311336/f7659f173c79/btad221f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6969/10311336/bfe2de14d186/btad221f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6969/10311336/80e3d703fee5/btad221f4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6969/10311336/c1da42b0d00c/btad221f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6969/10311336/f7659f173c79/btad221f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6969/10311336/bfe2de14d186/btad221f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6969/10311336/80e3d703fee5/btad221f4.jpg

相似文献

1
Phylogenomic branch length estimation using quartets.基于四元组的系统发育分支长度估计
Bioinformatics. 2023 Jun 30;39(39 Suppl 1):i185-i193. doi: 10.1093/bioinformatics/btad221.
2
Theoretical and Practical Considerations when using Retroelement Insertions to Estimate Species Trees in the Anomaly Zone.在异常区域使用逆转录元件插入来估计物种树时的理论与实践考量
Syst Biol. 2022 Apr 19;71(3):721-740. doi: 10.1093/sysbio/syab086.
3
Weighted Statistical Binning: Enabling Statistically Consistent Genome-Scale Phylogenetic Analyses.加权统计分箱法:实现统计上一致的全基因组系统发育分析
PLoS One. 2015 Jun 18;10(6):e0129183. doi: 10.1371/journal.pone.0129183. eCollection 2015.
4
To Include or Not to Include: The Impact of Gene Filtering on Species Tree Estimation Methods.包含还是不包含:基因过滤对物种树估计方法的影响。
Syst Biol. 2018 Mar 1;67(2):285-303. doi: 10.1093/sysbio/syx077.
5
Collapsing dubiously resolved gene-tree branches in phylogenomic coalescent analyses.系统发生合并分析中可疑解决的基因树分支的崩溃。
Mol Phylogenet Evol. 2021 May;158:107092. doi: 10.1016/j.ympev.2021.107092. Epub 2021 Feb 2.
6
Branch Length Transforms using Optimal Tree Metric Matching.使用最优树度量匹配的分支长度变换
bioRxiv. 2025 Mar 2:2023.11.13.566962. doi: 10.1101/2023.11.13.566962.
7
wQFM: highly accurate genome-scale species tree estimation from weighted quartets.wQFM:基于加权四重奏的高精度基因组规模物种树估计
Bioinformatics. 2021 Nov 5;37(21):3734-3743. doi: 10.1093/bioinformatics/btab428.
8
Comparing species tree estimation with large anchored phylogenomic and small Sanger-sequenced molecular datasets: an empirical study on Malagasy pseudoxyrhophiine snakes.比较大型锚定系统发育基因组学和小型桑格测序分子数据集的物种树估计:马达加斯加伪蝰蛇的实证研究
BMC Evol Biol. 2015 Oct 12;15:221. doi: 10.1186/s12862-015-0503-1.
9
The gene tree delusion.基因树错觉
Mol Phylogenet Evol. 2016 Jan;94(Pt A):1-33. doi: 10.1016/j.ympev.2015.07.018. Epub 2015 Jul 31.
10
Quantification of congruence among gene trees with polytomies using overall success of resolution for phylogenomic coalescent analyses.使用系统发育合并分析中整体分辨率成功度对具多叉树的基因树进行同形度量化。
Cladistics. 2023 Oct;39(5):418-436. doi: 10.1111/cla.12540. Epub 2023 Apr 25.

引用本文的文献

1
Shifts in bee diet breadths are associated with gene gains and losses and positive selection across olfactory receptors.蜜蜂饮食广度的变化与基因的得失以及嗅觉受体的正选择有关。
G3 (Bethesda). 2025 Aug 6;15(8). doi: 10.1093/g3journal/jkaf105.
2
Poplar: a phylogenomics pipeline.杨树:一种系统发育基因组学流程。
Bioinform Adv. 2025 May 6;5(1):vbaf104. doi: 10.1093/bioadv/vbaf104. eCollection 2025.
3
Extensive genome evolution distinguishes maize within a stable tribe of grasses.广泛的基因组进化使玉米在一个稳定的禾本科族中脱颖而出。

本文引用的文献

1
Weighting by Gene Tree Uncertainty Improves Accuracy of Quartet-based Species Trees.基于基因树不确定性的加权可提高基于四元组的种系发生树的准确性。
Mol Biol Evol. 2022 Dec 5;39(12). doi: 10.1093/molbev/msac215.
2
Quintet Rooting: rooting species trees under the multi-species coalescent model.五倍体起源:在多物种合并模型下对种系树进行起源分析。
Bioinformatics. 2022 Jun 24;38(Suppl 1):i109-i117. doi: 10.1093/bioinformatics/btac224.
3
Quantifying Biodiversity: a Phylogenetic Perspective.量化生物多样性:系统发育视角
bioRxiv. 2025 Jan 24:2025.01.22.633974. doi: 10.1101/2025.01.22.633974.
4
Modeling Substitution Rate Evolution across Lineages and Relaxing the Molecular Clock.对谱系间替代率演变进行建模和放松分子钟假说。
Genome Biol Evol. 2024 Sep 3;16(9). doi: 10.1093/gbe/evae199.
5
Admixture in the fungal pathogen Blastomyces.真菌病原体芽生菌中的混合现象。
Genetics. 2024 Sep 24;228(4). doi: 10.1093/genetics/iyae155.
Conserv Biol. 2002 Feb;16(1):248-252. doi: 10.1046/j.1523-1739.2002.00503.x.
4
An estimate of the deepest branches of the tree of life from ancient vertically evolving genes.从古老的垂直进化基因估计生命之树的最深分支。
Elife. 2022 Feb 22;11:e66695. doi: 10.7554/eLife.66695.
5
Completing gene trees without species trees in sub-quadratic time.在亚二次时间内不依赖物种树完成基因树构建。
Bioinformatics. 2022 Mar 4;38(6):1532-1541. doi: 10.1093/bioinformatics/btab875.
6
DISCO: Species Tree Inference using Multicopy Gene Family Tree Decomposition.利用多拷贝基因家族树分解进行种系树推断。
Syst Biol. 2022 Apr 19;71(3):610-629. doi: 10.1093/sysbio/syab070.
7
Phylogenomics of 10,575 genomes reveals evolutionary proximity between domains Bacteria and Archaea.10575 个基因组的系统发生基因组学揭示了细菌域和古菌域之间的进化亲缘关系。
Nat Commun. 2019 Dec 2;10(1):5477. doi: 10.1038/s41467-019-13443-4.
8
Revisiting a Key Innovation in Evolutionary Biology: Felsenstein's "Phylogenies and the Comparative Method".重温进化生物学的一个关键创新:费雪斯坦的“系统发育与比较方法”。
Am Nat. 2019 Jun;193(6):755-772. doi: 10.1086/703055. Epub 2019 Apr 23.
9
ASTRAL-III: polynomial time species tree reconstruction from partially resolved gene trees.ASTRAL-III:从部分解析的基因树重建多项式时间种系发生树。
BMC Bioinformatics. 2018 May 8;19(Suppl 6):153. doi: 10.1186/s12859-018-2129-y.
10
A Two-State Model of Tree Evolution and Its Applications to Alu Retrotransposition.树进化的两状态模型及其在 Alu 反转录转座中的应用。
Syst Biol. 2018 May 1;67(3):475-489. doi: 10.1093/sysbio/syx088.