• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

使用ROADIES从原始基因组组装中准确、可扩展且完全自动化地推断物种树。

Accurate, scalable, and fully automated inference of species trees from raw genome assemblies using ROADIES.

作者信息

Gupta Anshu, Mirarab Siavash, Turakhia Yatish

机构信息

Department of Computer Science and Engineering, University of California, San Diego, CA 92093.

Department of Electrical and Computer Engineering, University of California, San Diego, CA 92093.

出版信息

Proc Natl Acad Sci U S A. 2025 May 13;122(19):e2500553122. doi: 10.1073/pnas.2500553122. Epub 2025 May 2.

DOI:10.1073/pnas.2500553122
PMID:40314967
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC12088440/
Abstract

Current genome sequencing initiatives across a wide range of life forms offer significant potential to enhance our understanding of evolutionary relationships and support transformative biological and medical applications. Species trees play a central role in many of these applications; however, despite the widespread availability of genome assemblies, accurate inference of species trees remains challenging due to the limited automation, substantial domain expertise, and computational resources required by conventional methods. To address this limitation, we present ROADIES, a fully automated pipeline to infer species trees starting from raw genome assemblies. In contrast to the prominent approach, ROADIES incorporates a unique strategy of randomly sampling segments of the input genomes to generate gene trees. This eliminates the need for predefining a set of loci, limiting the analyses to a fixed number of genes, and performing the cumbersome gene annotation and/or whole genome alignment steps. ROADIES also eliminates the need to infer orthology by leveraging existing discordance-aware methods that allow multicopy genes. Using the genomic datasets from large-scale sequencing efforts across four diverse life forms (placental mammals, pomace flies, birds, and budding yeasts), we show that ROADIES infers species trees that are comparable in quality to the state-of-the-art studies but in a fraction of the time and effort, including on challenging datasets with rampant gene tree discordance and complex polyploidy. With its speed, accuracy, and automation, ROADIES has the potential to vastly simplify species tree inference, making it accessible to a broader range of scientists and applications.

摘要

当前针对广泛生命形式开展的基因组测序计划,为增进我们对进化关系的理解以及支持变革性的生物学和医学应用提供了巨大潜力。物种树在许多此类应用中发挥着核心作用;然而,尽管基因组组装已广泛可得,但由于传统方法所需的自动化程度有限、大量领域专业知识以及计算资源,准确推断物种树仍然具有挑战性。为解决这一限制,我们提出了ROADIES,这是一种从原始基因组组装开始推断物种树的全自动流程。与突出的方法不同,ROADIES采用了一种独特策略,即对输入基因组的片段进行随机抽样以生成基因树。这消除了预定义一组基因座的需要,将分析限制在固定数量的基因上,以及执行繁琐的基因注释和/或全基因组比对步骤。ROADIES还通过利用允许多拷贝基因的现有不一致感知方法,消除了推断直系同源性的需要。使用来自四种不同生命形式(胎盘哺乳动物、果蝇、鸟类和芽殖酵母)的大规模测序工作的基因组数据集,我们表明ROADIES推断的物种树在质量上与最先进的研究相当,但所需时间和精力仅为其一小部分,包括在具有大量基因树不一致和复杂多倍体的具有挑战性的数据集上。凭借其速度、准确性和自动化,ROADIES有潜力极大地简化物种树推断,使其能够被更广泛的科学家和应用所使用。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/74d3/12088440/611fd516231d/pnas.2500553122fig03.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/74d3/12088440/a05c5fee1579/pnas.2500553122fig01.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/74d3/12088440/518a931b1147/pnas.2500553122fig02.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/74d3/12088440/611fd516231d/pnas.2500553122fig03.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/74d3/12088440/a05c5fee1579/pnas.2500553122fig01.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/74d3/12088440/518a931b1147/pnas.2500553122fig02.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/74d3/12088440/611fd516231d/pnas.2500553122fig03.jpg

相似文献

1
Accurate, scalable, and fully automated inference of species trees from raw genome assemblies using ROADIES.使用ROADIES从原始基因组组装中准确、可扩展且完全自动化地推断物种树。
Proc Natl Acad Sci U S A. 2025 May 13;122(19):e2500553122. doi: 10.1073/pnas.2500553122. Epub 2025 May 2.
2
Accurate, scalable, and fully automated inference of species trees from raw genome assemblies using ROADIES.使用ROADIES从原始基因组组装中准确、可扩展且完全自动化地推断物种树。
bioRxiv. 2024 Jun 1:2024.05.27.596098. doi: 10.1101/2024.05.27.596098.
3
CASTER: Direct species tree inference from whole-genome alignments.卡斯特:从全基因组比对中直接推断物种树。
Science. 2025 Feb 28;387(6737):eadk9688. doi: 10.1126/science.adk9688.
4
TREEasy: An automated workflow to infer gene trees, species trees, and phylogenetic networks from multilocus data.TREEasy:一种从多位点数据推断基因树、物种树和系统发育网络的自动化工作流程。
Mol Ecol Resour. 2020 May;20(3). doi: 10.1111/1755-0998.13149. Epub 2020 Mar 24.
5
Integrating gene annotation with orthology inference at scale.大规模整合基因注释与直系同源推断。
Science. 2023 Apr 28;380(6643):eabn3107. doi: 10.1126/science.abn3107.
6
Comparing species tree estimation with large anchored phylogenomic and small Sanger-sequenced molecular datasets: an empirical study on Malagasy pseudoxyrhophiine snakes.比较大型锚定系统发育基因组学和小型桑格测序分子数据集的物种树估计:马达加斯加伪蝰蛇的实证研究
BMC Evol Biol. 2015 Oct 12;15:221. doi: 10.1186/s12862-015-0503-1.
7
OrthoPhyl-streamlining large-scale, orthology-based phylogenomic studies of bacteria at broad evolutionary scales.OrthoPhyl——简化基于同源性的大规模细菌系统发育基因组学研究,以广泛的进化尺度为目标。
G3 (Bethesda). 2024 Aug 7;14(8). doi: 10.1093/g3journal/jkae119.
8
The Genomic Landscape, Causes, and Consequences of Extensive Phylogenomic Discordance in Murine Rodents.小鼠啮齿动物中广泛的系统发育基因组不一致的基因组格局、原因及后果
Genome Biol Evol. 2025 Feb 3;17(2). doi: 10.1093/gbe/evaf017.
9
Generation of accurate, expandable phylogenomic trees with uDance.使用 uDance 生成准确、可扩展的系统发育基因组树。
Nat Biotechnol. 2024 May;42(5):768-777. doi: 10.1038/s41587-023-01868-8. Epub 2023 Jul 27.
10
Maximum likelihood estimates of species trees: how accuracy of phylogenetic inference depends upon the divergence history and sampling design.最大似然估计物种树:系统发育推断的准确性如何取决于分歧历史和采样设计。
Syst Biol. 2009 Oct;58(5):501-8. doi: 10.1093/sysbio/syp045. Epub 2009 Aug 20.

引用本文的文献

1
Poplar: a phylogenomics pipeline.杨树:一种系统发育基因组学流程。
Bioinform Adv. 2025 May 6;5(1):vbaf104. doi: 10.1093/bioadv/vbaf104. eCollection 2025.

本文引用的文献

1
CASTER: Direct species tree inference from whole-genome alignments.卡斯特:从全基因组比对中直接推断物种树。
Science. 2025 Feb 28;387(6737):eadk9688. doi: 10.1126/science.adk9688.
2
Whole genome phylogenomics helps to resolve the phylogenetic position of the Zygothrica genus group (Diptera, Drosophilidae) and the causes of previous incongruences.全基因组系统发育基因组学有助于解决 Zygothrica 属群(双翅目,果蝇科)的系统发育位置以及先前不一致的原因。
Mol Phylogenet Evol. 2024 Oct;199:108158. doi: 10.1016/j.ympev.2024.108158. Epub 2024 Jul 17.
3
Phylogenomics corroborates morphology: New discussions on the systematics of Trichostomatia (Ciliophora, Litostomatea).
系统发生基因组学证实了形态学:关于 Trichostomatia(纤毛门、Litostomatea)系统分类学的新讨论。
Eur J Protistol. 2024 Aug;95:126093. doi: 10.1016/j.ejop.2024.126093. Epub 2024 May 24.
4
A genome-informed higher rank classification of the biotechnologically important fungal subphylum .基于基因组信息的生物技术重要真菌亚门的高级分类
Stud Mycol. 2023 Jun;105:1-22. doi: 10.3114/sim.2023.105.01. Epub 2023 May 25.
5
Interactive Tree of Life (iTOL) v6: recent updates to the phylogenetic tree display and annotation tool.交互式生命树 (iTOL) v6:系统发育树显示和注释工具的最新更新。
Nucleic Acids Res. 2024 Jul 5;52(W1):W78-W82. doi: 10.1093/nar/gkae268.
6
Complexity of avian evolution revealed by family-level genomes.鸟类进化的复杂性由家族水平基因组揭示。
Nature. 2024 May;629(8013):851-860. doi: 10.1038/s41586-024-07323-1. Epub 2024 Apr 1.
7
A region of suppressed recombination misleads neoavian phylogenomics.一个重组抑制区域误导了新鸟类系统发生基因组学。
Proc Natl Acad Sci U S A. 2024 Apr 9;121(15):e2319506121. doi: 10.1073/pnas.2319506121. Epub 2024 Apr 1.
8
Genome assemblies of 11 bamboo species highlight diversification induced by dynamic subgenome dominance.11 种竹子的基因组组装凸显了由动态亚基因组优势引起的多样化。
Nat Genet. 2024 Apr;56(4):710-720. doi: 10.1038/s41588-024-01683-0. Epub 2024 Mar 15.
9
Identification of constrained sequence elements across 239 primate genomes.在239个灵长类基因组中鉴定受限序列元件
Nature. 2024 Jan;625(7996):735-742. doi: 10.1038/s41586-023-06798-8. Epub 2023 Nov 29.
10
OMA orthology in 2024: improved prokaryote coverage, ancestral and extant GO enrichment, a revamped synteny viewer and more in the OMA Ecosystem.2024 年的 OMA 同源物:改进的原核生物覆盖范围、祖先和现存 GO 富集、重新设计的同线性视图以及更多的 OMA 生态系统。
Nucleic Acids Res. 2024 Jan 5;52(D1):D513-D521. doi: 10.1093/nar/gkad1020.