• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

WASTER:基于低覆盖度短读长的实用系统发育基因组学

WASTER: Practical phylogenomics from low-coverage short reads.

作者信息

Zhang Chao, Nielsen Rasmus

机构信息

Globe Institute, University of Copenhagen, Øster Voldgade 5-7, Copenhagen, 1350, Denmark.

Department of Integrative Biology and Department of Statistics, University of California Berkeley, 110 Sproul Hall, Berkeley, 94704, CA, USA.

出版信息

bioRxiv. 2025 Jan 24:2025.01.20.633983. doi: 10.1101/2025.01.20.633983.

DOI:10.1101/2025.01.20.633983
PMID:39896589
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11785061/
Abstract

The advent of affordable whole-genome sequencing has spurred numerous large-scale projects aimed at inferring the tree of life, yet achieving a complete species-level phylogeny remains a distant goal due to significant costs and computational demands. Traditional species tree inference methods, though effective, are hampered by the need for high-coverage sequencing, high-quality genomic alignments, and extensive computational resources. To address these challenges, this study introduces WASTER, a novel tool for inferring species trees directly from short-read sequences. WASTER employs a k-mer based approach for identifying variable sites, circumventing the need for genome assembly and alignment. Using simulations, we demonstrate that WASTER achieves accuracy comparable to that of traditional alignment-based methods, even for low sequencing depth, and has substantially higher accuracy than other alignment-free methods. We validate WASTER's efficacy on real data, where it accurately reconstructs phylogenies of eukaryotic species with as low depth as 1.5X. WASTER provides a fast and efficient solution for phylogeny estimation in cases where genome assembly and/or alignment may bias analyses or is challenging, for example due to low sequencing depth. It also provides a method for generating guide trees for tree-based alignment algorithms. WASTER's ability to accurately estimate trees from low-coverage sequencing data without relying on assembly and alignment will lead to substantially reduced sequencing and computational costs in phylogenomic projects.

摘要

价格亲民的全基因组测序技术的出现推动了众多旨在推断生命树的大型项目,但由于成本高昂和计算需求巨大,要实现完整的物种水平系统发育仍然是一个遥不可及的目标。传统的物种树推断方法虽然有效,但因需要高覆盖率测序、高质量的基因组比对以及大量计算资源而受到限制。为应对这些挑战,本研究引入了WASTER,这是一种直接从短读序列推断物种树的新型工具。WASTER采用基于k-mer的方法来识别可变位点,无需进行基因组组装和比对。通过模拟,我们证明即使在低测序深度的情况下,WASTER的准确性也与传统的基于比对的方法相当,并且比其他无比对方法的准确性高得多。我们在真实数据上验证了WASTER的有效性,它能够准确重建深度低至1.5X的真核生物物种的系统发育。在基因组组装和/或比对可能会使分析产生偏差或具有挑战性的情况下,例如由于测序深度低,WASTER为系统发育估计提供了一种快速有效的解决方案。它还提供了一种为基于树的比对算法生成引导树的方法。WASTER能够在不依赖组装和比对的情况下从低覆盖率测序数据中准确估计树,这将大幅降低系统发育基因组学项目中的测序和计算成本。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d22d/12233391/fbadb57d8103/nihpp-2025.01.20.633983v2-f0013.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d22d/12233391/0d6f833f69ae/nihpp-2025.01.20.633983v2-f0009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d22d/12233391/19984c93e295/nihpp-2025.01.20.633983v2-f0010.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d22d/12233391/457c0bcb2675/nihpp-2025.01.20.633983v2-f0011.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d22d/12233391/dfb7ff674134/nihpp-2025.01.20.633983v2-f0012.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d22d/12233391/fbadb57d8103/nihpp-2025.01.20.633983v2-f0013.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d22d/12233391/0d6f833f69ae/nihpp-2025.01.20.633983v2-f0009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d22d/12233391/19984c93e295/nihpp-2025.01.20.633983v2-f0010.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d22d/12233391/457c0bcb2675/nihpp-2025.01.20.633983v2-f0011.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d22d/12233391/dfb7ff674134/nihpp-2025.01.20.633983v2-f0012.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d22d/12233391/fbadb57d8103/nihpp-2025.01.20.633983v2-f0013.jpg

相似文献

1
WASTER: Practical phylogenomics from low-coverage short reads.WASTER:基于低覆盖度短读长的实用系统发育基因组学
bioRxiv. 2025 Jan 24:2025.01.20.633983. doi: 10.1101/2025.01.20.633983.
2
Can a Liquid Biopsy Detect Circulating Tumor DNA With Low-passage Whole-genome Sequencing in Patients With a Sarcoma? A Pilot Evaluation.液体活检能否通过低深度全基因组测序检测肉瘤患者的循环肿瘤DNA?一项初步评估。
Clin Orthop Relat Res. 2025 Jan 1;483(1):39-48. doi: 10.1097/CORR.0000000000003161. Epub 2024 Jun 21.
3
Systemic Inflammatory Response Syndrome全身炎症反应综合征
4
Systemic pharmacological treatments for chronic plaque psoriasis: a network meta-analysis.系统性药理学治疗慢性斑块状银屑病:网络荟萃分析。
Cochrane Database Syst Rev. 2021 Apr 19;4(4):CD011535. doi: 10.1002/14651858.CD011535.pub4.
5
Systemic pharmacological treatments for chronic plaque psoriasis: a network meta-analysis.慢性斑块状银屑病的全身药理学治疗:一项网状Meta分析。
Cochrane Database Syst Rev. 2020 Jan 9;1(1):CD011535. doi: 10.1002/14651858.CD011535.pub3.
6
A rapid and systematic review of the clinical effectiveness and cost-effectiveness of paclitaxel, docetaxel, gemcitabine and vinorelbine in non-small-cell lung cancer.对紫杉醇、多西他赛、吉西他滨和长春瑞滨在非小细胞肺癌中的临床疗效和成本效益进行的快速系统评价。
Health Technol Assess. 2001;5(32):1-195. doi: 10.3310/hta5320.
7
Conservative, physical and surgical interventions for managing faecal incontinence and constipation in adults with central neurological diseases.保守治疗、物理治疗和手术干预用于治疗伴有中枢神经系统疾病的成年人的粪便失禁和便秘。
Cochrane Database Syst Rev. 2024 Oct 29;10(10):CD002115. doi: 10.1002/14651858.CD002115.pub6.
8
Interventions to reduce harm from continued tobacco use.减少持续吸烟危害的干预措施。
Cochrane Database Syst Rev. 2016 Oct 13;10(10):CD005231. doi: 10.1002/14651858.CD005231.pub3.
9
Systemic pharmacological treatments for chronic plaque psoriasis: a network meta-analysis.慢性斑块状银屑病的全身药理学治疗:一项网状荟萃分析。
Cochrane Database Syst Rev. 2017 Dec 22;12(12):CD011535. doi: 10.1002/14651858.CD011535.pub2.
10
Withdrawal or continuation of cholinesterase inhibitors or memantine or both, in people with dementia.在痴呆症患者中,停用或继续使用胆碱酯酶抑制剂、美金刚或两者。
Cochrane Database Syst Rev. 2021 Feb 3;2(2):CD009081. doi: 10.1002/14651858.CD009081.pub2.

本文引用的文献

1
CASTER: Direct species tree inference from whole-genome alignments.卡斯特:从全基因组比对中直接推断物种树。
Science. 2025 Feb 28;387(6737):eadk9688. doi: 10.1126/science.adk9688.
2
Phylogenetic and taxonomic updates of , with an emphasis on .关于……的系统发育和分类学更新,重点在于……
Mycology. 2023 Nov 7;15(2):180-209. doi: 10.1080/21501203.2023.2263031. eCollection 2024.
3
Complexity of avian evolution revealed by family-level genomes.鸟类进化的复杂性由家族水平基因组揭示。
Nature. 2024 May;629(8013):851-860. doi: 10.1038/s41586-024-07323-1. Epub 2024 Apr 1.
4
Phylogenomic analyses provide insights into primate evolution.系统发生基因组分析为灵长类动物的进化提供了新视角。
Science. 2023 Jun 2;380(6648):913-924. doi: 10.1126/science.abn6919. Epub 2023 Jun 1.
5
A genomic timescale for placental mammal evolution.胎盘哺乳动物进化的基因组时间尺度。
Science. 2023 Apr 28;380(6643):eabl8189. doi: 10.1126/science.abl8189.
6
Inference of phylogenetic trees directly from raw sequencing reads using Read2Tree.使用 Read2Tree 从原始测序reads 直接推断系统发育树。
Nat Biotechnol. 2024 Jan;42(1):139-147. doi: 10.1038/s41587-023-01753-4. Epub 2023 Apr 20.
7
The evolution and diversification of oakleaf butterflies.栎树蛱蝶的演化和多样化。
Cell. 2022 Aug 18;185(17):3138-3152.e20. doi: 10.1016/j.cell.2022.06.042. Epub 2022 Aug 3.
8
Molecular and morphological analyses clarify species delimitation in section Costatae and reveal Betula buggsii sp. nov. (sect. Costatae, Betulaceae) in China.分子和形态分析阐明了 Costatae 组内的物种界限,并在中国揭示了新种 Betula buggsii sp. nov.(Betulaceae 科 Costatae 组)。
Ann Bot. 2022 Mar 23;129(4):415-428. doi: 10.1093/aob/mcac001.
9
Efficient ancestry and mutation simulation with msprime 1.0.利用 msprime 1.0 进行高效的祖先和突变模拟。
Genetics. 2022 Mar 3;220(3). doi: 10.1093/genetics/iyab229.
10
Resolving phylogeny and polyploid parentage using genus-wide genome-wide sequence data from birch trees.利用桦树属全基因组序列数据解决系统发育和多倍体亲代问题。
Mol Phylogenet Evol. 2021 Jul;160:107126. doi: 10.1016/j.ympev.2021.107126. Epub 2021 Feb 27.