• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

增强基于长读长的菌株感知宏基因组组装

Enhancing Long-Read-Based Strain-Aware Metagenome Assembly.

作者信息

Luo Xiao, Kang Xiongbin, Schönhuth Alexander

机构信息

Genome Data Science, Faculty of Technology, Bielefeld University, Bielefeld, Germany.

Life Science and Health, Centrum Wiskunde and Informatica, Amsterdam, Netherlands.

出版信息

Front Genet. 2022 May 13;13:868280. doi: 10.3389/fgene.2022.868280. eCollection 2022.

DOI:10.3389/fgene.2022.868280
PMID:35646097
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC9136235/
Abstract

Microbial communities are usually highly diverse and often involve multiple strains from the participating species due to the rapid evolution of microorganisms. In such a complex microecosystem, different strains may show different biological functions. While reconstruction of individual genomes at the strain level is vital for accurately deciphering the composition of microbial communities, the problem has largely remained unresolved so far. Next-generation sequencing has been routinely used in metagenome assembly but there have been struggles to generate strain-specific genome sequences due to the short-read length. This explains why long-read sequencing technologies have recently provided unprecedented opportunities to carry out haplotype- or strain-resolved genome assembly. Here, we propose MetaBooster and MetaBooster-HiFi, as two pipelines for strain-aware metagenome assembly from PacBio CLR and Oxford Nanopore long-read sequencing data. Benchmarking experiments on both simulated and real sequencing data demonstrate that either the MetaBooster or the MetaBooster-HiFi pipeline drastically outperforms the state-of-the-art metagenome assemblers, in terms of all relevant metagenome assembly criteria, involving genome fraction, contig length, and error rates.

摘要

由于微生物的快速进化,微生物群落通常具有高度的多样性,并且往往涉及来自参与物种的多个菌株。在这样一个复杂的微生态系统中,不同的菌株可能表现出不同的生物学功能。虽然在菌株水平上重建个体基因组对于准确解读微生物群落的组成至关重要,但到目前为止,这个问题在很大程度上仍未得到解决。下一代测序已常规用于宏基因组组装,但由于读长较短,在生成菌株特异性基因组序列方面一直存在困难。这就解释了为什么长读长测序技术最近为进行单倍型或菌株解析的基因组组装提供了前所未有的机会。在这里,我们提出了MetaBooster和MetaBooster-HiFi这两种流程,用于从PacBio CLR和牛津纳米孔长读长测序数据中进行菌株感知的宏基因组组装。对模拟和真实测序数据的基准实验表明,无论是MetaBooster还是MetaBooster-HiFi流程,在所有相关的宏基因组组装标准方面,包括基因组比例、重叠群长度和错误率,都大大优于目前最先进的宏基因组组装工具。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/532e/9136235/bd71eab22ba5/fgene-13-868280-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/532e/9136235/bd71eab22ba5/fgene-13-868280-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/532e/9136235/bd71eab22ba5/fgene-13-868280-g001.jpg

相似文献

1
Enhancing Long-Read-Based Strain-Aware Metagenome Assembly.增强基于长读长的菌株感知宏基因组组装
Front Genet. 2022 May 13;13:868280. doi: 10.3389/fgene.2022.868280. eCollection 2022.
2
Evaluating long-read de novo assembly tools for eukaryotic genomes: insights and considerations.评估真核生物基因组的长读长从头组装工具:见解与考虑。
Gigascience. 2022 Dec 28;12. doi: 10.1093/gigascience/giad100. Epub 2023 Nov 24.
3
Advancing metagenome-assembled genome-based pathogen identification: unraveling the power of long-read assembly algorithms in Oxford Nanopore sequencing.推进宏基因组组装基因组为基础的病原体鉴定:揭示长读长组装算法在牛津纳米孔测序中的强大功能。
Microbiol Spectr. 2024 Jun 4;12(6):e0011724. doi: 10.1128/spectrum.00117-24. Epub 2024 Apr 30.
4
Benchmarking genome assembly methods on metagenomic sequencing data.基于宏基因组测序数据对基因组组装方法进行基准测试。
Brief Bioinform. 2023 Mar 19;24(2). doi: 10.1093/bib/bbad087.
5
Improved Assembly of Metagenome-Assembled Genomes and Viruses in Tibetan Saline Lake Sediment by HiFi Metagenomic Sequencing.通过 HiFi 宏基因组测序提高西藏盐湖沉积物宏基因组组装和病毒组装。
Microbiol Spectr. 2023 Feb 14;11(1):e0332822. doi: 10.1128/spectrum.03328-22. Epub 2022 Dec 8.
6
Long-read based de novo assembly of low-complexity metagenome samples results in finished genomes and reveals insights into strain diversity and an active phage system.基于长读长测序的从头组装方法可用于低复杂度宏基因组样本,从而获得完成的基因组,并深入了解菌株多样性和活跃的噬菌体系统。
BMC Microbiol. 2019 Jun 25;19(1):143. doi: 10.1186/s12866-019-1500-0.
7
Strainy: phasing and assembly of strain haplotypes from long-read metagenome sequencing.Strainy:从长读宏基因组测序中对菌株单倍型进行相位和组装。
Nat Methods. 2024 Nov;21(11):2034-2043. doi: 10.1038/s41592-024-02424-1. Epub 2024 Sep 26.
8
Benchmarking short-, long- and hybrid-read assemblers for metagenome sequencing of complex microbial communities.对用于复杂微生物群落宏基因组测序的短读长、长读长和混合读长组装器进行基准测试。
Microbiology (Reading). 2024 Jun;170(6). doi: 10.1099/mic.0.001469.
9
A comprehensive investigation of metagenome assembly by linked-read sequencing.基于链接读取测序的宏基因组组装综合研究。
Microbiome. 2020 Nov 11;8(1):156. doi: 10.1186/s40168-020-00929-3.
10
Benchmarking multi-platform sequencing technologies for human genome assembly.多平台测序技术在人类基因组组装中的基准测试。
Brief Bioinform. 2023 Sep 20;24(5). doi: 10.1093/bib/bbad300.

引用本文的文献

1
Repeat and haplotype aware error correction in nanopore sequencing reads with DeChat.使用DeChat对纳米孔测序读数进行重复和单倍型感知错误校正。
Commun Biol. 2024 Dec 19;7(1):1678. doi: 10.1038/s42003-024-07376-y.
2
Easing genomic surveillance: A comprehensive performance evaluation of long-read assemblers across multi-strain mixture data of HIV-1 and Other pathogenic viruses for constructing a user-friendly bioinformatic pipeline.简化基因组监测:针对 HIV-1 和其他病原性病毒的多菌株混合数据,对长读长组装器进行全面性能评估,以构建用户友好的生物信息学管道。
F1000Res. 2024 May 31;13:556. doi: 10.12688/f1000research.149577.1. eCollection 2024.
3

本文引用的文献

1
VeChat: correcting errors in long reads using variation graphs.VeChat:使用变异图谱纠正长读中的错误。
Nat Commun. 2022 Nov 4;13(1):6657. doi: 10.1038/s41467-022-34381-8.
2
Metagenome assembly of high-fidelity long reads with hifiasm-meta.利用 hifiasm-meta 进行高保真长读长的宏基因组组装。
Nat Methods. 2022 Jun;19(6):671-674. doi: 10.1038/s41592-022-01478-3. Epub 2022 May 9.
3
phasebook: haplotype-aware de novo assembly of diploid genomes from long reads.相位图:基于长读长的二倍体基因组单体型感知从头组装
Unveiling microbial diversity: harnessing long-read sequencing technology.
揭示微生物多样性:利用长读长测序技术
Nat Methods. 2024 Jun;21(6):954-966. doi: 10.1038/s41592-024-02262-1. Epub 2024 Apr 30.
4
Exploring Long-Read Metagenomics for Full Characterization of Shiga Toxin-Producing in Presence of Commensal .在共生菌存在的情况下,探索长读长宏基因组学以全面表征产志贺毒素菌。
Microorganisms. 2023 Aug 9;11(8):2043. doi: 10.3390/microorganisms11082043.
5
VeChat: correcting errors in long reads using variation graphs.VeChat:使用变异图谱纠正长读中的错误。
Nat Commun. 2022 Nov 4;13(1):6657. doi: 10.1038/s41467-022-34381-8.
Genome Biol. 2021 Oct 27;22(1):299. doi: 10.1186/s13059-021-02512-x.
4
Strainberry: automated strain separation in low-complexity metagenomes using long reads.Strainberry:使用长读长进行低复杂度宏基因组中菌株的自动分离。
Nat Commun. 2021 Jul 23;12(1):4485. doi: 10.1038/s41467-021-24515-9.
5
Towards complete and error-free genome assemblies of all vertebrate species.致力于完成所有脊椎动物物种的完整且无错误的基因组组装。
Nature. 2021 Apr;592(7856):737-746. doi: 10.1038/s41586-021-03451-0. Epub 2021 Apr 28.
6
Haplotype-resolved de novo assembly using phased assembly graphs with hifiasm.使用带有 hifiasm 的相定装配图进行单体型解析从头组装。
Nat Methods. 2021 Feb;18(2):170-175. doi: 10.1038/s41592-020-01056-5. Epub 2021 Feb 1.
7
On the complexity of haplotyping a microbial community.关于对微生物群落进行单倍型分型的复杂性
Bioinformatics. 2021 Jun 16;37(10):1360-1366. doi: 10.1093/bioinformatics/btaa977.
8
metaFlye: scalable long-read metagenome assembly using repeat graphs.metaFlye:使用重复图进行可扩展的长读长宏基因组组装。
Nat Methods. 2020 Nov;17(11):1103-1110. doi: 10.1038/s41592-020-00971-x. Epub 2020 Oct 5.
9
PBSIM2: a simulator for long-read sequencers with a novel generative model of quality scores.PBSIM2:一种带有新型质量评分生成模型的长读测序模拟软件。
Bioinformatics. 2021 May 5;37(5):589-595. doi: 10.1093/bioinformatics/btaa835.
10
Merqury: reference-free quality, completeness, and phasing assessment for genome assemblies.Merqury:基因组组装的无参考质量、完整性和相位评估。
Genome Biol. 2020 Sep 14;21(1):245. doi: 10.1186/s13059-020-02134-9.