• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

病毒基因组和微生物16S rRNA的菌株解析从头宏基因组组装

Strain-resolved de-novo metagenomic assembly of viral genomes and microbial 16S rRNAs.

作者信息

Jochheim Annika, Jochheim Florian A, Kolodyazhnaya Alexandra, Morice Étienne, Steinegger Martin, Söding Johannes

机构信息

Quantitative and Computational Biology, Max-Planck Institute for Multidisciplinary Sciences, Göttingen, Germany.

International Max-Planck Research School for Genome Sciences, University of Göttingen, Göttingen, Germany.

出版信息

Microbiome. 2024 Oct 1;12(1):187. doi: 10.1186/s40168-024-01904-y.

DOI:10.1186/s40168-024-01904-y
PMID:39354646
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11443906/
Abstract

BACKGROUND

Metagenomics is a powerful approach to study environmental and human-associated microbial communities and, in particular, the role of viruses in shaping them. Viral genomes are challenging to assemble from metagenomic samples due to their genomic diversity caused by high mutation rates. In the standard de Bruijn graph assemblers, this genomic diversity leads to complex k-mer assembly graphs with a plethora of loops and bulges that are challenging to resolve into strains or haplotypes because variants more than the k-mer size apart cannot be phased. In contrast, overlap assemblers can phase variants as long as they are covered by a single read.

RESULTS

Here, we present PenguiN, a software for strain resolved assembly of viral DNA and RNA genomes and bacterial 16S rRNA from shotgun metagenomics. Its exhaustive detection of all read overlaps in linear time combined with a Bayesian model to select strain-resolved extensions allow it to assemble longer viral contigs, less fragmented genomes, and more strains than existing assembly tools, on both real and simulated datasets. We show a 3-40-fold increase in complete viral genomes and a 6-fold increase in bacterial 16S rRNA genes.

CONCLUSION

PenguiN is the first overlap-based assembler for viral genome and 16S rRNA assembly from large and complex metagenomic datasets, which we hope will facilitate studying the key roles of viruses in microbial communities. Video Abstract.

摘要

背景

宏基因组学是研究环境和人类相关微生物群落的有力方法,尤其适用于研究病毒在塑造这些群落中的作用。由于病毒基因组因高突变率而具有基因组多样性,从宏基因组样本中组装病毒基因组具有挑战性。在标准的德布鲁因图组装器中,这种基因组多样性会导致复杂的k-mer组装图,其中有大量的环和凸起,难以解析为菌株或单倍型,因为距离超过k-mer大小的变体无法进行定相。相比之下,重叠组装器可以对只要被单条读段覆盖的变体进行定相。

结果

在此,我们展示了PenguiN,这是一款用于从鸟枪法宏基因组学中解析病毒DNA和RNA基因组以及细菌16S rRNA菌株的软件。它在线性时间内对所有读段重叠进行详尽检测,并结合贝叶斯模型来选择解析菌株的延伸,这使得它在真实和模拟数据集上都能比现有组装工具组装出更长的病毒重叠群、片段化程度更低的基因组以及更多的菌株。我们展示了完整病毒基因组数量增加了3至40倍,细菌16S rRNA基因数量增加了6倍。

结论

PenguiN是首个基于重叠的用于从大型复杂宏基因组数据集中组装病毒基因组和16S rRNA的组装器,我们希望它将有助于研究病毒在微生物群落中的关键作用。视频摘要。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d2c5/11443906/159ab952e65f/40168_2024_1904_Fig7_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d2c5/11443906/acc5758864e3/40168_2024_1904_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d2c5/11443906/e3305d89b61b/40168_2024_1904_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d2c5/11443906/dfec9cdb7052/40168_2024_1904_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d2c5/11443906/876454d3bf28/40168_2024_1904_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d2c5/11443906/7321cfd11435/40168_2024_1904_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d2c5/11443906/735cc3f2560f/40168_2024_1904_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d2c5/11443906/159ab952e65f/40168_2024_1904_Fig7_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d2c5/11443906/acc5758864e3/40168_2024_1904_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d2c5/11443906/e3305d89b61b/40168_2024_1904_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d2c5/11443906/dfec9cdb7052/40168_2024_1904_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d2c5/11443906/876454d3bf28/40168_2024_1904_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d2c5/11443906/7321cfd11435/40168_2024_1904_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d2c5/11443906/735cc3f2560f/40168_2024_1904_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d2c5/11443906/159ab952e65f/40168_2024_1904_Fig7_HTML.jpg

相似文献

1
Strain-resolved de-novo metagenomic assembly of viral genomes and microbial 16S rRNAs.病毒基因组和微生物16S rRNA的菌株解析从头宏基因组组装
Microbiome. 2024 Oct 1;12(1):187. doi: 10.1186/s40168-024-01904-y.
2
Long-read based de novo assembly of low-complexity metagenome samples results in finished genomes and reveals insights into strain diversity and an active phage system.基于长读长测序的从头组装方法可用于低复杂度宏基因组样本,从而获得完成的基因组,并深入了解菌株多样性和活跃的噬菌体系统。
BMC Microbiol. 2019 Jun 25;19(1):143. doi: 10.1186/s12866-019-1500-0.
3
Comparison of different assembly and annotation tools on analysis of simulated viral metagenomic communities in the gut.比较不同的组装和注释工具在分析肠道中模拟病毒宏基因组群落中的应用。
BMC Genomics. 2014 Jan 18;15:37. doi: 10.1186/1471-2164-15-37.
4
Improved Assembly of Metagenome-Assembled Genomes and Viruses in Tibetan Saline Lake Sediment by HiFi Metagenomic Sequencing.通过 HiFi 宏基因组测序提高西藏盐湖沉积物宏基因组组装和病毒组装。
Microbiol Spectr. 2023 Feb 14;11(1):e0332822. doi: 10.1128/spectrum.03328-22. Epub 2022 Dec 8.
5
LMAS: evaluating metagenomic short de novo assembly methods through defined communities.LMAS:通过定义的群落评估宏基因组短从头组装方法。
Gigascience. 2022 Dec 28;12. doi: 10.1093/gigascience/giac122.
6
Evaluation of short read metagenomic assembly.短读宏基因组组装评估。
BMC Genomics. 2011;12 Suppl 2(Suppl 2):S8. doi: 10.1186/1471-2164-12-S2-S8. Epub 2011 Jul 27.
7
Fragmentation and Coverage Variation in Viral Metagenome Assemblies, and Their Effect in Diversity Calculations.病毒宏基因组组装中的碎片化和覆盖度变化,及其对多样性计算的影响。
Front Bioeng Biotechnol. 2015 Sep 17;3:141. doi: 10.3389/fbioe.2015.00141. eCollection 2015.
8
MetaVelvet: an extension of Velvet assembler to de novo metagenome assembly from short sequence reads.MetaVelvet:Velvet 组装器的扩展,用于从短序列读取进行从头宏基因组组装。
Nucleic Acids Res. 2012 Nov 1;40(20):e155. doi: 10.1093/nar/gks678. Epub 2012 Jul 19.
9
Decomposing a San Francisco estuary microbiome using long-read metagenomics reveals species- and strain-level dominance from picoeukaryotes to viruses.利用长读长宏基因组学分解旧金山河口微生物组,揭示了从微微型真核生物到病毒的种属和菌株水平的优势。
mSystems. 2024 Sep 17;9(9):e0024224. doi: 10.1128/msystems.00242-24. Epub 2024 Aug 19.
10
An ensemble strategy that significantly improves de novo assembly of microbial genomes from metagenomic next-generation sequencing data.一种可显著改进从宏基因组新一代测序数据中进行微生物基因组从头组装的整合策略。
Nucleic Acids Res. 2015 Apr 20;43(7):e46. doi: 10.1093/nar/gkv002. Epub 2015 Jan 13.

引用本文的文献

1
High-resolution metagenome assembly for modern long reads with myloasm.利用肌浆瘤对现代长读长进行高分辨率宏基因组组装。
bioRxiv. 2025 Sep 6:2025.09.05.674543. doi: 10.1101/2025.09.05.674543.

本文引用的文献

1
How low can you go? Short-read polishing of Oxford Nanopore bacterial genome assemblies.能降到多低?牛津纳米孔细菌基因组组装的短读补洞。
Microb Genom. 2024 Jun;10(6). doi: 10.1099/mgen.0.001254.
2
ViPRA-Haplo: De Novo Reconstruction of Viral Populations Using Paired End Sequencing Data.ViPRA-Haplo:利用配对末端测序数据进行病毒群体的从头重建。
IEEE/ACM Trans Comput Biol Bioinform. 2024 May-Jun;21(3):492-500. doi: 10.1109/TCBB.2024.3374595. Epub 2024 Jun 5.
3
Phables: from fragmented assemblies to high-quality bacteriophage genomes.
噬菌体:从碎片化组装到高质量噬菌体基因组。
Bioinformatics. 2023 Oct 3;39(10). doi: 10.1093/bioinformatics/btad586.
4
Co-transfer of functionally interdependent genes contributes to genome mosaicism in lambdoid phages.功能上相互依赖的基因的共转移导致了λ噬菌体的基因组镶嵌现象。
Microb Genom. 2022 Nov;8(11). doi: 10.1099/mgen.0.000915.
5
Metagenomic analysis reveals unexplored diversity of archaeal virome in the human gut.宏基因组分析揭示了人类肠道中未被探索的古菌病毒组多样性。
Nat Commun. 2022 Dec 29;13(1):7978. doi: 10.1038/s41467-022-35735-y.
6
The global virome: How much diversity and how many independent origins?全球病毒组:有多少多样性以及多少独立起源?
Environ Microbiol. 2023 Jan;25(1):40-44. doi: 10.1111/1462-2920.16207. Epub 2022 Sep 29.
7
Opportunities and Challenges of Data-Driven Virus Discovery.数据驱动的病毒发现的机遇与挑战。
Biomolecules. 2022 Aug 4;12(8):1073. doi: 10.3390/biom12081073.
8
Critical Assessment of Metagenome Interpretation: the second round of challenges.宏基因组解读的关键评估:第二轮挑战。
Nat Methods. 2022 Apr;19(4):429-440. doi: 10.1038/s41592-022-01431-4. Epub 2022 Apr 8.
9
Cryptic and abundant marine viruses at the evolutionary origins of Earth's RNA virome.地球 RNA 病毒组进化起源处的隐匿且丰富的海洋病毒。
Science. 2022 Apr 8;376(6589):156-162. doi: 10.1126/science.abm5847. Epub 2022 Apr 7.
10
Discovery, diversity, and functional associations of crAss-like phages in human gut metagenomes from four Dutch cohorts.在来自四个荷兰队列的人类肠道宏基因组中发现、多样性和 crAss 样噬菌体的功能关联。
Cell Rep. 2022 Jan 11;38(2):110204. doi: 10.1016/j.celrep.2021.110204.