• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

对肠道病毒基因组的补充见解:使用不同组装器和分箱器对短读长和长读长宏基因组进行的比较基准测试。

Complementary insights into gut viral genomes: a comparative benchmark of short- and long-read metagenomes using diverse assemblers and binners.

作者信息

Wang Huarui, Sun Chuqing, Li Yun, Chen Jingchao, Zhao Xing-Ming, Chen Wei-Hua

机构信息

Key Laboratory of Molecular Biophysics of the Ministry of Education, Hubei Key Laboratory of Bioinformatics and Molecular Imaging, Department of Bioinformatics and Systems Biology, Center for Artificial Intelligence Biology, College of Life Science and Technology, Huazhong University of Science and Technology, Wuhan, Hubei, 430074, China.

Department of Neurology, Institute of Science and Technology for Brain-Inspired Intelligence, Zhongshan Hospitaland, Fudan University , Shanghai, 200433, China.

出版信息

Microbiome. 2024 Dec 20;12(1):260. doi: 10.1186/s40168-024-01981-z.

DOI:10.1186/s40168-024-01981-z
PMID:39707560
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11660840/
Abstract

BACKGROUND

Metagenome-assembled viral genomes have significantly advanced the discovery and characterization of the human gut virome. However, we lack a comparative assessment of assembly tools on the efficacy of viral genome identification, particularly across next-generation sequencing (NGS) and third-generation sequencing (TGS) data.

RESULTS

We evaluated the efficiency of NGS, TGS, and hybrid assemblers for viral genome discovery using 95 viral-like particle (VLP)-enriched fecal samples sequenced on both Illumina and PacBio platforms. MEGAHIT, metaFlye, and hybridSPAdes emerged as the optimal choices for NGS, TGS, and hybrid datasets, respectively. Notably, these assemblers recovered distinct viral genomes, demonstrating a remarkable degree of complementarity. By combining individual assembler results, we expanded the total number of nonredundant high-quality viral genomes by 4.83 ~ 21.7-fold compared to individual assemblers. Among them, viral genomes from NGS and TGS data have the least overlap, indicating the impact of data type on viral genome recovery. We also evaluated four binning methods, finding that CONCOCT incorporated more unrelated contigs into the same bins, while MetaBAT2, AVAMB, and vRhyme balanced inclusiveness and taxonomic consistency within bins.

CONCLUSIONS

Our findings highlight the challenges in metagenome-driven viral discovery, underscoring tool limitations. We advocate for combined use of multiple assemblers and sequencing technologies when feasible and highlight the urgent need for specialized tools tailored to gut virome assembly. This study contributes essential insights for advancing viral genome research in the context of gut metagenomics. Video Abstract.

摘要

背景

宏基因组组装的病毒基因组极大地推动了人类肠道病毒组的发现和特征描述。然而,我们缺乏对组装工具在病毒基因组识别效率方面的比较评估,尤其是在下一代测序(NGS)和第三代测序(TGS)数据方面。

结果

我们使用在Illumina和PacBio平台上测序的95个富含病毒样颗粒(VLP)的粪便样本,评估了NGS、TGS和混合组装器在病毒基因组发现方面的效率。MEGAHIT、metaFlye和hybridSPAdes分别成为NGS、TGS和混合数据集的最佳选择。值得注意的是,这些组装器恢复了不同的病毒基因组,显示出显著的互补程度。通过组合各个组装器的结果,与单个组装器相比,我们将非冗余高质量病毒基因组的总数扩大了4.83至21.7倍。其中,NGS和TGS数据中的病毒基因组重叠最少,表明数据类型对病毒基因组恢复的影响。我们还评估了四种分箱方法,发现CONCOCT将更多不相关的重叠群纳入同一箱中,而MetaBAT2、AVAMB和vRhyme在箱内平衡了包容性和分类一致性。

结论

我们的研究结果突出了宏基因组驱动的病毒发现中的挑战,强调了工具的局限性。我们主张在可行时联合使用多种组装器和测序技术,并强调迫切需要针对肠道病毒组组装的专门工具。这项研究为在肠道宏基因组学背景下推进病毒基因组研究提供了重要见解。视频摘要。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9707/11660840/c98241938aa8/40168_2024_1981_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9707/11660840/b8f3bdd8c6db/40168_2024_1981_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9707/11660840/1455b288a77d/40168_2024_1981_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9707/11660840/8ec624c194c2/40168_2024_1981_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9707/11660840/2bcbf5743a7a/40168_2024_1981_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9707/11660840/c98241938aa8/40168_2024_1981_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9707/11660840/b8f3bdd8c6db/40168_2024_1981_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9707/11660840/1455b288a77d/40168_2024_1981_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9707/11660840/8ec624c194c2/40168_2024_1981_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9707/11660840/2bcbf5743a7a/40168_2024_1981_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9707/11660840/c98241938aa8/40168_2024_1981_Fig5_HTML.jpg

相似文献

1
Complementary insights into gut viral genomes: a comparative benchmark of short- and long-read metagenomes using diverse assemblers and binners.对肠道病毒基因组的补充见解:使用不同组装器和分箱器对短读长和长读长宏基因组进行的比较基准测试。
Microbiome. 2024 Dec 20;12(1):260. doi: 10.1186/s40168-024-01981-z.
2
Optimizing and evaluating the reconstruction of Metagenome-assembled microbial genomes.优化和评估宏基因组组装微生物基因组的重建。
BMC Genomics. 2017 Nov 28;18(1):915. doi: 10.1186/s12864-017-4294-1.
3
Nanopore and Illumina sequencing reveal different viral populations from human gut samples.纳米孔和 Illumina 测序揭示了来自人类肠道样本的不同病毒群体。
Microb Genom. 2024 Apr;10(4). doi: 10.1099/mgen.0.001236.
4
Benchmarking genome assembly methods on metagenomic sequencing data.基于宏基因组测序数据对基因组组装方法进行基准测试。
Brief Bioinform. 2023 Mar 19;24(2). doi: 10.1093/bib/bbad087.
5
Long-Read Metagenomics Improves the Recovery of Viral Diversity from Complex Natural Marine Samples.长读宏基因组提高了从复杂自然海洋样本中病毒多样性的恢复。
mSystems. 2022 Jun 28;7(3):e0019222. doi: 10.1128/msystems.00192-22. Epub 2022 Jun 13.
6
Comparison of PCR versus PCR-Free DNA Library Preparation for Characterising the Human Faecal Virome.比较聚合酶链式反应与无聚合酶链式反应 DNA 文库制备在人类粪便病毒组学中的应用。
Viruses. 2021 Oct 18;13(10):2093. doi: 10.3390/v13102093.
7
Improved Assembly of Metagenome-Assembled Genomes and Viruses in Tibetan Saline Lake Sediment by HiFi Metagenomic Sequencing.通过 HiFi 宏基因组测序提高西藏盐湖沉积物宏基因组组装和病毒组装。
Microbiol Spectr. 2023 Feb 14;11(1):e0332822. doi: 10.1128/spectrum.03328-22. Epub 2022 Dec 8.
8
Benchmarking and Assessment of Eight Genome Assemblers on Viral Next-Generation Sequencing Data, Including the SARS-CoV-2.对包括 SARS-CoV-2 在内的病毒下一代测序数据的八种基因组组装器的基准测试和评估。
OMICS. 2022 Jul;26(7):372-381. doi: 10.1089/omi.2022.0042. Epub 2022 Jun 28.
9
Genome binning of viral entities from bulk metagenomics data.宏基因组数据中病毒类群的基因组分箱。
Nat Commun. 2022 Feb 18;13(1):965. doi: 10.1038/s41467-022-28581-5.
10
Evaluating metagenomics tools for genome binning with real metagenomic datasets and CAMI datasets.评估宏基因组工具在真实宏基因组数据集和 CAMI 数据集上的基因组 binning 效果。
BMC Bioinformatics. 2020 Jul 28;21(1):334. doi: 10.1186/s12859-020-03667-3.

引用本文的文献

1
Genome-resolved metagenomics from short-read sequencing data in the era of artificial intelligence.人工智能时代基于短读长测序数据的基因组解析宏基因组学
Funct Integr Genomics. 2025 Jun 10;25(1):124. doi: 10.1007/s10142-025-01625-x.

本文引用的文献

1
VirRep: a hybrid language representation learning framework for identifying viruses from human gut metagenomes.VirRep:一种混合语言表示学习框架,用于从人类肠道宏基因组中识别病毒。
Genome Biol. 2024 Jul 4;25(1):177. doi: 10.1186/s13059-024-03320-9.
2
Long-read powered viral metagenomics in the oligotrophic Sargasso Sea.寡营养马尾藻海的长读长病毒宏基因组学
Nat Commun. 2024 May 14;15(1):4089. doi: 10.1038/s41467-024-48300-6.
3
Nanopore and Illumina sequencing reveal different viral populations from human gut samples.纳米孔和 Illumina 测序揭示了来自人类肠道样本的不同病毒群体。
Microb Genom. 2024 Apr;10(4). doi: 10.1099/mgen.0.001236.
4
The long and short of it: benchmarking viromics using Illumina, Nanopore and PacBio sequencing technologies.简而言之:使用Illumina、Nanopore和PacBio测序技术对病毒组进行基准测试。
Microb Genom. 2024 Feb;10(2). doi: 10.1099/mgen.0.001198.
5
COBRA improves the completeness and contiguity of viral genomes assembled from metagenomes.COBRA 提高了宏基因组组装得到的病毒基因组的完整性和连续性。
Nat Microbiol. 2024 Mar;9(3):737-750. doi: 10.1038/s41564-023-01598-2. Epub 2024 Feb 6.
6
Efficient Recovery of Complete Gut Viral Genomes by Combined Short- and Long-Read Sequencing.通过短读长读联合测序实现完整肠道病毒基因组的高效回收。
Adv Sci (Weinh). 2024 Apr;11(13):e2305818. doi: 10.1002/advs.202305818. Epub 2024 Jan 19.
7
Adversarial and variational autoencoders improve metagenomic binning.对抗自编码器和变分自编码器改进了宏基因组分箱。
Commun Biol. 2023 Oct 21;6(1):1073. doi: 10.1038/s42003-023-05452-3.
8
Phables: from fragmented assemblies to high-quality bacteriophage genomes.噬菌体:从碎片化组装到高质量噬菌体基因组。
Bioinformatics. 2023 Oct 3;39(10). doi: 10.1093/bioinformatics/btad586.
9
A comparison of single-coverage and multi-coverage metagenomic binning reveals extensive hidden contamination.单覆盖度与多覆盖度宏基因组 bin 划分比较揭示了广泛存在的隐藏污染。
Nat Methods. 2023 Aug;20(8):1170-1173. doi: 10.1038/s41592-023-01934-8. Epub 2023 Jun 29.
10
The human gut virome: composition, colonization, interactions, and impacts on human health.人类肠道病毒组:组成、定殖、相互作用及其对人类健康的影响。
Front Microbiol. 2023 May 24;14:963173. doi: 10.3389/fmicb.2023.963173. eCollection 2023.