• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

在跨测序平台和分箱模式的真实数据集上对宏基因组分箱工具进行基准测试。

Benchmarking metagenomic binning tools on real datasets across sequencing platforms and binning modes.

作者信息

Han Haitao, Wang Ziye, Zhu Shanfeng

机构信息

Institute of Science and Technology for Brain-Inspired Intelligence and MOE Frontiers Center for Brain Science, Fudan University, Shanghai, China.

School of Mathematical Sciences and LPMC, Nankai University, Tianjin, China.

出版信息

Nat Commun. 2025 Mar 24;16(1):2865. doi: 10.1038/s41467-025-57957-6.

DOI:10.1038/s41467-025-57957-6
PMID:40128535
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11933696/
Abstract

Metagenomic binning is a culture-free approach that facilitates the recovery of metagenome-assembled genomes by grouping genomic fragments. However, there remains a lack of a comprehensive benchmark to evaluate the performance of metagenomic binning tools across various combinations of data types and binning modes. In this study, we benchmark 13 metagenomic binning tools using short-read, long-read, and hybrid data under co-assembly, single-sample, and multi-sample binning, respectively. The benchmark results demonstrate that multi-sample binning exhibits optimal performance across short-read, long-read, and hybrid data. Moreover, multi-sample binning outperforms other binning modes in identifying potential antibiotic resistance gene hosts and near-complete strains containing potential biosynthetic gene clusters across diverse data types. This study also recommends three efficient binners across all data-binning combinations, as well as high-performance binners for each combination.

摘要

宏基因组分箱是一种无需培养的方法,通过对基因组片段进行分组来促进宏基因组组装基因组的恢复。然而,目前仍缺乏一个全面的基准来评估宏基因组分箱工具在各种数据类型和分箱模式组合下的性能。在本研究中,我们分别使用短读长、长读长和混合数据,在共组装、单样本和多样本分箱模式下,对13种宏基因组分箱工具进行了基准测试。基准测试结果表明,多样本分箱在短读长、长读长和混合数据上均表现出最佳性能。此外,在识别潜在抗生素抗性基因宿主以及跨不同数据类型包含潜在生物合成基因簇的近完整菌株方面,多样本分箱优于其他分箱模式。本研究还推荐了适用于所有数据分箱组合的三种高效分箱工具,以及适用于每种组合的高性能分箱工具。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a0cd/11933696/cb74c9cd03a7/41467_2025_57957_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a0cd/11933696/45991ed75289/41467_2025_57957_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a0cd/11933696/154fbb9833c8/41467_2025_57957_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a0cd/11933696/8901e0b9b583/41467_2025_57957_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a0cd/11933696/cb74c9cd03a7/41467_2025_57957_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a0cd/11933696/45991ed75289/41467_2025_57957_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a0cd/11933696/154fbb9833c8/41467_2025_57957_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a0cd/11933696/8901e0b9b583/41467_2025_57957_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a0cd/11933696/cb74c9cd03a7/41467_2025_57957_Fig4_HTML.jpg

相似文献

1
Benchmarking metagenomic binning tools on real datasets across sequencing platforms and binning modes.在跨测序平台和分箱模式的真实数据集上对宏基因组分箱工具进行基准测试。
Nat Commun. 2025 Mar 24;16(1):2865. doi: 10.1038/s41467-025-57957-6.
2
Evaluating metagenomics tools for genome binning with real metagenomic datasets and CAMI datasets.评估宏基因组工具在真实宏基因组数据集和 CAMI 数据集上的基因组 binning 效果。
BMC Bioinformatics. 2020 Jul 28;21(1):334. doi: 10.1186/s12859-020-03667-3.
3
Benchmarking genome assembly methods on metagenomic sequencing data.基于宏基因组测序数据对基因组组装方法进行基准测试。
Brief Bioinform. 2023 Mar 19;24(2). doi: 10.1093/bib/bbad087.
4
Complementary insights into gut viral genomes: a comparative benchmark of short- and long-read metagenomes using diverse assemblers and binners.对肠道病毒基因组的补充见解:使用不同组装器和分箱器对短读长和长读长宏基因组进行的比较基准测试。
Microbiome. 2024 Dec 20;12(1):260. doi: 10.1186/s40168-024-01981-z.
5
Effective binning of metagenomic contigs using contrastive multi-view representation learning.利用对比多视图表示学习有效分类宏基因组 contigs。
Nat Commun. 2024 Jan 17;15(1):585. doi: 10.1038/s41467-023-44290-z.
6
Fairy: fast approximate coverage for multi-sample metagenomic binning. Fairy:用于多样本宏基因组 bin 快速近似覆盖的方法。
Microbiome. 2024 Aug 14;12(1):151. doi: 10.1186/s40168-024-01861-6.
7
Simple, reference-independent assessment to empirically guide correction and polishing of hybrid microbial community metagenomic assembly.简单、无需参考的评估方法,可实际指导混合微生物群落宏基因组组装的纠错和优化。
PeerJ. 2024 Nov 8;12:e18132. doi: 10.7717/peerj.18132. eCollection 2024.
8
METAMVGL: a multi-view graph-based metagenomic contig binning algorithm by integrating assembly and paired-end graphs.METAMVGL:一种基于多视图图的宏基因组序列拼接 bin 算法,通过整合组装图和配对末端图。
BMC Bioinformatics. 2021 Jul 22;22(Suppl 10):378. doi: 10.1186/s12859-021-04284-4.
9
A survey on computational strategies for genome-resolved gut metagenomics.肠道宏基因组的计算策略研究综述
Brief Bioinform. 2023 May 19;24(3). doi: 10.1093/bib/bbad162.
10
Accurate Binning of Metagenomic Contigs Using Composition, Coverage, and Assembly Graphs.基于组成、覆盖度和组装图对宏基因组序列进行精确分箱。
J Comput Biol. 2022 Dec;29(12):1357-1376. doi: 10.1089/cmb.2022.0262. Epub 2022 Nov 11.

引用本文的文献

1
Evaluation of shotgun metagenomics as a diagnostic tool for infectious gastroenteritis.鸟枪法宏基因组学作为感染性肠胃炎诊断工具的评估
PLoS One. 2025 Sep 2;20(9):e0331288. doi: 10.1371/journal.pone.0331288. eCollection 2025.

本文引用的文献

1
Effective binning of metagenomic contigs using contrastive multi-view representation learning.利用对比多视图表示学习有效分类宏基因组 contigs。
Nat Commun. 2024 Jan 17;15(1):585. doi: 10.1038/s41467-023-44290-z.
2
MetaCC allows scalable and integrative analyses of both long-read and short-read metagenomic Hi-C data.MetaCC 允许对长读长和短读短宏基因组 Hi-C 数据进行可扩展和综合分析。
Nat Commun. 2023 Oct 6;14(1):6231. doi: 10.1038/s41467-023-41209-6.
3
Quality MAGnified.品质放大。
Nat Rev Microbiol. 2023 Dec;21(12):771. doi: 10.1038/s41579-023-00981-4.
4
Microorganisms, climate change, and the Sustainable Development Goals: progress and challenges.微生物、气候变化与可持续发展目标:进展与挑战
Nat Rev Microbiol. 2023 Oct;21(10):622-623. doi: 10.1038/s41579-023-00953-8.
5
CheckM2: a rapid, scalable and accurate tool for assessing microbial genome quality using machine learning.CheckM2:一种使用机器学习快速、可扩展且准确评估微生物基因组质量的工具。
Nat Methods. 2023 Aug;20(8):1203-1212. doi: 10.1038/s41592-023-01940-w. Epub 2023 Jul 27.
6
SemiBin2: self-supervised contrastive learning leads to better MAGs for short- and long-read sequencing.半Bin2:自监督对比学习可提高短读长读测序的宏基因组组装质量。
Bioinformatics. 2023 Jun 30;39(39 Suppl 1):i21-i29. doi: 10.1093/bioinformatics/btad209.
7
A comparison of single-coverage and multi-coverage metagenomic binning reveals extensive hidden contamination.单覆盖度与多覆盖度宏基因组 bin 划分比较揭示了广泛存在的隐藏污染。
Nat Methods. 2023 Aug;20(8):1170-1173. doi: 10.1038/s41592-023-01934-8. Epub 2023 Jun 29.
8
Metagenomics reveals the habitat specificity of biosynthetic potential of secondary metabolites in global food fermentations.宏基因组学揭示了全球食品发酵中次生代谢物生物合成潜力的生境特异性。
Microbiome. 2023 May 20;11(1):115. doi: 10.1186/s40168-023-01536-8.
9
Comparing genomes recovered from time-series metagenomes using long- and short-read sequencing technologies.比较使用长读长和短读测序技术从时间序列宏基因组中恢复的基因组。
Microbiome. 2023 May 13;11(1):105. doi: 10.1186/s40168-023-01557-3.
10
A survey on computational strategies for genome-resolved gut metagenomics.肠道宏基因组的计算策略研究综述
Brief Bioinform. 2023 May 19;24(3). doi: 10.1093/bib/bbad162.