• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

BASALT对宏基因组数据进行分箱优化,并提高基因组解析宏基因组分析的分辨率。

BASALT refines binning from metagenomic data and increases resolution of genome-resolved metagenomic analysis.

作者信息

Qiu Zhiguang, Yuan Li, Lian Chun-Ang, Lin Bin, Chen Jie, Mu Rong, Qiao Xuejiao, Zhang Liyu, Xu Zheng, Fan Lu, Zhang Yunzeng, Wang Shanquan, Li Junyi, Cao Huiluo, Li Bing, Chen Baowei, Song Chi, Liu Yongxin, Shi Lili, Tian Yonghong, Ni Jinren, Zhang Tong, Zhou Jizhong, Zhuang Wei-Qin, Yu Ke

机构信息

Eco-environment and Resource Efficiency Research Laboratory, School of Environment and Energy, Shenzhen Graduate School, Peking University, Shenzhen, China.

AI for Science (AI4S)-Preferred Program, Peking University, Shenzhen, China.

出版信息

Nat Commun. 2024 Mar 11;15(1):2179. doi: 10.1038/s41467-024-46539-7.

DOI:10.1038/s41467-024-46539-7
PMID:38467684
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC10928208/
Abstract

Metagenomic binning is an essential technique for genome-resolved characterization of uncultured microorganisms in various ecosystems but hampered by the low efficiency of binning tools in adequately recovering metagenome-assembled genomes (MAGs). Here, we introduce BASALT (Binning Across a Series of Assemblies Toolkit) for binning and refinement of short- and long-read sequencing data. BASALT employs multiple binners with multiple thresholds to produce initial bins, then utilizes neural networks to identify core sequences to remove redundant bins and refine non-redundant bins. Using the same assemblies generated from Critical Assessment of Metagenome Interpretation (CAMI) datasets, BASALT produces up to twice as many MAGs as VAMB, DASTool, or metaWRAP. Processing assemblies from a lake sediment dataset, BASALT produces ~30% more MAGs than metaWRAP, including 21 unique class-level prokaryotic lineages. Functional annotations reveal that BASALT can retrieve 47.6% more non-redundant opening-reading frames than metaWRAP. These results highlight the robust handling of metagenomic sequencing data of BASALT.

摘要

宏基因组分箱是对各种生态系统中未培养微生物进行基因组解析表征的一项重要技术,但受到分箱工具在充分回收宏基因组组装基因组(MAG)方面效率低下的阻碍。在此,我们介绍了用于对短读长和长读长测序数据进行分箱和优化的BASALT(跨系列组装分箱工具包)。BASALT采用多个具有多个阈值的分箱器来生成初始分箱,然后利用神经网络识别核心序列以去除冗余分箱并优化非冗余分箱。使用从宏基因组解释关键评估(CAMI)数据集生成的相同组装数据,BASALT产生的MAG数量是VAMB、DASTool或metaWRAP的两倍。处理来自湖泊沉积物数据集的组装数据时,BASALT产生的MAG比metaWRAP多约30%,包括21个独特的类级原核生物谱系。功能注释显示,BASALT比metaWRAP能多检索47.6%的非冗余开放阅读框。这些结果突出了BASALT对宏基因组测序数据的强大处理能力。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/dab0/10928208/ee3fe94cc36c/41467_2024_46539_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/dab0/10928208/5d388e130bd6/41467_2024_46539_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/dab0/10928208/af7ec2285076/41467_2024_46539_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/dab0/10928208/af5993050800/41467_2024_46539_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/dab0/10928208/14a3053cdbb0/41467_2024_46539_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/dab0/10928208/ee3fe94cc36c/41467_2024_46539_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/dab0/10928208/5d388e130bd6/41467_2024_46539_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/dab0/10928208/af7ec2285076/41467_2024_46539_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/dab0/10928208/af5993050800/41467_2024_46539_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/dab0/10928208/14a3053cdbb0/41467_2024_46539_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/dab0/10928208/ee3fe94cc36c/41467_2024_46539_Fig5_HTML.jpg

相似文献

1
BASALT refines binning from metagenomic data and increases resolution of genome-resolved metagenomic analysis.BASALT对宏基因组数据进行分箱优化,并提高基因组解析宏基因组分析的分辨率。
Nat Commun. 2024 Mar 11;15(1):2179. doi: 10.1038/s41467-024-46539-7.
2
Evaluating metagenomics tools for genome binning with real metagenomic datasets and CAMI datasets.评估宏基因组工具在真实宏基因组数据集和 CAMI 数据集上的基因组 binning 效果。
BMC Bioinformatics. 2020 Jul 28;21(1):334. doi: 10.1186/s12859-020-03667-3.
3
MetaWRAP-a flexible pipeline for genome-resolved metagenomic data analysis.MetaWRAP-一个用于基因组解析宏基因组数据分析的灵活管道。
Microbiome. 2018 Sep 15;6(1):158. doi: 10.1186/s40168-018-0541-1.
4
Evaluating Assembly and Binning Strategies for Time Series Drinking Water Metagenomes.评估时间序列饮用水宏基因组的组装和分类策略。
Microbiol Spectr. 2021 Dec 22;9(3):e0143421. doi: 10.1128/Spectrum.01434-21. Epub 2021 Nov 3.
5
HiFine: integrating Hi-C-based and shotgun-based methods to refine binning of metagenomic contigs.HiFine:整合基于 Hi-C 和 shotgun 的方法来优化宏基因组 contigs 的 bin 划分。
Bioinformatics. 2022 May 26;38(11):2973-2979. doi: 10.1093/bioinformatics/btac295.
6
ACR: metagenome-assembled prokaryotic and eukaryotic genome refinement tool.ACR:宏基因组组装原核生物和真核生物基因组精修工具。
Brief Bioinform. 2023 Sep 22;24(6). doi: 10.1093/bib/bbad381.
7
Constructing metagenome-assembled genomes for almost all components in a real bacterial consortium for binning benchmarking.为真实细菌群落中的几乎所有组件构建宏基因组组装基因组,用于分箱基准测试。
BMC Genomics. 2022 Nov 10;23(1):746. doi: 10.1186/s12864-022-08967-x.
8
Fairy: fast approximate coverage for multi-sample metagenomic binning. Fairy:用于多样本宏基因组 bin 快速近似覆盖的方法。
Microbiome. 2024 Aug 14;12(1):151. doi: 10.1186/s40168-024-01861-6.
9
Improved Assembly of Metagenome-Assembled Genomes and Viruses in Tibetan Saline Lake Sediment by HiFi Metagenomic Sequencing.通过 HiFi 宏基因组测序提高西藏盐湖沉积物宏基因组组装和病毒组装。
Microbiol Spectr. 2023 Feb 14;11(1):e0332822. doi: 10.1128/spectrum.03328-22. Epub 2022 Dec 8.
10
A survey on computational strategies for genome-resolved gut metagenomics.肠道宏基因组的计算策略研究综述
Brief Bioinform. 2023 May 19;24(3). doi: 10.1093/bib/bbad162.

引用本文的文献

1
Applying ecological principles to microbiome engineering.将生态学原理应用于微生物组工程。
Nat Microbiol. 2025 Sep;10(9):2111-2121. doi: 10.1038/s41564-025-02076-7. Epub 2025 Aug 5.
2
The potential of wastewater monitoring as a novel surveillance tool for early warning of outbreaks.废水监测作为疫情早期预警新型监测工具的潜力。
Emerg Microbes Infect. 2025 Dec;14(1):2528537. doi: 10.1080/22221751.2025.2528537. Epub 2025 Jul 10.
3
Genome-resolved metagenomics from short-read sequencing data in the era of artificial intelligence.

本文引用的文献

1
CheckM2: a rapid, scalable and accurate tool for assessing microbial genome quality using machine learning.CheckM2:一种使用机器学习快速、可扩展且准确评估微生物基因组质量的工具。
Nat Methods. 2023 Aug;20(8):1203-1212. doi: 10.1038/s41592-023-01940-w. Epub 2023 Jul 27.
2
Comparing genomes recovered from time-series metagenomes using long- and short-read sequencing technologies.比较使用长读长和短读测序技术从时间序列宏基因组中恢复的基因组。
Microbiome. 2023 May 13;11(1):105. doi: 10.1186/s40168-023-01557-3.
3
Insights into ecological roles of uncultivated bacteria in Katase hot spring sediment from long-read metagenomics.
人工智能时代基于短读长测序数据的基因组解析宏基因组学
Funct Integr Genomics. 2025 Jun 10;25(1):124. doi: 10.1007/s10142-025-01625-x.
4
Gut microbiota-mediated conversion of mangiferin to norathyriol alters short chain fatty acid and urate metabolism.肠道微生物群介导的芒果苷向诺米林的转化改变了短链脂肪酸和尿酸代谢。
Gut Microbes. 2025 Dec;17(1):2508422. doi: 10.1080/19490976.2025.2508422. Epub 2025 May 22.
5
Recovery of 26 metagenome-assembled genomes from the phycosphere of the marine diatom .从海洋硅藻的藻际环境中获得26个宏基因组组装基因组
Microbiol Resour Announc. 2025 Apr 10;14(4):e0101324. doi: 10.1128/mra.01013-24. Epub 2025 Mar 25.
6
Unveiling Diversity and Function: Venom-Associated Microbes in Two Spiders, Heteropoda venatoria and Chilobrachys guangxiensis.揭示多样性与功能:狩猎巨蟹蛛和广西 Chilobrachys guangxiensis 两种蜘蛛中与毒液相关的微生物
Microb Ecol. 2024 Dec 21;87(1):156. doi: 10.1007/s00248-024-02476-y.
7
Decoupling of strain- and intrastrain-level interactions of microbiomes in a sponge holobiont.海绵共生体中微生物组的菌株和菌株内相互作用的解耦。
Nat Commun. 2024 Sep 18;15(1):8205. doi: 10.1038/s41467-024-52464-6.
8
Solving genomic puzzles: computational methods for metagenomic binning.解决基因组难题:宏基因组 binning 的计算方法。
Brief Bioinform. 2024 Jul 25;25(5). doi: 10.1093/bib/bbae372.
通过长读长宏基因组学深入了解片濑温泉沉积物中未培养细菌的生态作用。
Front Microbiol. 2022 Nov 3;13:1045931. doi: 10.3389/fmicb.2022.1045931. eCollection 2022.
4
Improved microbial genomes and gene catalog of the chicken gut from metagenomic sequencing of high-fidelity long reads.基于高通量长读长测序的宏基因组分析,提高了鸡肠道微生物基因组和基因序列的完整性。
Gigascience. 2022 Nov 18;11. doi: 10.1093/gigascience/giac116.
5
HiFi metagenomic sequencing enables assembly of accurate and complete genomes from human gut microbiota.HiFi 宏基因组测序能够从人体肠道微生物组中组装出准确和完整的基因组。
Nat Commun. 2022 Oct 26;13(1):6367. doi: 10.1038/s41467-022-34149-0.
6
Oxford Nanopore R10.4 long-read sequencing enables the generation of near-finished bacterial genomes from pure cultures and metagenomes without short-read or reference polishing.牛津纳米孔 R10.4 长读测序能够从纯培养物和宏基因组中生成近乎完成的细菌基因组,而无需进行短读测序或参考序列优化。
Nat Methods. 2022 Jul;19(7):823-826. doi: 10.1038/s41592-022-01539-7. Epub 2022 Jul 4.
7
Short- and long-read metagenomics expand individualized structural variations in gut microbiomes.短读和长读宏基因组学扩展了肠道微生物组的个体化结构变异。
Nat Commun. 2022 Jun 8;13(1):3175. doi: 10.1038/s41467-022-30857-9.
8
Identifying synergistic high-order 3D chromatin conformations from genome-scale nanopore concatemer sequencing.从基因组规模的纳米孔串联测序中识别协同的高阶 3D 染色质构象。
Nat Biotechnol. 2022 Oct;40(10):1488-1499. doi: 10.1038/s41587-022-01289-z. Epub 2022 May 30.
9
CAMI II: identifying best practices and issues for metagenomics software.CAMI II:确定宏基因组学软件的最佳实践和问题
Nat Methods. 2022 Apr;19(4):412-413. doi: 10.1038/s41592-022-01419-0.
10
Critical Assessment of Metagenome Interpretation: the second round of challenges.宏基因组解读的关键评估:第二轮挑战。
Nat Methods. 2022 Apr;19(4):429-440. doi: 10.1038/s41592-022-01431-4. Epub 2022 Apr 8.