• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

我们对微生物暗物质的了解有多清晰?(重新)评估使用 MDMcleaner 的公共 MAG 和 SAG 数据集。

How clear is our current view on microbial dark matter? (Re-)assessing public MAG & SAG datasets with MDMcleaner.

机构信息

Institute for Biological Interfaces 5 (Institut für Biologische Grenzflächen IBG 5), Karlsruhe Institute of Technology (KIT) 76344, Eggenstein-Leopoldshafen, Germany.

出版信息

Nucleic Acids Res. 2022 Jul 22;50(13):e76. doi: 10.1093/nar/gkac294.

DOI:10.1093/nar/gkac294
PMID:35536293
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC9303271/
Abstract

As of today, the majority of environmental microorganisms remain uncultured and is therefore referred to as 'microbial dark matter' (MDM). Hence, genomic insights into these organisms are limited to cultivation-independent approaches such as single-cell- and metagenomics. However, without access to cultured representatives for verifying correct taxon-assignments, MDM genomes may cause potentially misleading conclusions based on misclassified or contaminant contigs, thereby obfuscating our view on the uncultured microbial majority. Moreover, gradual database contaminations by past genome submissions can cause error propagations which affect present as well as future comparative genome analyses. Consequently, strict contamination detection and filtering need to be applied, especially in the case of uncultured MDM genomes. Current genome reporting standards, however, emphasize completeness over purity and the de facto gold standard genome assessment tool, checkM, discriminates against uncultured taxa and fragmented genomes. To tackle these issues, we present a novel contig classification, screening, and filtering workflow and corresponding open-source python implementation called MDMcleaner, which was tested and compared to other tools on mock and real datasets. MDMcleaner revealed substantial contaminations overlooked by current screening approaches and sensitively detects misattributed contigs in both novel genomes and the underlying reference databases, thereby greatly improving our view on 'microbial dark matter'.

摘要

截至今天,大多数环境微生物仍未被培养,因此被称为“微生物暗物质”(MDM)。因此,对这些生物体的基因组的了解仅限于培养独立性方法,例如单细胞和宏基因组学。然而,如果无法获得用于验证正确分类群分配的培养代表,MDM 基因组可能会基于错误分类或污染的 contigs 导致潜在误导性的结论,从而模糊我们对未培养的微生物主要群体的看法。此外,过去基因组提交导致的数据库逐渐污染会引起影响当前和未来比较基因组分析的错误传播。因此,特别是在未培养的 MDM 基因组的情况下,需要应用严格的污染检测和过滤。然而,现行的基因组报告标准强调完整性而非纯度,事实上的黄金标准基因组评估工具 checkM 歧视未培养的分类群和碎片化的基因组。为了解决这些问题,我们提出了一种新的 contig 分类、筛选和过滤工作流程以及相应的开源 python 实现,称为 MDMcleaner,我们对其进行了测试,并在模拟和真实数据集上与其他工具进行了比较。MDMcleaner 揭示了当前筛选方法忽略的大量污染,并在新基因组和基础参考数据库中敏感地检测到错误归因的 contigs,从而大大提高了我们对“微生物暗物质”的认识。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4299/9303271/945e04c3b964/gkac294fig6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4299/9303271/6c402ad231ab/gkac294fig1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4299/9303271/6c7c110bbe77/gkac294fig2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4299/9303271/af278c6a2440/gkac294fig3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4299/9303271/5b0b4f0d9af1/gkac294fig4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4299/9303271/2e9a63639fcf/gkac294fig5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4299/9303271/945e04c3b964/gkac294fig6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4299/9303271/6c402ad231ab/gkac294fig1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4299/9303271/6c7c110bbe77/gkac294fig2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4299/9303271/af278c6a2440/gkac294fig3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4299/9303271/5b0b4f0d9af1/gkac294fig4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4299/9303271/2e9a63639fcf/gkac294fig5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4299/9303271/945e04c3b964/gkac294fig6.jpg

相似文献

1
How clear is our current view on microbial dark matter? (Re-)assessing public MAG & SAG datasets with MDMcleaner.我们对微生物暗物质的了解有多清晰?(重新)评估使用 MDMcleaner 的公共 MAG 和 SAG 数据集。
Nucleic Acids Res. 2022 Jul 22;50(13):e76. doi: 10.1093/nar/gkac294.
2
ICoVeR - an interactive visualization tool for verification and refinement of metagenomic bins.ICoVeR - 一种用于宏基因组分箱验证与优化的交互式可视化工具。
BMC Bioinformatics. 2017 May 2;18(1):233. doi: 10.1186/s12859-017-1653-5.
3
CoMet: a workflow using contig coverage and composition for binning a metagenomic sample with high precision.CoMet:一种使用 contig 覆盖度和组成进行宏基因组样本高精度分箱的工作流程。
BMC Bioinformatics. 2017 Dec 28;18(Suppl 16):571. doi: 10.1186/s12859-017-1967-3.
4
ATLAS: a Snakemake workflow for assembly, annotation, and genomic binning of metagenome sequence data.ATLAS:用于宏基因组序列数据组装、注释和基因组分箱的 SnakeMake 工作流程。
BMC Bioinformatics. 2020 Jun 22;21(1):257. doi: 10.1186/s12859-020-03585-4.
5
Recovery of strain-resolved genomes from human microbiome through an integration framework of single-cell genomics and metagenomics.通过单细胞基因组学和宏基因组学的整合框架从人类微生物组中恢复菌株解析基因组。
Microbiome. 2021 Oct 12;9(1):202. doi: 10.1186/s40168-021-01152-4.
6
Metagenomic Assembly: Reconstructing Genomes from Metagenomes.宏基因组组装:从宏基因组中重建基因组。
Methods Mol Biol. 2021;2242:139-152. doi: 10.1007/978-1-0716-1099-2_9.
7
Microbial dark matter sequences verification in amplicon sequencing and environmental metagenomics data.扩增子测序和环境宏基因组学数据中微生物暗物质序列的验证
Front Microbiol. 2023 Nov 2;14:1247119. doi: 10.3389/fmicb.2023.1247119. eCollection 2023.
8
Recovering complete and draft population genomes from metagenome datasets.从宏基因组数据集中恢复完整和草图的种群基因组。
Microbiome. 2016 Mar 8;4:8. doi: 10.1186/s40168-016-0154-5.
9
Uncultured prokaryotic genomes in the spotlight: An examination of publicly available data from metagenomics and single-cell genomics.备受关注的未培养原核生物基因组:对宏基因组学和单细胞基因组学公开数据的审视。
Comput Struct Biotechnol J. 2023 Sep 12;21:4508-4518. doi: 10.1016/j.csbj.2023.09.010. eCollection 2023.
10
BiosyntheticSPAdes: reconstructing biosynthetic gene clusters from assembly graphs.BiosyntheticSPAdes:从组装图重建生物合成基因簇。
Genome Res. 2019 Aug;29(8):1352-1362. doi: 10.1101/gr.243477.118. Epub 2019 Jun 3.

引用本文的文献

1
Distinct microbial communities within and on seep carbonates support long-term anaerobic oxidation of methane and divergent pMMO diversity.渗漏碳酸盐岩内部和表面不同的微生物群落支持甲烷的长期厌氧氧化和不同的颗粒甲烷单加氧酶多样性。
ISME J. 2025 Jan 2;19(1). doi: 10.1093/ismejo/wraf153.
2
Phylogenomic and Pangenomic Assessment of a Mediterranean Strain of Raphidiopsis raciborskii Extends Knowledge of the Global Distribution and Characteristics of a Potentially Toxigenic Cyanobacterium.对地中海席藻(Raphidiopsis raciborskii)菌株的系统基因组学和泛基因组学评估扩展了对一种潜在产毒蓝藻全球分布和特征的认识。
Environ Microbiol Rep. 2025 Jun;17(3):e70098. doi: 10.1111/1758-2229.70098.
3

本文引用的文献

1
Whokaryote: distinguishing eukaryotic and prokaryotic contigs in metagenomes based on gene structure.WhoKaryote:基于基因结构区分宏基因组中的真核生物和原核生物序列。
Microb Genom. 2022 May;8(5). doi: 10.1099/mgen.0.000823.
2
Tiara: deep learning-based classification system for eukaryotic sequences.Tiara:基于深度学习的真核序列分类系统。
Bioinformatics. 2022 Jan 3;38(2):344-350. doi: 10.1093/bioinformatics/btab672.
3
Printing Microbial Dark Matter: Using Single Cell Dispensing and Genomics to Investigate the Patescibacteria/Candidate Phyla Radiation.
Predicting climate-change impacts on the global glacier-fed stream microbiome.
预测气候变化对全球冰川补给河流微生物群落的影响。
Nat Commun. 2025 Feb 1;16(1):1264. doi: 10.1038/s41467-025-56426-4.
4
Structure and metabolic function of spatiotemporal pit mud microbiome.时空窖泥微生物群落的结构与代谢功能
Environ Microbiome. 2025 Jan 20;20(1):10. doi: 10.1186/s40793-025-00668-8.
5
Mapping the metagenomic diversity of the multi-kingdom glacier-fed stream microbiome.绘制多王国冰川补给溪流微生物组的宏基因组多样性图谱。
Nat Microbiol. 2025 Jan;10(1):217-230. doi: 10.1038/s41564-024-01874-9. Epub 2025 Jan 2.
6
Deciphering the biosynthetic landscape of biofilms in glacier-fed streams.解析冰川补给溪流中生物膜的生物合成格局。
mSystems. 2025 Feb 18;10(2):e0113724. doi: 10.1128/msystems.01137-24. Epub 2024 Dec 31.
7
Small amounts of misassembly can have disproportionate effects on pangenome-based metagenomic analyses.少量的错误组装可能会对基于泛基因组的宏基因组分析产生不成比例的影响。
bioRxiv. 2024 Oct 13:2024.10.11.617902. doi: 10.1101/2024.10.11.617902.
8
Biodiversity of microorganisms in the Baltic Sea: the power of novel methods in the identification of marine microbes.波罗的海中微生物的多样性:新型方法在海洋微生物鉴定中的威力。
FEMS Microbiol Rev. 2024 Sep 18;48(5). doi: 10.1093/femsre/fuae024.
9
Taxonomic and functional metagenomic assessment of a Dolichospermum bloom in a large and deep lake south of the Alps.对阿尔卑斯山南的一个大型深湖中螺旋鱼腥藻水华的分类和功能宏基因组评估。
FEMS Microbiol Ecol. 2024 Sep 14;100(10). doi: 10.1093/femsec/fiae117.
10
Sustainable agriculture: leveraging microorganisms for a circular economy.可持续农业:利用微生物实现循环经济。
Appl Microbiol Biotechnol. 2024 Aug 30;108(1):452. doi: 10.1007/s00253-024-13294-0.
打印微生物暗物质:利用单细胞分配和基因组学研究Patescibacteria/候选门辐射
Front Microbiol. 2021 Jun 16;12:635506. doi: 10.3389/fmicb.2021.635506. eCollection 2021.
4
GUNC: detection of chimerism and contamination in prokaryotic genomes.GUNC:原核基因组嵌合体和污染的检测。
Genome Biol. 2021 Jun 13;22(1):178. doi: 10.1186/s13059-021-02393-0.
5
Isolation of a member of the candidate phylum 'Atribacteria' reveals a unique cell membrane structure.候选门“酸杆菌门”成员的分离揭示了独特的细胞膜结构。
Nat Commun. 2020 Dec 14;11(1):6381. doi: 10.1038/s41467-020-20149-5.
6
A genomic catalog of Earth's microbiomes.地球微生物组的基因组目录。
Nat Biotechnol. 2021 Apr;39(4):499-509. doi: 10.1038/s41587-020-0718-6. Epub 2020 Nov 9.
7
Metagenome Proteins and Database Contamination.宏基因组蛋白质和数据库污染。
mSphere. 2020 Nov 4;5(6):e00854-20. doi: 10.1128/mSphere.00854-20.
8
Metagenome-assembled genome binning methods with short reads disproportionately fail for plasmids and genomic Islands.短读长元基因组组装基因组分箱方法对质粒和基因组岛的效果不成比例地差。
Microb Genom. 2020 Oct;6(10). doi: 10.1099/mgen.0.000436.
9
Microbial single-cell omics: the crux of the matter.微生物单细胞组学:问题的关键
Appl Microbiol Biotechnol. 2020 Oct;104(19):8209-8220. doi: 10.1007/s00253-020-10844-0. Epub 2020 Aug 26.
10
Targeted Cell Sorting Combined With Single Cell Genomics Captures Low Abundant Microbial Dark Matter With Higher Sensitivity Than Metagenomics.靶向细胞分选结合单细胞基因组学能够捕获低丰度微生物暗物质,其灵敏度高于宏基因组学。
Front Microbiol. 2020 Jul 22;11:1377. doi: 10.3389/fmicb.2020.01377. eCollection 2020.