利用单细胞宏基因组学进行菌株水平微生物检测和定量。

Strain level microbial detection and quantification with applications to single cell metagenomics.

机构信息

Cancer Data Science Laboratory, National Cancer Institute, National Institutes of Health, Bethesda, MD, USA.

Department of Computer Science & Engineering, UC San Diego, La Jolla, CA, USA.

出版信息

Nat Commun. 2022 Oct 28;13(1):6430. doi: 10.1038/s41467-022-33869-7.

DOI:10.1038/s41467-022-33869-7

PMID:36307411

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC9616933/

Abstract

Computational identification and quantification of distinct microbes from high throughput sequencing data is crucial for our understanding of human health. Existing methods either use accurate but computationally expensive alignment-based approaches or less accurate but computationally fast alignment-free approaches, which often fail to correctly assign reads to genomes. Here we introduce CAMMiQ, a combinatorial optimization framework to identify and quantify distinct genomes (specified by a database) in a metagenomic dataset. As a key methodological innovation, CAMMiQ uses substrings of variable length and those that appear in two genomes in the database, as opposed to the commonly used fixed-length, unique substrings. These substrings allow to accurately decouple mixtures of highly similar genomes resulting in higher accuracy than the leading alternatives, without requiring additional computational resources, as demonstrated on commonly used benchmarking datasets. Importantly, we show that CAMMiQ can distinguish closely related bacterial strains in simulated metagenomic and real single-cell metatranscriptomic data.

摘要

从高通量测序数据中计算识别和量化不同的微生物对于我们理解人类健康至关重要。现有的方法要么使用准确但计算成本高的基于比对的方法，要么使用准确性较低但计算速度快的无比对方法，但这些方法往往无法正确地将reads 分配到基因组上。在这里，我们介绍了 CAMMiQ，这是一种组合优化框架，用于在宏基因组数据集中识别和量化不同的基因组（由数据库指定）。作为一个关键的方法学创新，CAMMiQ 使用可变长度的子字符串和数据库中两个基因组中出现的子字符串，而不是常用的固定长度、唯一的子字符串。这些子字符串可以准确地分离高度相似的基因组混合物，从而比领先的替代方法具有更高的准确性，而无需额外的计算资源，这在常用的基准数据集上得到了验证。重要的是，我们表明 CAMMiQ 可以区分模拟宏基因组和真实单细胞宏转录组数据中密切相关的细菌菌株。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b75a/9616933/1b8ecace220e/41467_2022_33869_Fig1_HTML.jpg

相似文献

Strain level microbial detection and quantification with applications to single cell metagenomics.

Nat Commun. 2022 Oct 28;13(1):6430. doi: 10.1038/s41467-022-33869-7.

MinION™ nanopore sequencing of environmental metagenomes: a synthetic approach.

Gigascience. 2017 Mar 1;6(3):1-10. doi: 10.1093/gigascience/gix007.

Strain-level metagenomic assignment and compositional estimation for long reads with MetaMaps.

Nat Commun. 2019 Jul 11;10(1):3066. doi: 10.1038/s41467-019-10934-2.

Mora: abundance aware metagenomic read re-assignment for disentangling similar strains.

BMC Bioinformatics. 2024 Apr 23;25(1):161. doi: 10.1186/s12859-024-05768-9.

Signal enrichment with strain-level resolution in metagenomes using topological data analysis.

BMC Genomics. 2019 Apr 4;20(Suppl 2):194. doi: 10.1186/s12864-019-5490-y.

A comprehensive investigation of metagenome assembly by linked-read sequencing.

Microbiome. 2020 Nov 11;8(1):156. doi: 10.1186/s40168-020-00929-3.

Evaluating Assembly and Binning Strategies for Time Series Drinking Water Metagenomes.

Microbiol Spectr. 2021 Dec 22;9(3):e0143421. doi: 10.1128/Spectrum.01434-21. Epub 2021 Nov 3.

MetaObtainer: A Tool for Obtaining Specified Species from Metagenomic Reads of Next-generation Sequencing.

Interdiscip Sci. 2015 Dec;7(4):405-13. doi: 10.1007/s12539-015-0281-x. Epub 2015 Aug 21.

MetaID: a novel method for identification and quantification of metagenomic samples.

BMC Genomics. 2013;14 Suppl 8(Suppl 8):S4. doi: 10.1186/1471-2164-14-S8-S4. Epub 2013 Dec 9.

MiCoP: microbial community profiling method for detecting viral and fungal organisms in metagenomic samples.

BMC Genomics. 2019 Jun 6;20(Suppl 5):423. doi: 10.1186/s12864-019-5699-9.

引用本文的文献

Bioinformatic approaches to blood and tissue microbiome analyses: challenges and perspectives.

Brief Bioinform. 2025 Mar 4;26(2). doi: 10.1093/bib/bbaf176.

MeStanG-Resource for High-Throughput Sequencing Standard Data Sets Generation for Bioinformatic Methods Evaluation and Validation.

Biology (Basel). 2025 Jan 14;14(1):69. doi: 10.3390/biology14010069.

kMetaShot: a fast and reliable taxonomy classifier for metagenome-assembled genomes.

Brief Bioinform. 2024 Nov 22;26(1). doi: 10.1093/bib/bbae680.

Beyond the Gut: The intratumoral microbiome's influence on tumorigenesis and treatment response.

Cancer Commun (Lond). 2024 Oct;44(10):1130-1167. doi: 10.1002/cac2.12597. Epub 2024 Aug 1.

Identification of intracellular bacteria from multiple single-cell RNA-seq platforms using CSI-Microbes.

Sci Adv. 2024 Jul 5;10(27):eadj7402. doi: 10.1126/sciadv.adj7402. Epub 2024 Jul 3.

Fast, parallel, and cache-friendly suffix array construction.

Algorithms Mol Biol. 2024 Apr 28;19(1):16. doi: 10.1186/s13015-024-00263-5.

Isolation and Cultivation of Human Gut Microorganisms: A Review.

Microorganisms. 2023 Apr 20;11(4):1080. doi: 10.3390/microorganisms11041080.

Sketching and sampling approaches for fast and accurate long read classification.

BMC Bioinformatics. 2022 Oct 31;23(1):452. doi: 10.1186/s12859-022-05014-0.

本文引用的文献

Computational Methods for Strain-Level Microbial Detection in Colony and Metagenome Sequencing Data.

Front Microbiol. 2020 Aug 18;11:1925. doi: 10.3389/fmicb.2020.01925. eCollection 2020.

ganon: precise metagenomics classification against large and up-to-date sets of reference sequences.

Bioinformatics. 2020 Jul 1;36(Suppl_1):i12-i20. doi: 10.1093/bioinformatics/btaa458.

Metagenomic growth rate inferences of strains in situ.

Sci Adv. 2020 Apr 22;6(17):eaaz2299. doi: 10.1126/sciadv.aaz2299. eCollection 2020 Apr.

The human tumor microbiome is composed of tumor type-specific intracellular bacteria.

Science. 2020 May 29;368(6494):973-980. doi: 10.1126/science.aay9189.

To Petabytes and beyond: recent advances in probabilistic and signal processing algorithms and their application to metagenomics.

Nucleic Acids Res. 2020 Jun 4;48(10):5217-5234. doi: 10.1093/nar/gkaa265.

Microbiome analyses of blood and tissues suggest cancer diagnostic approach.

Nature. 2020 Mar;579(7800):567-574. doi: 10.1038/s41586-020-2095-1. Epub 2020 Mar 11.

Carnelian uncovers hidden functional patterns across diverse study populations from whole metagenome sequencing reads.

Genome Biol. 2020 Feb 24;21(1):47. doi: 10.1186/s13059-020-1933-7.

Improved metagenomic analysis with Kraken 2.

Genome Biol. 2019 Nov 28;20(1):257. doi: 10.1186/s13059-019-1891-0.

Mash Screen: high-throughput sequence containment estimation for genome discovery.

Genome Biol. 2019 Nov 5;20(1):232. doi: 10.1186/s13059-019-1841-x.

Benchmarking Metagenomics Tools for Taxonomic Classification.

Cell. 2019 Aug 8;178(4):779-794. doi: 10.1016/j.cell.2019.07.010.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

利用单细胞宏基因组学进行菌株水平微生物检测和定量。

Strain level microbial detection and quantification with applications to single cell metagenomics.

机构信息

Cancer Data Science Laboratory, National Cancer Institute, National Institutes of Health, Bethesda, MD, USA.

Department of Computer Science & Engineering, UC San Diego, La Jolla, CA, USA.

出版信息

Nat Commun. 2022 Oct 28;13(1):6430. doi: 10.1038/s41467-022-33869-7.

DOI:10.1038/s41467-022-33869-7

PMID:36307411

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC9616933/

Abstract

摘要

利用单细胞宏基因组学进行菌株水平微生物检测和定量。

Strain level microbial detection and quantification with applications to single cell metagenomics.

机构信息

出版信息

相似文献

引用本文的文献

本文引用的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

利用单细胞宏基因组学进行菌株水平微生物检测和定量。

Strain level microbial detection and quantification with applications to single cell metagenomics.

机构信息

出版信息

相似文献

引用本文的文献

本文引用的文献