Computational Methods for Strain-Level Microbial Detection in Colony and Metagenome Sequencing Data

Authors

Anyansi Christine, Straub Timothy J, Manson Abigail L, Earl Ashlee M, Abeel Thomas

Affiliations

Delft Bioinformatics Lab, Delft University of Technology, Delft, Netherlands.

Infectious Disease and Microbiome Program, Broad Institute of MIT and Harvard, Cambridge, MA, United States.

Publication information

Front Microbiol. 2020 Aug 18;11:1925. doi: 10.3389/fmicb.2020.01925. eCollection 2020.

DOI:10.3389/fmicb.2020.01925

PMID:33013732

Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC7507117/

Abstract

Metagenomic sequencing is a powerful tool for examining the diversity and complexity of microbial communities. Most widely used tools for taxonomic profiling of metagenomic sequence data allow for a species-level overview of the composition. However, individual strains within a species can differ greatly in key genotypic and phenotypic characteristics, such as drug resistance, virulence and growth rate. Therefore, the ability to resolve microbial communities down to the level of individual strains within a species is critical to interpreting metagenomic data for clinical and environmental applications, where identifying a particular strain, or tracking a particular strain across a set of samples, can aid in clinical diagnosis and treatment, or in characterizing as-yet-unstudied strains across novel environmental locations. Recently published approaches have begun to tackle the problem of resolving strains within a particular species in metagenomic samples. In this review, we present an overview of these new algorithms and their uses, including methods based on assembly reconstruction and methods operating with or without a reference database. While existing metagenomic analysis methods show reasonable performance at the species and higher taxonomic levels, identifying closely related strains within a species presents a bigger challenge, due to the diversity of databases, genetic relatedness, and goals when conducting these analyses. Selection of which metagenomic tool to employ for a specific application should be performed on a case-by-case basis, as these tools have strengths and weaknesses that affect their performance on specific tasks. A comprehensive benchmark across different use case scenarios is vital to validate performance of these tools on microbial samples. Because strain-level metagenomic analysis is still in its infancy, development of more fine-grained, high-resolution algorithms will continue to be in demand in the future.

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2d04/7507117/655f93a00592/fmicb-11-01925-g001.jpg
