Center for Microbiota and Immunological Diseases, Shanghai General Hospital, Shanghai Institute of Immunology, Shanghai Jiao Tong University, School of Medicine, Shanghai 2,000,025, China.
Shanghai Institute of Immunology, Shanghai Jiao Tong University, School of Medicine, Shanghai 200,000, China.
Brief Bioinform. 2021 Sep 2;22(5). doi: 10.1093/bib/bbab030.
Recent advances in high-throughput sequencing technologies and computational methods have added a new dimension to metagenomic data analysis i.e. genome-resolved metagenomics. In general terms, it refers to the recovery of draft or high-quality microbial genomes and their taxonomic classification and functional annotation. In recent years, several studies have utilized the genome-resolved metagenome analysis approach and identified previously unknown microbial species from human and environmental metagenomes. In this review, we describe genome-resolved metagenome analysis as a series of four necessary steps: (i) preprocessing of the sequencing reads, (ii) de novo metagenome assembly, (iii) genome binning and (iv) taxonomic and functional analysis of the recovered genomes. For each of these four steps, we discuss the most commonly used tools and the currently available pipelines to guide the scientific community in the recovery and subsequent analyses of genomes from any metagenome sample. Furthermore, we also discuss the tools required for validation of assembly quality as well as for improving quality of the recovered genomes. We also highlight the currently available pipelines that can be used to automate the whole analysis without having advanced bioinformatics knowledge. Finally, we will highlight the most widely adapted and actively maintained tools and pipelines that can be helpful to the scientific community in decision making before they commence the analysis.
近年来,高通量测序技术和计算方法的进步为宏基因组数据分析增添了一个新维度,即基因组解析宏基因组学。一般来说,它是指从人类和环境宏基因组中恢复草图或高质量微生物基因组及其分类和功能注释。近年来,已有多项研究利用基因组解析宏基因组分析方法从人类和环境宏基因组中鉴定出以前未知的微生物物种。在这篇综述中,我们将基因组解析宏基因组分析描述为四个必要步骤:(i)测序reads 的预处理,(ii)从头宏基因组组装,(iii)基因组分箱和(iv)回收基因组的分类和功能分析。对于这四个步骤中的每一个,我们讨论最常用的工具和当前可用的管道,以指导科学界从任何宏基因组样本中回收和随后分析基因组。此外,我们还讨论了用于验证组装质量以及提高回收基因组质量所需的工具。我们还强调了目前可用的管道,这些管道可以在没有先进生物信息学知识的情况下自动化整个分析。最后,我们将重点介绍最广泛适应和积极维护的工具和管道,以便科学界在开始分析之前做出决策。
Brief Bioinform. 2021-9-2
Microbiol Spectr. 2021-12-22
mSystems. 2022-8-30
Microbiome. 2020-11-11
BMC Bioinformatics. 2015
Methods Mol Biol. 2018
BMC Genomics. 2017-11-28
Gigascience. 2022-12-28
J Mol Biol. 2023-7-15
Curr Res Microb Sci. 2022-8-7
Microorganisms. 2022-11-24
Front Cell Infect Microbiol. 2022
BMC Bioinformatics. 2021-1-6
Nat Biotechnol. 2021-4
Nat Biotechnol. 2021-1
mSphere. 2020-5-20
Bioinformatics. 2020-8-15
Nat Biotechnol. 2020-4-27