Center of Excellence in Genomics and Precision Dentistry, Department of Physiology, Faculty of Dentistry, Chulalongkorn University, Bangkok, Thailand.
Graduate Program in Bioinformatics and Computational Biology, Faculty of Science, Chulalongkorn University, Bangkok, Thailand.
J Transl Med. 2024 Jan 28;22(1):111. doi: 10.1186/s12967-024-04917-1.
The study of microbial communities has advanced considerably, from the initial use of 16S rRNA sequencing to the adoption of shotgun metagenomics. Long-read sequencing (LRS) now marks a further shift, offering substantial improvements over its predecessor, short-read sequencing (SRS). LRS produces reads that are several kilobases long, enabling researchers to obtain more complete and contiguous genomic information, characterize structural variations, and study epigenetic modifications. The current leaders in LRS technology are Pacific Biosciences (PacBio) and Oxford Nanopore Technologies (ONT), each offering a distinct set of advantages. This review covers the workflow of long-read metagenomic sequencing: sample preparation (sample collection, sample extraction, and library preparation), sequencing, processing (quality control, assembly, and binning), and analysis (taxonomic and functional annotation). Each section concisely outlines the key concept of the methodology, presenting the original concept and how it is challenged or modified in the context of LRS. Each section also introduces tools that are compatible with LRS and can be used to carry out the corresponding step. This review aims to present the metagenomics workflow, highlight the transformative impact of LRS, and provide researchers with a selection of tools suited to the task.