Agustinho Daniel P, Fu Yilei, Menon Vipin K, Metcalf Ginger A, Treangen Todd J, Sedlazeck Fritz J
Human Genome Sequencing center, Baylor College of Medicine, Houston, TX, USA.
Department of Computer Science, Rice University, Houston, TX, USA.
Nat Methods. 2024 Jun;21(6):954-966. doi: 10.1038/s41592-024-02262-1. Epub 2024 Apr 30.
Long-read sequencing has recently transformed metagenomics, enhancing strain-level pathogen characterization, enabling accurate and complete metagenome-assembled genomes, and improving microbiome taxonomic classification and profiling. These advancements are not only due to improvements in sequencing accuracy, but also happening across rapidly changing analysis methods. In this Review, we explore long-read sequencing's profound impact on metagenomics, focusing on computational pipelines for genome assembly, taxonomic characterization and variant detection, to summarize recent advancements in the field and provide an overview of available analytical methods to fully leverage long reads. We provide insights into the advantages and disadvantages of long reads over short reads and their evolution from the early days of long-read sequencing to their recent impact on metagenomics and clinical diagnostics. We further point out remaining challenges for the field such as the integration of methylation signals in sub-strain analysis and the lack of benchmarks.
长读长测序最近改变了宏基因组学,提升了菌株水平的病原体特征描述,实现了准确完整的宏基因组组装基因组,并改进了微生物群落的分类和分析。这些进展不仅得益于测序准确性的提高,也发生在快速变化的分析方法中。在本综述中,我们探讨长读长测序对宏基因组学的深远影响,重点关注基因组组装、分类特征描述和变异检测的计算流程,以总结该领域的最新进展,并概述充分利用长读长的可用分析方法。我们深入探讨了长读长相对于短读长的优缺点,以及它们从长读长测序早期到近期对宏基因组学和临床诊断的影响的演变。我们还指出了该领域仍然存在的挑战,例如在亚菌株分析中整合甲基化信号以及缺乏基准。