Mathieu Alban, Leclercq Mickael, Sanabria Melissa, Perin Olivier, Droit Arnaud
Computational Biology Laboratory, CHU de Québec - Université Laval Research Centre, Québec City, QC, Canada.
Université Côte d'Azur, CNRS, INRIA, I3S, Nice, France.
Front Microbiol. 2022 Mar 14;13:811495. doi: 10.3389/fmicb.2022.811495. eCollection 2022.
Shotgun sequencing of environmental DNA (i.e., metagenomics) has revolutionized the field of environmental microbiology, allowing the characterization of all microorganisms in a sequencing experiment. To identify the microbes in terms of taxonomy and biological activity, the sequenced reads must necessarily be aligned on known microbial genomes/genes. However, current alignment methods are limited in terms of speed and can produce a significant number of false positives when detecting bacterial species or false negatives in specific cases (virus, plasmids, and gene detection). Moreover, recent advances in metagenomics have enabled the reconstruction of new genomes using binning strategies, but these genomes, not yet fully characterized, are not used in classic approaches, whereas machine and deep learning methods can use them as models. In this article, we attempted to review the different methods and their efficiency to improve the annotation of metagenomic sequences. Deep learning models have reached the performance of the widely used k-mer alignment-based tools, with better accuracy in certain cases; however, they still must demonstrate their robustness across the variety of environmental samples and across the rapid expansion of accessible genomes in databases.
Front Microbiol. 2022-3-14
BMC Bioinformatics. 2018-7-9
BMC Bioinformatics. 2020-7-28
Methods Mol Biol. 2018
Bioinformatics. 2016-4-1
Bioinformatics. 2019-5-1
Genome Biol. 2017-9-21
Funct Integr Genomics. 2025-6-10
Natl Sci Rev. 2025-2-19
Brief Bioinform. 2025-3-4
Brief Bioinform. 2025-3-4
Front Microbiol. 2025-1-22
Gastroenterol Hepatol Bed Bench. 2024
Int J Mol Sci. 2024-5-26
Nat Rev Mol Cell Biol. 2022-1
NAR Genom Bioinform. 2020-2-19
Nucleic Acids Res. 2021-1-8
Nucleic Acids Res. 2021-1-8
Environ Microbiol. 2021-1
Nucleic Acids Res. 2021-1-8
Nat Rev Microbiol. 2020-5
BMC Bioinformatics. 2020-2-24
Genome Biol. 2019-11-28