Department of Microbiology and Immunology, Dalhousie University, Halifax, Nova Scotia, Canada.
Genome Biol Evol. 2019 Oct 1;11(10):2750-2766. doi: 10.1093/gbe/evz184.
High-throughput shotgun metagenomics sequencing has enabled the profiling of myriad natural communities. These data are commonly used to identify gene families and pathways that were potentially gained or lost in an environment and which may be involved in microbial adaptation. Despite the widespread interest in these events, there are no established best practices for identifying gene gain and loss in metagenomics data. Horizontal gene transfer (HGT) represents several mechanisms of gene gain that are especially of interest in clinical microbiology due to the rapid spread of antibiotic resistance genes in natural communities. Several additional mechanisms of gene gain and loss, including gene duplication, gene loss-of-function events, and de novo gene birth are also important to consider in the context of metagenomes but have been less studied. This review is largely focused on detecting HGT in prokaryotic metagenomes, but methods for detecting these other mechanisms are first discussed. For this article to be self-contained, we provide a general background on HGT and the different possible signatures of this process. Lastly, we discuss how improved assembly of genomes from metagenomes would be the most straight-forward approach for improving the inference of gene gain and loss events. Several recent technological advances could help improve metagenome assemblies: long-read sequencing, determining the physical proximity of contigs, optical mapping of short sequences along chromosomes, and single-cell metagenomics. The benefits and limitations of these advances are discussed and open questions in this area are highlighted.
高通量鸟枪法宏基因组测序技术使得对无数自然群落进行分析成为可能。这些数据通常用于识别在环境中可能获得或丢失的基因家族和途径,这些基因可能与微生物的适应有关。尽管人们对这些事件普遍感兴趣,但在宏基因组数据中识别基因获得和丢失还没有既定的最佳实践。水平基因转移 (HGT) 代表了几种基因获得的机制,由于抗生素耐药基因在自然群落中的快速传播,这些机制在临床微生物学中尤为重要。其他几种基因获得和丢失的机制,包括基因重复、基因功能丧失事件和从头基因产生,在宏基因组的背景下也很重要,但研究较少。这篇综述主要集中在检测原核宏基因组中的 HGT,但首先讨论了检测这些其他机制的方法。为了使本文自成一体,我们提供了关于 HGT 以及该过程的不同可能特征的一般背景。最后,我们讨论了如何通过改进宏基因组中基因组的组装来更直接地改进对基因获得和丢失事件的推断。最近的一些技术进步可以帮助改进宏基因组组装:长读测序、确定连续体的物理接近度、沿着染色体对短序列进行光学作图以及单细胞宏基因组学。讨论了这些进展的优缺点,并突出了该领域的开放性问题。