Abe Takashi, Akazawa Yu, Toyoda Atsushi, Niki Hironori, Baba Tomoya
Department of Information Engineering, Faculty of Engineering, Niigata University, Niigata, Japan.
Comparative Genomics Laboratory, National Institute of Genetics, Mishima, Japan.
Front Microbiol. 2020 Jul 3;11:1486. doi: 10.3389/fmicb.2020.01486. eCollection 2020.
Horizontal gene transfer (HGT) has been widely suggested to play a critical role in the environmental adaptation of microbes; however, the number and origin of the genes acquired through HGT in microbial genomes remain unknown, because the frequency of detected HGT events is generally underestimated, particularly in the absence of information on donor sequences. As an alternative to phylogeny-based methods that rely on sequence alignments, we have developed an alignment-free clustering method based on an unsupervised neural network, the Batch-Learning Self-Organizing Map (BLSOM), in which sequence fragments are clustered solely on oligonucleotide similarity, without taxonomic information, to detect HGT candidates and their origins in entire genomes. By mapping microbial genomic sequences onto large-scale BLSOMs constructed with nearly all prokaryotic genomes, HGT candidates can be identified and their origins assigned comprehensively, even for microbial genomes that exhibit high novelty. By focusing on two types of , specifically psychrotolerant strains from an Antarctic lake, we detected HGT candidates using BLSOM and found higher proportions of HGT candidates from organisms belonging to in the genomes of these two Antarctic strains compared with those of continental strains. Further, the origins of the HGT candidates differed between the two Antarctic strains. Although their origins were highly diversified, gene functions related to cell wall or membrane biogenesis were shared among the HGT candidates. Moreover, analyses of amino acid frequency suggested that housekeeping genes and some HGT candidates of the Antarctic strains exhibited characteristics different from those of the continental strains: Lys, Ser, Thr, and Val were increased in the Antarctic strains, whereas Ala, Arg, Glu, and Leu were decreased.
Our findings strongly suggest a low-temperature adaptation process for microbes that may have arisen convergently as an independent evolutionary strategy in each Antarctic strain. Hence, BLSOM analysis could serve as a powerful tool not only for detecting HGT candidates and their origins in entire genomes, but also for providing novel perspectives on the environmental adaptations of microbes.
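The alignment-free idea described above rests on representing each sequence fragment by its oligonucleotide composition. The following is a minimal sketch of that representation only, not the authors' BLSOM implementation: the function names `fragment` and `kmer_frequencies`, the default oligonucleotide length `k=4`, and the fragment size are illustrative assumptions, since the abstract does not state the parameters used.

```python
from collections import Counter
from itertools import product


def fragment(seq, size):
    """Split a sequence into non-overlapping fragments of `size` bases.

    A trailing piece shorter than `size` is discarded (an assumption made
    here for simplicity).
    """
    return [seq[i:i + size] for i in range(0, len(seq) - size + 1, size)]


def kmer_frequencies(seq, k=4):
    """Return the relative frequency of every oligonucleotide of length k.

    The result is a list of 4**k values ordered lexicographically over
    A, C, G, T. Windows containing other symbols (e.g. N) are ignored.
    """
    seq = seq.upper()
    kmers = ["".join(p) for p in product("ACGT", repeat=k)]
    counts = Counter(seq[i:i + k] for i in range(len(seq) - k + 1))
    total = sum(counts[m] for m in kmers)
    if total == 0:
        return [0.0] * len(kmers)
    return [counts[m] / total for m in kmers]


if __name__ == "__main__":
    # Hypothetical toy sequence; real input would be genomic fragments.
    genome = "ACGTACGTTTGACCAGTACGATCGATCGGCTA"
    for frag in fragment(genome, 16):
        vector = kmer_frequencies(frag, k=2)
        print(frag, len(vector), round(sum(vector), 6))
```

Each fragment becomes a fixed-length vector regardless of which organism it came from, so vectors of this kind can be passed to a self-organizing map or any other unsupervised clustering method without taxonomic labels; a fragment whose composition places it among sequences from another lineage would then be flagged as an HGT candidate.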