Park Sein, Steinegger Martin, Cho Ho-Seong, Chun Jongsik
Interdisciplinary Program in Bioinformatics, Seoul National University, Seoul, South Korea.
Institute of Molecular Biology and Genetics, Seoul National University, Seoul, South Korea.
Front Microbiol. 2020 Nov 16;11:585622. doi: 10.3389/fmicb.2020.585622. eCollection 2020.
is a model symbiont that colonizes the guts of vertebrates in studies on host adaptation of the gut symbiont. Previous studies have investigated host-specific phylogenetic and functional properties by isolating the genomic sequence. This dependency on genome isolation is a significant bottleneck. Here, we propose a method to study the association between and its hosts directly from metagenomic reads without strain isolation using pan-genomes. We characterized the host-specificity of in metagenomic samples, not only in previously studied organisms (mice and pigs) but also in dogs. For each sample, two types of profiles were generated: (1) genome-based strain type abundance profiles and (2) gene composition profiles. Our profiles showed host-association of in both phylogenetic and functional aspects without depending on host-specific genome isolation. We observed not only the presence of host-specific lineages, but also the dominant lineages associated with the different hosts. Furthermore, we showed that metagenome-assembled genomes provide detailed insights into the host-specificity of . We inferred evolutionary trajectories of host-associative strains in the metagenomic samples by placing the metagenome-assembled genomes into a phylogenetic tree and identified novel host-specific genes that were unannotated in existing pan-genome databases. Our pan-genomic approach reduces the need for time-consuming and expensive host-specific genome isolation, while producing consistent results with previous host-association findings in mice and pigs. Additionally, we predicted associations that have not yet been studied in dogs.
在肠道共生体宿主适应性研究中,是一种定殖于脊椎动物肠道的典型共生体。先前的研究通过分离基因组序列来研究宿主特异性的系统发育和功能特性。这种对基因组分离的依赖是一个重大瓶颈。在此,我们提出一种方法,可直接从宏基因组读数研究其与宿主之间的关联,而无需使用泛基因组进行菌株分离。我们不仅在先前研究过的生物体(小鼠和猪)中,还在狗中,表征了宏基因组样本中的宿主特异性。对于每个样本,生成了两种类型的图谱:(1)基于基因组的菌株类型丰度图谱和(2)基因组成图谱。我们的图谱在系统发育和功能方面均显示出与宿主的关联,而不依赖于宿主特异性基因组分离。我们不仅观察到了宿主特异性谱系的存在,还观察到了与不同宿主相关的优势谱系。此外,我们表明宏基因组组装基因组为其宿主特异性提供了详细见解。通过将宏基因组组装基因组置于系统发育树中,我们推断了宏基因组样本中宿主关联菌株的进化轨迹,并鉴定了现有泛基因组数据库中未注释的新型宿主特异性基因。我们的泛基因组方法减少了对耗时且昂贵的宿主特异性基因组分离的需求,同时产生了与先前在小鼠和猪中宿主关联研究结果一致的结果。此外,我们预测了尚未在狗中研究的关联。