Environmental Genomics and Systems Biology Division, Lawrence Berkeley National Laboratory, Berkeley, CA, USA.
U.S. Department of Energy Joint Genome Institute, Berkeley, CA, USA.
Nat Microbiol. 2021 Jul;6(7):960-970. doi: 10.1038/s41564-021-00928-6. Epub 2021 Jun 24.
Bacteriophages have important roles in the ecology of the human gut microbiome but are under-represented in reference databases. To address this problem, we assembled the Metagenomic Gut Virus catalogue that comprises 189,680 viral genomes from 11,810 publicly available human stool metagenomes. Over 75% of genomes represent double-stranded DNA phages that infect members of the Bacteroidia and Clostridia classes. Based on sequence clustering we identified 54,118 candidate viral species, 92% of which were not found in existing databases. The Metagenomic Gut Virus catalogue improves detection of viruses in stool metagenomes and accounts for nearly 40% of CRISPR spacers found in human gut Bacteria and Archaea. We also produced a catalogue of 459,375 viral protein clusters to explore the functional potential of the gut virome. This revealed tens of thousands of diversity-generating retroelements, which use error-prone reverse transcription to mutate target genes and may be involved in the molecular arms race between phages and their bacterial hosts.
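The headline figures in the abstract imply a couple of derived quantities that are easy to check. The short Python sketch below computes them from the numbers quoted above only. The variable names are illustrative, and because the 92% figure is itself rounded, the resulting count of previously unseen species should be read as approximate rather than as a value reported by the authors.

```python
# Figures quoted in the abstract.
viral_genomes = 189_680        # genomes in the Metagenomic Gut Virus catalogue
stool_metagenomes = 11_810     # publicly available human stool metagenomes
candidate_species = 54_118     # candidate viral species from sequence clustering
fraction_novel = 0.92          # share of species not found in existing databases

# Approximate number of candidate species absent from existing databases.
# The 92% input is rounded, so this is an estimate, not a reported count.
novel_species = round(candidate_species * fraction_novel)

# Average number of catalogued viral genomes recovered per metagenome.
genomes_per_metagenome = viral_genomes / stool_metagenomes

# Average number of genomes grouped into each candidate species.
genomes_per_species = viral_genomes / candidate_species

print(f"Approx. novel candidate species: {novel_species:,}")
print(f"Mean genomes per metagenome:     {genomes_per_metagenome:.2f}")
print(f"Mean genomes per species:        {genomes_per_species:.2f}")
```

These are simple averages; the abstract gives no information about how genomes are distributed across samples or species, so they say nothing about skew.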