DOE Joint Genome Institute, Lawrence Berkeley National Laboratory, Berkeley, CA 94720, USA.
Nucleic Acids Res. 2021 Jan 8;49(D1):D764-D775. doi: 10.1093/nar/gkaa946.
Viruses are integral components of all ecosystems and microbiomes on Earth. Through pervasive infections of their cellular hosts, viruses can reshape microbial community structure and drive global nutrient cycling. Over the past decade, viral sequences identified from genomes and metagenomes have provided an unprecedented view of viral genome diversity in nature. Since 2016, the IMG/VR database has provided access to the largest collection of viral sequences obtained from (meta)genomes. Here, we present the third version of IMG/VR, composed of 18 373 cultivated and 2 314 329 uncultivated viral genomes (UViGs), nearly tripling the total number of sequences compared to the previous version. These clustered into 935 362 viral Operational Taxonomic Units (vOTUs), including 188 930 with two or more members. UViGs in IMG/VR are now reported as single viral contigs, integrated proviruses or genome bins, and are annotated with a new standardized pipeline including genome quality estimation using CheckV, taxonomic classification reflecting the latest ICTV update, and expanded host taxonomy prediction. The new IMG/VR interface enables users to efficiently browse, search, and select UViGs based on genome features and/or sequence similarity. IMG/VR v3 is available at https://img.jgi.doe.gov/vr, and the underlying data are available to download at https://genome.jgi.doe.gov/portal/IMG_VR.
病毒是地球上所有生态系统和微生物组的组成部分。通过对其细胞宿主的广泛感染,病毒可以重塑微生物群落结构并驱动全球养分循环。在过去的十年中,从基因组和宏基因组中鉴定出的病毒序列为我们提供了对自然界中病毒基因组多样性的前所未有的了解。自 2016 年以来,IMG/VR 数据库提供了访问最大的从(宏)基因组中获得的病毒序列集合的途径。在这里,我们展示了 IMG/VR 的第三个版本,它由 18373 个培养的和 2314329 个未培养的病毒基因组(UViGs)组成,与前一个版本相比,序列总数几乎增加了两倍。这些序列聚类成 935362 个病毒操作分类单元(vOTUs),其中 188930 个有两个或更多成员。IMG/VR 中的 UViGs 现在被报告为单个病毒连续序列、整合前病毒或基因组框,并用新的标准化管道进行注释,包括使用 CheckV 估计基因组质量、反映最新 ICTV 更新的分类学分类以及扩展的宿主分类预测。新的 IMG/VR 界面使用户能够根据基因组特征和/或序列相似性高效地浏览、搜索和选择 UViGs。IMG/VR v3 可在 https://img.jgi.doe.gov/vr 上获得,基础数据可在 https://genome.jgi.doe.gov/portal/IMG_VR 上下载。