Bhardwaj Kanchan, Garg Anjali, Jain Aakriti, Kumar Manish, Datt Manish, Singh Vijay, Vrati Sudhanshu
Manav Rachna International Institute of Research and Studies, Sector-43, Aravali hills, Faridabad, Haryana, 121 004, India.
Regional Centre for Biotechnology, NCR Biotech Science Cluster, Faridabad-Gurugram Expressway, Faridabad, Haryana, 121 001, India.
Virol J. 2025 Jul 25;22(1):254. doi: 10.1186/s12985-025-02739-1.
Metagenomics has revealed an unprecedented viral diversity in human gut although, most of the sequence data remains uncharacterized. In this study, we mined a collection of 1090 metagenome assembled "high quality" viral genomes (> 90% completeness, as determined by CheckV) derived from human fecal samples. Sequence analysis revealed eight new species spanning seven genera within the class, Caudoviricetes and nineteen new species from fourteen genera within the ssDNA virus family, Microviridae. Additionally, four "high quality" genomes were not found in any of the four major viral databases, NCBI viral RefSeq, IMG-VR, Gut Phage Database (GPD) and Gut Virome Database (GVD). Further, annotation and KEGG pathway analysis of the "high-quality" genomes identified seven core genes (antB, dnaB, DNMT1, DUT, xlyAB, xtmB and xtmA) associated with metabolism and fundamental viral processes. Moreover, genes for virulence, host-takeover, drug resistance, tRNA, tmRNA and CRISPR elements were also detected. Host prediction analysis suggest bacterial hosts for approximately 40% of the genomes. Overall, this study reports the discovery of novel viral genomes and provides a comprehensive genome profiling of human gut viruses in a subpopulation from India. These findings serve as a foundation for future biological investigations to elucidate the role of these viruses in host physiology.
宏基因组学揭示了人类肠道中前所未有的病毒多样性,尽管大多数序列数据仍未得到表征。在本研究中,我们挖掘了一组1090个宏基因组组装的“高质量”病毒基因组(完整性>90%,由CheckV确定),这些基因组来自人类粪便样本。序列分析揭示了尾病毒目七个属中的八个新物种,以及微小病毒科单链DNA病毒家族中14个属的19个新物种。此外,在四个主要病毒数据库(NCBI病毒RefSeq、IMG-VR、肠道噬菌体数据库(GPD)和肠道病毒组数据库(GVD))中均未发现四个“高质量”基因组。此外,对“高质量”基因组的注释和KEGG通路分析确定了七个与代谢和基本病毒过程相关的核心基因(antB、dnaB、DNMT1、DUT、xlyAB、xtmB和xtmA)。此外,还检测到了毒力、宿主接管、耐药性、tRNA、tmRNA和CRISPR元件的基因。宿主预测分析表明,约40%的基因组的细菌宿主。总体而言,本研究报告了新型病毒基因组的发现,并提供了印度一个亚群体中人类肠道病毒的全面基因组概况。这些发现为未来的生物学研究奠定了基础,以阐明这些病毒在宿主生理学中的作用。