Department of Computer Science, Hong Kong Baptist University, Kowloon, Hong Kong SAR, China.
BGI Research, Shenzhen 518083, China.
Viruses. 2024 Jan 17;16(1):134. doi: 10.3390/v16010134.
Circularity confers protection to viral genomes where linearity falls short, thereby fulfilling the aphorism. However, a shift away from morphology-based classification toward the molecular and ecological classification of viruses is currently underway within the field of virology. Recent years have seen drastic changes in the International Committee on Taxonomy of Viruses' operational definitions of viruses, particularly for the tailed phages that inhabit the human gut. After the abolition of the order Caudovirales, these tailed phages are best defined as members of the class Caudoviricetes. To determine the epistemological value of genome topology in the context of the human gut virome, we designed a set of seven experiments to assay the impact of genome topology and representative viral selection on biological interpretation. Using Oxford Nanopore long reads for viral genome assembly coupled with Illumina short-read polishing, we showed that circular and linear virus genomes differ remarkably in terms of genome quality, GC skew, transfer RNA gene frequency, structural variant frequency, cross-reference functional annotation (COG, KEGG, Pfam, and TIGRfam), state-of-the-art marker-based classification, and phage-host interaction. Furthermore, the disparity profile changes during dereplication. In particular, our phage-host interaction results demonstrated that proportional abundances cannot be meaningfully compared without due regard for genome topology and dereplication threshold, which necessitates the need for standardized reporting. As a best practice guideline, we recommend that comparative studies of the human gut virome always report the ratio of circular to linear viral genomes along with the dereplication threshold so that structural and functional metrics can be placed into context when assessing biologically relevant metagenomic properties such as proportional abundance.
循环性为病毒基因组提供了保护,而线性性则无法做到这一点,从而验证了这一说法。然而,病毒学领域目前正从基于形态的分类向基于分子和生态的病毒分类转变。近年来,国际病毒分类委员会对病毒的操作定义发生了巨大变化,特别是对于栖息在人类肠道中的长尾噬菌体。在废除长尾病毒目之后,这些长尾噬菌体最好被定义为长尾病毒科的成员。为了确定基因组拓扑在人类肠道病毒组中的认识论价值,我们设计了一组七个实验来检验基因组拓扑和代表性病毒选择对生物学解释的影响。我们使用牛津纳米孔长读测序进行病毒基因组组装,并结合 Illumina 短读测序进行优化,结果表明,圆形和线性病毒基因组在基因组质量、GC 倾斜、转移 RNA 基因频率、结构变异频率、交叉参考功能注释(COG、KEGG、Pfam 和 TIGRfam)、最新的基于标记的分类以及噬菌体-宿主相互作用方面存在显著差异。此外,差异分布在去重过程中发生变化。特别是,我们的噬菌体-宿主相互作用结果表明,在不考虑基因组拓扑和去重阈值的情况下,比例丰度无法进行有意义的比较,这就需要进行标准化报告。作为最佳实践指南,我们建议对人类肠道病毒组进行的比较研究始终报告圆形和线性病毒基因组的比例以及去重阈值,以便在评估与生物相关的宏基因组特性(如比例丰度)时,可以将结构和功能指标置于上下文中。