Li Yun, Sun Chuqing, Zhu Jiaying, Geng Mingyan, Li Min, Zhao Xing-Ming, Chen Wei-Hua
Key Laboratory of Molecular Biophysics of the Ministry of Education, Hubei Key Laboratory of Bioinformatics and Molecular Imaging, Center for Artificial Intelligence Biology, Department of Bioinformatics and Systems Biology, College of Life Science and Technology, Huazhong University of Science and Technology, Wuhan, Hubei, China.
Wallenberg Laboratory, Department of Molecular and Clinical Medicine and Sahlgrenska Center for Cardiovascular and Metabolic Research, University of Gothenburg, Gothenburg, Sweden.
Microbiol Spectr. 2025 Jul 2:e0001325. doi: 10.1128/spectrum.00013-25.
Due to varying sequencing strategies, current gut virome findings show significant variability. Specifically, bulk- and virus-like particle (VLP)-enriched metagenomic sequencing (termed bulk and VLP, respectively) present unique advantages and limitations, affecting viral genome discovery, taxonomic annotation, and community structure analysis. A comprehensive comparison of these strategies is crucial for thoroughly understanding the gut virome. This study comprehensively compared gut viromes identified from paired bulk and VLP data from 151 adult and 141 infant fecal samples. The VLP method showed superior performance to bulk in viral genome discovery in both data sets by recovering longer and more complete viral genomes, with higher sensitivity for low-abundant ones, resulting in a higher taxonomic annotation rate. However, we observed no correlations in the viral community structure (i.e., Shannon diversities) between bulk- and VLP-derived viromes, implying biases introduced during VLP enrichment. Such biases could be caused by the bacterial host features, such as the structural differences in cell walls and the prevalence and abundance of the viruses. Viruses that are of low prevalence, low abundance, or have Gram-positive bacteria as their hosts were enriched in VLP-derived viromes, in both the adult and infant data sets. Significant complementarity was observed between bulk and VLP viromes, with only about a quarter (26.7% in infants; 29.3% in adults) of VLP-viral genomes overlapping with bulk viruses. Together, our study identifies causal factors underlying the biases of bulk and VLP strategies in human gut virome studies and advocates the use of both strategies to enhance a comprehensive understanding of gut viromes.
The two mainstream gut phageome profiling strategies, namely bulk and virus-like particle (VLP), generated significantly overlapped results and have their own merits and drawbacks. Particularly, VLP exhibits higher efficiency in obtaining more, longer, and more complete viral genomes. However, VLP sequencing has the potential to alter the natural structure of viral communities, often resulting in the identification of viruses with lower prevalence and those specifically associated with Gram-positive bacterial hosts. While bulk metagenome features a more stable and diverse community, which can well reveal the interactions between viruses and bacteria. Nevertheless, bulk sequencing can suffer from lower coverage, leading to fragmented sequences and potentially missing some viral species. Therefore, it is essential to recognize that these methods are complementary rather than competitive in the comprehensive characterization of the gut phageome.
由于测序策略不同,目前肠道病毒组的研究结果存在显著差异。具体而言,基于宏基因组测序的整体样本和富集病毒样颗粒(VLP)的测序(分别称为整体样本测序和VLP测序)具有独特的优势和局限性,影响病毒基因组的发现、分类注释以及群落结构分析。全面比较这些策略对于深入了解肠道病毒组至关重要。本研究全面比较了从151份成人和141份婴儿粪便样本的配对整体样本和VLP数据中鉴定出的肠道病毒组。在两个数据集中,VLP方法在病毒基因组发现方面均表现出优于整体样本测序的性能,能够获得更长、更完整的病毒基因组,对低丰度病毒具有更高的灵敏度,从而具有更高的分类注释率。然而,我们观察到整体样本测序和VLP测序所获得的病毒群落结构(即香农多样性)之间没有相关性,这意味着在VLP富集过程中引入了偏差。这种偏差可能是由细菌宿主特征引起的,例如细胞壁的结构差异以及病毒宿主的流行率和丰度。在成人和婴儿数据集中,低流行率、低丰度或宿主为革兰氏阳性菌的病毒在VLP测序获得的病毒组中得到了富集。整体样本测序和VLP测序所获得的病毒组之间存在显著的互补性,只有约四分之一(婴儿中为26.7%;成人中为29.3%)的VLP病毒基因组与整体样本病毒重叠。总之,我们的研究确定了人类肠道病毒组研究中整体样本测序和VLP策略偏差的潜在原因,并提倡同时使用这两种策略以增强对肠道病毒组的全面理解。
两种主流的肠道噬菌体组分析策略,即整体样本测序和病毒样颗粒(VLP)测序,产生的结果有显著重叠,且各有优缺点。特别是,VLP在获得更多、更长、更完整的病毒基因组方面表现出更高的效率。然而,VLP测序有可能改变病毒群落的自然结构,常常导致鉴定出流行率较低的病毒以及那些与革兰氏阳性菌宿主特异性相关的病毒。而整体样本宏基因组具有更稳定、更多样化的群落,能够很好地揭示病毒与细菌之间的相互作用。尽管如此,整体样本测序可能存在较低的覆盖率,导致序列片段化,并可能遗漏一些病毒种类。因此,必须认识到这些方法在肠道噬菌体组的全面表征中是互补的,而非相互竞争的。