前噬菌体数据库:一个用于探索前噬菌体的多样性、分布和生态学的综合数据库。

Prophage-DB: A comprehensive database to explore diversity, distribution, and ecology of prophages.

作者信息

Dieppa-Colón Etan, Martin Cody, Anantharaman Karthik

机构信息

Department of Bacteriology, University of Wisconsin-Madison.

Microbiology Doctoral Training Program, University of Wisconsin-Madison.

出版信息

bioRxiv. 2024 Jul 16:2024.07.11.603044. doi: 10.1101/2024.07.11.603044.

Abstract

BACKGROUND

Viruses that infect prokaryotes (phages) constitute the most abundant group of biological agents, playing pivotal roles in microbial systems. They are known to impact microbial community dynamics, microbial ecology, and evolution. Efforts to document the diversity, host range, infection dynamics, and effects of bacteriophage infection on host cell metabolism are extremely underexplored. Phages are classified as virulent or temperate based on their life cycles. Temperate phages adopt the lysogenic mode of infection, where the genome integrates into the host cell genome forming a prophage. Prophages enable viral genome replication without host cell lysis, and often contribute novel and beneficial traits to the host genome. Current phage research predominantly focuses on lytic phages, leaving a significant gap in knowledge regarding prophages, including their biology, diversity, and ecological roles.

RESULTS

Here we develop and describe Prophage-DB, a database of prophages, their proteins, and associated metadata that will serve as a resource for viral genomics and microbial ecology. To create the database, we identified and characterized prophages from genomes in three of the largest publicly available databases. We applied several state-of-the-art tools in our pipeline to annotate these viruses, cluster and taxonomically classify them, and detect their respective auxiliary metabolic genes. In total, we identify and characterize over 350,000 prophages and 35,000 auxiliary metabolic genes. Our prophage database is highly representative based on statistical results and contains prophages from a diverse set of archaeal and bacterial hosts which show a wide environmental distribution.

CONCLUSION

Prophages are particularly overlooked in viral ecology and merit increased attention due to their vital implications for microbiomes and their hosts. Here, we created Prophage-DB to advance our comprehension of prophages in microbiomes through a comprehensive characterization of prophages in publicly available genomes. We propose that Prophage-DB will serve as a valuable resource for advancing phage research, offering insights into viral taxonomy, host relationships, auxiliary metabolic genes, and environmental distribution.

摘要

背景

感染原核生物的病毒(噬菌体)是最丰富的生物因子群体,在微生物系统中发挥着关键作用。已知它们会影响微生物群落动态、微生物生态学和进化。然而,记录噬菌体的多样性、宿主范围、感染动态以及噬菌体感染对宿主细胞代谢的影响的研究却极为不足。噬菌体根据其生命周期可分为烈性噬菌体和温和噬菌体。温和噬菌体采用溶原性感染模式,其基因组整合到宿主细胞基因组中形成前噬菌体。前噬菌体能够在不裂解宿主细胞的情况下实现病毒基因组复制,并且常常为宿主基因组贡献新的有益性状。当前的噬菌体研究主要集中在烈性噬菌体上,在关于前噬菌体的知识方面存在重大空白,包括它们的生物学特性、多样性和生态作用。

结果

在此,我们开发并描述了前噬菌体数据库(Prophage-DB),这是一个关于前噬菌体、其蛋白质及相关元数据的数据库,将作为病毒基因组学和微生物生态学的资源。为创建该数据库,我们从三个最大的公开可用数据库中的基因组中鉴定并表征了前噬菌体。我们在流程中应用了多种先进工具来注释这些病毒、对它们进行聚类和分类,并检测其各自的辅助代谢基因。我们总共鉴定并表征了超过350,000个前噬菌体和35,000个辅助代谢基因。根据统计结果,我们的前噬菌体数据库具有高度代表性,包含来自各种古菌和细菌宿主的前噬菌体,这些宿主具有广泛的环境分布。

结论

在前病毒生态学中,前噬菌体尤其被忽视,由于它们对微生物群落及其宿主具有至关重要的影响,因此值得更多关注。在此,我们创建了前噬菌体数据库,通过对公开可用基因组中的前噬菌体进行全面表征,来增进我们对微生物群落中前噬菌体的理解。我们认为,前噬菌体数据库将成为推进噬菌体研究的宝贵资源,为病毒分类学、宿主关系、辅助代谢基因和环境分布提供见解。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/32be/11275716/fe47ae37d1ba/nihpp-2024.07.11.603044v1-f0001.jpg

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索