Cook Ryan, Brown Nathan, Redgwell Tamsin, Rihtman Branko, Barnes Megan, Clokie Martha, Stekel Dov J, Hobman Jon, Jones Michael A, Millard Andrew
School of Veterinary Medicine and Science, University of Nottingham, Loughborough, United Kingdom.
Department of Genetics and Genome Biology, University of Leicester, Leicester, United Kingdom.
Phage (New Rochelle). 2021 Dec 1;2(4):214-223. doi: 10.1089/phage.2021.0007. Epub 2021 Dec 16.
With advances in sequencing technology and decreasing costs, the number of phage genomes that have been sequenced has increased markedly in the past decade. We developed an automated retrieval and analysis system for phage genomes (https://github.com/RyanCook94/inphared) to produce the INfrastructure for a PHAge REference Database (INPHARED) of phage genomes and associated metadata. As of January 2021, 14,244 complete phage genomes had been sequenced. The INPHARED data set is dominated by phages that infect a small number of bacterial genera, with 75% of phages isolated on only 30 bacterial genera. There is a further bias, with significantly more lytic phage genomes (∼70%) than temperate phage genomes (∼30%) within our database. Collectively, this results in ∼54% of temperate phage genomes originating from just three host genera. Given the ongoing debate about the carriage of antibiotic resistance genes by phages and its implications for the safety of phage therapy, we searched the genomes for putative antibiotic resistance genes. The frequency of antibiotic resistance gene carriage was found to be higher in temperate phages than in lytic phages and again varied with host. Given the bias in currently sequenced phage genomes, we suggest that, to fully understand phage diversity, efforts should be made to isolate and sequence a larger number of phages, in particular temperate phages, from a greater diversity of hosts.
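The host-bias figures in the abstract (for example, the share of genomes accounted for by the most common host genera) are simple concentration statistics over per-genome metadata. The following is a minimal sketch of how such figures could be computed, not the INPHARED implementation: the function names `genera_needed` and `top_share` are hypothetical, and the genus labels are toy placeholders rather than real data.

```python
from collections import Counter


def genera_needed(host_genera, fraction):
    """Return the smallest number of top-ranked host genera whose genomes
    make up at least `fraction` of all genomes (hypothetical helper)."""
    counts = Counter(host_genera)
    total = sum(counts.values())
    if total == 0:
        return 0
    covered = 0
    # Walk genera from most to least common, accumulating genome counts.
    for rank, (_, n) in enumerate(counts.most_common(), start=1):
        covered += n
        if covered / total >= fraction:
            return rank
    return len(counts)


def top_share(host_genera, k):
    """Return the fraction of genomes whose host is one of the k most
    common genera (hypothetical helper)."""
    counts = Counter(host_genera)
    total = sum(counts.values())
    if total == 0:
        return 0.0
    return sum(n for _, n in counts.most_common(k)) / total


# Toy placeholder labels, one entry per genome; not real INPHARED metadata.
toy_hosts = ["GenusA"] * 5 + ["GenusB"] * 3 + ["GenusC"] + ["GenusD"]

print(genera_needed(toy_hosts, 0.75))  # genera needed to cover 75% of genomes
print(top_share(toy_hosts, 3))         # share held by the top three genera
```

Applied to the full set of host genera, `genera_needed(hosts, 0.75)` corresponds to the kind of statistic reported as "75% of phages isolated on only 30 bacterial genera", and `top_share` restricted to temperate phages corresponds to the "three host genera" figure, assuming each genome carries a single host genus label.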