Maier Jessie L, Gin Craig, Callahan Ben, Sheriff Emma K, Duerkop Breck A, Kleiner Manuel
Department of Plant and Microbial Biology, North Carolina State University, Raleigh, NC.
Department of Population Health and Pathobiology, North Carolina State University, Raleigh, NC.
bioRxiv. 2024 Mar 25:2024.03.25.586692. doi: 10.1101/2024.03.25.586692.
Serovar Typhimurium () and its bacteriophage P22 are a model system for the study of horizontal gene transfer by generalized transduction. Typically, the P22 DNA packaging machinery initiates packaging when a short sequence of DNA, known as the pac site, is recognized on the P22 genome. However, sequences similar to the pac site in the host genome, called pseudo-pac sites, lead to erroneous packaging and subsequent generalized transduction of DNA. While the general genomic locations of the pseudo-pac sites are known, the sequences themselves have not been determined. We used visualization of P22 sequencing reads mapped to host genomes to define regions of generalized transduction initiation and the likely locations of pseudo-pac sites. We searched each genome region for the sequence with the highest similarity to the P22 pac site and aligned the resulting sequences. We built a regular expression (sequence match pattern) from the alignment and used it to search the genomes of two P22-susceptible strains- LT2 and 14028S- for sequence matches. The final regular expression successfully identified pseudo-pac sites in both LT2 and 14028S that correspond with generalized transduction initiation sites in mapped read coverages. The pseudo-pac site sequences identified in this study can be used to predict locations of generalized transduction in other P22-susceptible hosts or to initiate generalized transduction at specific locations in P22-susceptible hosts with genetic engineering. Furthermore, the bioinformatics approach used to identify the pseudo-pac sites in this study could be applied to other phage-host systems.
鼠伤寒血清型()及其噬菌体P22是用于研究通过广义转导进行水平基因转移的模型系统。通常,当在P22基因组上识别出一段短的DNA序列(称为pac位点)时,P22 DNA包装机制就会启动包装。然而,宿主基因组中与pac位点相似的序列(称为假pac位点)会导致错误包装以及随后DNA的广义转导。虽然假pac位点的大致基因组位置是已知的,但序列本身尚未确定。我们利用映射到宿主基因组的P22测序读数的可视化来定义广义转导起始区域和假pac位点的可能位置。我们在每个基因组区域中搜索与P22 pac位点相似度最高的序列,并对所得序列进行比对。我们根据比对结果构建了一个正则表达式(序列匹配模式),并用它在两种对P22敏感的菌株——LT2和14028S——的基因组中搜索序列匹配情况。最终的正则表达式成功地在LT2和14028S中识别出了与映射读数覆盖中的广义转导起始位点相对应的假pac位点。本研究中鉴定出的假pac位点序列可用于预测其他对P22敏感的宿主中的广义转导位置,或通过基因工程在对P22敏感的宿主中的特定位置启动广义转导。此外,本研究中用于识别假pac位点的生物信息学方法可应用于其他噬菌体-宿主系统。