Camejo Pamela Yael, Rojas Felipe, Ossa Antonio, Hurtado Rodrigo, Tichy Daniel, Pieringer Christian, Pino Michael, Mora-Uribe Paola, Ulloa Soledad, Norambuena Rodrigo, Tobar-Calfucoy Eduardo, Aguilera Matías, Rojas-Martínez Victoria, Cifuentes Onix, Sabag Andrea, Cifuentes Nicolas, San Martín Daniel, Infante Claudia, Cifuentes Pablo, Pieringer Hans, León Luis E
PhageLab Chile SpA, Santiago, Chile.
Sci Rep. 2025 Oct 31;15(1):38249. doi: 10.1038/s41598-025-22075-2.
The use of bacteriophages for biological control of bacterial infections is a promising approach to combat antimicrobial resistant bacteria. Prediction of phage-bacteria interactions is key to identify sensitive bacterial strains to phage therapy. Since these interactions are governed by multiple biological mechanisms, it is not a simple task to predict the outcome of a phage infection, which varies even among strains from the same species. In this study, machine learning-based models capable of predicting the host range of phages from sequencing data were developed. Models were trained using phage-bacteria protein-protein interactions (PPI), predicted from PPI databases, and a host-range dataset obtained from experimental assays with 10 Salmonella enterica and 3 Escherichia coli bacteriophages. The performance of prediction models differed among bacteriophages, ranging from 78 to 92% of accuracy in the case of Salmonella and 84-94% in Escherichia phages, with the highest accuracy (94%) achieved for E. coli phage CBDS-07. Results demonstrated the effectiveness of using PPI as a feature to design ML models for phage-bacteria phenotype prediction.
利用噬菌体对细菌感染进行生物控制是对抗抗菌耐药菌的一种有前景的方法。预测噬菌体与细菌的相互作用是确定对噬菌体疗法敏感的细菌菌株的关键。由于这些相互作用受多种生物学机制支配,预测噬菌体感染的结果并非易事,即使在同一物种的菌株之间结果也会有所不同。在本研究中,开发了基于机器学习的模型,能够从测序数据预测噬菌体的宿主范围。使用从PPI数据库预测的噬菌体-细菌蛋白质-蛋白质相互作用(PPI)以及通过对10种肠炎沙门氏菌和3种大肠杆菌噬菌体进行实验分析获得的宿主范围数据集对模型进行训练。预测模型的性能在不同噬菌体之间存在差异,沙门氏菌噬菌体的准确率在78%至92%之间,大肠杆菌噬菌体的准确率在84%至94%之间,其中大肠杆菌噬菌体CBDS-07的准确率最高(94%)。结果证明了使用PPI作为特征来设计用于噬菌体-细菌表型预测的机器学习模型的有效性。