Klein-Sousa Victor, Roa-Eguiara Aritz, Kielkopf Claudia S, Sofos Nicholas, Taylor Nicholas M I
Structural Biology of Molecular Machines Group, Protein Structure & Function Program, Novo Nordisk Foundation Center for Protein Research, Faculty of Health and Medical Sciences, University of Copenhagen, Blegdamsvej 3B, 2200 Copenhagen, Denmark.
Core Facility of Integrated Microscopy at University of Copenhagen (CFIM), Copenhagen, Denmark.
Sci Adv. 2025 Jun 6;11(23):eadv0870. doi: 10.1126/sciadv.adv0870.
Bacteriophages use receptor-binding proteins (RBPs) to adhere to bacterial hosts, yet their sequence and structural diversity remain poorly understood. Tail fibers, a major class of RBPs, are elongated and flexible trimeric proteins, making their full-length structures difficult to resolve experimentally. Advances in deep learning-based protein structure prediction, such as AlphaFold2-multimer (AF2M) and ESMFold, provide opportunities for studying these challenging proteins. Here, we introduce RBPseg, a method that combines monomeric ESMFold predictions with a structural-based domain identification approach, to divide tail fiber sequences into manageable fractions for high-confidence modeling with AF2M. Using this approach, we generated complete tail fiber models, validated by single-particle cryo-electron microscopy of five fibers from three phages. A structural classification of 67 fibers identified 16 distinct classes and 89 domains, revealing patterns of modularity, convergence, divergence, and domain swapping. Our findings suggest that these structural classes represent at least 24% of the known tail fiber universe, providing key insights into their evolution and functionality.
噬菌体利用受体结合蛋白(RBPs)附着于细菌宿主,但其序列和结构多样性仍知之甚少。尾丝是一类主要的RBPs,是细长且灵活的三聚体蛋白,这使得其实验全长结构难以解析。基于深度学习的蛋白质结构预测技术的进展,如AlphaFold2-multimer(AF2M)和ESMFold,为研究这些具有挑战性的蛋白质提供了机会。在这里,我们介绍了RBPseg,一种将单体ESMFold预测与基于结构的结构域识别方法相结合的方法,用于将尾丝序列划分为可管理的部分,以便使用AF2M进行高可信度建模。使用这种方法,我们生成了完整的尾丝模型,并通过对来自三种噬菌体的五条尾丝进行单颗粒冷冻电子显微镜验证。对67条尾丝的结构分类确定了16个不同的类别和89个结构域,揭示了模块化、趋同、分化和结构域交换的模式。我们的研究结果表明,这些结构类别至少占已知尾丝总数的24%,为其进化和功能提供了关键见解。