Fraser James S, Yu Zhou, Maxwell Karen L, Davidson Alan R
Department of Molecular and Medical Genetics, University of Toronto, Toronto, Ont., Canada M5S 1A8.
J Mol Biol. 2006 Jun 2;359(2):496-507. doi: 10.1016/j.jmb.2006.03.043. Epub 2006 Apr 6.
The immunoglobulin (Ig) fold is one of the most important structures in biology, playing essential roles in the vertebrate immune response, cell adhesion, and many other processes. Through bioinformatic analysis, we have discovered that Ig-like domains are often found in the constituent proteins of tailed double-stranded (ds) DNA bacteriophage particles, and are likely displayed on the surface of these viruses. These phage Ig-like domains fall into three distinct sequence families, which are similar to the classic immunoglobulin domain (I-Set), the fibronectin type 3 repeat (FN3), and the bacterial Ig-like domain (Big2). The phage Ig-like domains are very promiscuous. They are attached to more than ten different functional classes of proteins, and found in all three morphogenetic classes of tailed dsDNA phages. In addition, they reside in phages that infect a diverse set of gram negative and gram positive bacteria. These domains are deceptive because many are added to larger proteins through programmed ribosomal frameshifting, so that they are not always detected by standard protein sequence searching procedures. In addition, the presence of unrecognized Ig-like domains in a variety of phage proteins with different functions has led to gene misannotation. Our results demonstrate that horizontal gene transfer involving Ig-like domain encoding DNA has occurred commonly between diverse classes of both lytic and temperate phages, which otherwise display very limited sequence similarities to one another. We suggest that phage may have been an important vector in the spread of Ig-like domains through diverse species of bacteria. While the function of the phage Ig-like domains is unknown, several lines of evidence suggest that they may play an accessory role in phage infection by weakly interacting with carbohydrates on the bacterial cell surface.
免疫球蛋白(Ig)折叠是生物学中最重要的结构之一,在脊椎动物免疫反应、细胞黏附及许多其他过程中发挥着关键作用。通过生物信息学分析,我们发现Ig样结构域经常出现在有尾双链(ds)DNA噬菌体颗粒的组成蛋白中,并且可能展示在这些病毒的表面。这些噬菌体Ig样结构域可分为三个不同的序列家族,它们分别类似于经典免疫球蛋白结构域(I-Set)、纤连蛋白3型重复序列(FN3)和细菌Ig样结构域(Big2)。噬菌体Ig样结构域非常混杂。它们与十多种不同功能类别的蛋白质相连,并且在有尾dsDNA噬菌体的所有三种形态发生类别中都能找到。此外,它们存在于感染多种革兰氏阴性菌和革兰氏阳性菌的噬菌体中。这些结构域具有欺骗性,因为许多结构域是通过程序性核糖体移码添加到较大蛋白质中的,所以标准蛋白质序列搜索程序并不总能检测到它们。此外,多种具有不同功能的噬菌体蛋白质中存在未被识别的Ig样结构域导致了基因错误注释。我们的结果表明,涉及编码Ig样结构域DNA的水平基因转移在不同类别的裂解性噬菌体和温和噬菌体之间普遍发生,否则它们彼此之间的序列相似性非常有限。我们认为噬菌体可能是Ig样结构域在不同细菌物种间传播的重要载体。虽然噬菌体Ig样结构域的功能尚不清楚,但有几条证据表明它们可能通过与细菌细胞表面的碳水化合物弱相互作用在噬菌体感染中发挥辅助作用。