Kawabata T, Arisaka F, Nishikawa K
Center for Information Biology, National Institute of Genetics, 1111, Yata, Mishima, Shizuoka 411-8540, Japan.
Gene. 2000 Dec 23;259(1-2):223-33. doi: 10.1016/s0378-1119(00)00442-x.
Among the total of 274 orfs within bacteriophage T4, only half have been reasonably well characterized, and the functions of the rest have remained obscure. In order to predict the molecular functions of the orfs, a position-specific iterated (PSI)-BLAST search of bacteriophage T4 against the sequence database of known 3D structures was carried out. PSI-BLAST is one of the most powerful iterative sequence search methods using multiple sequence alignment, with the ability to detect many more proteins with distant homology than standard pairwise methods. The 3D structures of proteins are considered to be better preserved than the sequences, and the detected distantly homologous proteins are likely to possess highly similar 3D structures. Thirteen orfs of phage T4, whose homologues were not detected by standard pairwise methods, were found to have significantly homologous counterparts by this method. The plausibility of the results was confirmed by checking whether important residues at substrate/ligand-binding sites were conserved. Among them, two orfs, vs.1 and e.1, which are similar to Escherichia coli lytic enzyme and MutT protein, respectively, had not been studied previously. Also, gp rIIA, a rapid lysis protein, whose gene structure had been intensively studied during the development of molecular biology in the 1950s and yet whose molecular function remains unknown, has an N-terminal domain that is significantly similar to the N-terminal region of the heat shock protein Hsp90.
在噬菌体T4的总共274个开放阅读框(ORF)中,只有一半得到了较为充分的表征,其余的功能仍不清楚。为了预测这些开放阅读框的分子功能,针对已知三维结构的序列数据库对噬菌体T4进行了位置特异性迭代(PSI)-BLAST搜索。PSI-BLAST是使用多序列比对的最强大的迭代序列搜索方法之一,与标准的成对方法相比,它能够检测到更多具有远缘同源性的蛋白质。蛋白质的三维结构被认为比序列更保守,检测到的远缘同源蛋白质可能具有高度相似的三维结构。通过这种方法发现,噬菌体T4的13个开放阅读框,其同源物未被标准成对方法检测到,但却有明显同源的对应物。通过检查底物/配体结合位点的重要残基是否保守,证实了结果的合理性。其中,分别与大肠杆菌裂解酶和MutT蛋白相似的两个开放阅读框vs.1和e.1,此前尚未被研究过。此外,gp rIIA是一种快速裂解蛋白,其基因结构在20世纪50年代分子生物学发展过程中曾被深入研究,但其分子功能仍然未知,它有一个与热休克蛋白Hsp90的N端区域显著相似的N端结构域。