De Groot Piet W J, Hellingwerf Klaas J, Klis Frans M
Laboratory for Microbiology, Swammerdam Institute for Life Sciences, University of Amsterdam, Nieuwe Achtergracht 166, 1018 WV Amsterdam, The Netherlands.
Yeast. 2003 Jul 15;20(9):781-96. doi: 10.1002/yea.1007.
Glycosylphosphatidylinositol-modified (GPI) proteins share structural features that allow their identification using a genomic approach. From the known S. cerevisiae and C. albicans GPI proteins, the following consensus sequence for the GPI attachment site and its downstream region was derived: [NSGDAC]-[GASVIETKDLF]-[GASV]-X(4,19)-FILMVAGPSTCYWN>, where > indicates the C-terminal end of the protein. This consensus sequence, which recognized known GPI proteins from various fungi, was used to screen the genomes of the yeasts S. cerevisiae, C. albicans, Sz. pombe and the filamentous fungus N. crassa for putative GPI proteins. The subsets of proteins so obtained were further screened for the presence of an N-terminal signal sequence for the secretion and absence of internal transmembrane domains. In this way, we identified 66 putative GPI proteins in S. cerevisiae. Some of these are known GPI proteins that were not identified by earlier genomic analyses, indicating that this selection procedure renders a more complete image of the S. cerevisiae GPI proteome. Using the same approach, 104 putative GPI proteins were identified in the human pathogen C. albicans. Among these were the proteins Gas/Phr, Ecm33, Crh and Plb, all members of GPI protein families that are also present in S. cerevisiae. In addition, several proteins and protein families with no significant homology to S. cerevisiae proteins were identified, including the cell wall-associated Als, Csa1/Rbt5, Hwp1/Rbt1 and Hyr1 protein families. In Sz. pombe, which has a low level of (galacto)mannan in the cell wall compared to C. albicans and S. cerevisiae, only 33 GPI candidates were identified and in N. crassa 97. BLAST searches revealed that about half of the putative GPI proteins that were identified in Sz. pombe and N. crassa are homologous to known or putative GPI proteins from other fungi. We conclude that our algorithm is selective and can also be used for GPI protein identification in other fungi.
糖基磷脂酰肌醇修饰(GPI)的蛋白质具有一些结构特征,使得可以通过基因组方法对其进行鉴定。从已知的酿酒酵母和白色念珠菌的GPI蛋白中,推导得出了GPI附着位点及其下游区域的如下共有序列:[NSGDAC]-[GASVIETKDLF]-[GASV]-X(4,19)-FILMVAGPSTCYWN,其中>表示蛋白质的C末端。该共有序列可识别来自各种真菌的已知GPI蛋白,被用于筛选酿酒酵母、白色念珠菌、粟酒裂殖酵母和丝状真菌粗糙脉孢菌的基因组,以寻找假定的GPI蛋白。对如此获得的蛋白质子集进一步筛选,以确定其是否存在用于分泌的N末端信号序列以及是否不存在内部跨膜结构域。通过这种方式,我们在酿酒酵母中鉴定出了66种假定的GPI蛋白。其中一些是早期基因组分析未鉴定出的已知GPI蛋白,这表明该筛选程序能够更完整地呈现酿酒酵母的GPI蛋白质组。使用相同的方法,在人类病原体白色念珠菌中鉴定出了104种假定的GPI蛋白。其中包括Gas/Phr、Ecm33、Crh和Plb蛋白,它们都是GPI蛋白家族的成员,在酿酒酵母中也存在。此外,还鉴定出了几种与酿酒酵母蛋白无显著同源性的蛋白质和蛋白质家族,包括细胞壁相关的Als、Csa1/Rbt5、Hwp1/Rbt1和Hyr1蛋白家族。与白色念珠菌和酿酒酵母相比,细胞壁中(半乳)甘露聚糖水平较低的粟酒裂殖酵母中,仅鉴定出33个GPI候选蛋白,而在粗糙脉孢菌中鉴定出97个。BLAST搜索显示,在粟酒裂殖酵母和粗糙脉孢菌中鉴定出的假定GPI蛋白中,约有一半与其他真菌的已知或假定GPI蛋白同源。我们得出结论,我们的算法具有选择性,也可用于其他真菌中GPI蛋白的鉴定。