Eisenhaber Birgit, Wildpaner Michael, Schultz Carolyn J, Borner Georg H H, Dupree Paul, Eisenhaber Frank
Research Institute of Molecular Pathology, Dr. Bohr-Gasse 7, A-1030 Vienna, Austria.
Plant Physiol. 2003 Dec;133(4):1691-701. doi: 10.1104/pp.103.023580.
Posttranslational glycosylphosphatidylinositol (GPI) lipid anchoring is common not only for animal and fungal but also for plant proteins. The attachment of the GPI moiety to the carboxyl-terminus after proteolytic cleavage of a C-terminal propeptide is performed by the transamidase complex. Its four known subunits also have obvious full-length orthologs in the Arabidopsis and rice (Oryza sativa) genomes; thus, the mechanism of substrate protein processing appears similar for all eukaryotes. A learning set of plant proteins (substrates for the transamidase complex) has been collected both from the literature and plant sequence databases. We find that the plant GPI lipid anchor motif differs in minor aspects from the animal signal (e.g. the plant hydrophobic tail region can contain a higher fraction of aromatic residues). We have developed the "big-Pi plant" program for prediction of compatibility of query protein C-termini with the plant GPI lipid anchor motif requirements. Validation tests show that the sensitivity for transamidase targets is approximately 94%, and the rate of false positive prediction is about 0.1%. Thus, the big-Pi predictor can be applied as unsupervised genome annotation and target selection tool. The program is also suited for the design of modified protein constructs to test their GPI lipid anchoring capacity. The big-Pi plant predictor Web server and lists of potential plant precursor proteins in Swiss-Prot, SPTrEMBL, Arabidopsis, and rice proteomes are available at http://mendel.imp.univie.ac.at/gpi/plants/gpi_plants.html. Arabidopsis and rice protein hits have been functionally classified. Several GPI lipid-anchored arabinogalactan-related proteins have been identified in rice.
翻译后糖基磷脂酰肌醇(GPI)脂质锚定不仅在动物和真菌中常见,在植物蛋白中也很常见。在C端前肽被蛋白水解切割后,GPI部分与羧基末端的连接由转酰胺酶复合物完成。其四个已知亚基在拟南芥和水稻基因组中也有明显的全长直系同源物;因此,底物蛋白加工机制在所有真核生物中似乎相似。已从文献和植物序列数据库中收集了一组植物蛋白(转酰胺酶复合物的底物)。我们发现植物GPI脂质锚定基序在一些小方面与动物信号不同(例如,植物疏水尾部区域可能含有更高比例的芳香族残基)。我们开发了“big-Pi植物”程序,用于预测查询蛋白C末端与植物GPI脂质锚定基序要求的兼容性。验证测试表明,对转酰胺酶靶标的敏感性约为94%,假阳性预测率约为0.1%。因此,big-Pi预测器可作为无监督基因组注释和靶标选择工具。该程序也适用于设计修饰的蛋白质构建体,以测试其GPI脂质锚定能力。big-Pi植物预测器网络服务器以及瑞士蛋白质数据库、SPTrEMBL、拟南芥和水稻蛋白质组中潜在植物前体蛋白列表可在http://mendel.imp.univie.ac.at/gpi/plants/gpi_plants.html获取。已对拟南芥和水稻的蛋白命中结果进行了功能分类。在水稻中已鉴定出几种GPI脂质锚定的阿拉伯半乳聚糖相关蛋白。