Hawkins John, Mahony Donna, Maetschke Stefan, Wakabayashi Mark, Teasdale Rohan D, Bodén Mikael
ARC Centre for Complex Systems, The University of Queensland, St. Lucia, Queensland 4072, Australia.
Proteins. 2007 Nov 15;69(3):606-16. doi: 10.1002/prot.21420.
Peroxisomes are small subcellular compartments responsible for a range of essential metabolic processes. Efforts in predicting peroxisomal protein import are challenged by species variation and sparse sequence data sets with experimentally confirmed localization. We present a predictor of peroxisomal import based on the presence of the dominant peroxisomal targeting signal one (PTS1), a seemingly wellconserved but highly unspecific motif. The signal appears to rely on subtle dependencies with the preceding residues. We evaluate prediction accuracies against two alternative predictor services, PEROXIP and the PTS1 PREDICTOR. We test the integrity of prediction on a range of prokaryotic and eukaryotic proteomes lacking peroxisomes. Similarly we test the accuracy on peroxisomal proteins known to not overlap with training data. The model identified a number of proteins within the RIKEN IPS7 mouse protein dataset as potentially novel peroxisomal proteins. Three were confirmed in vitro using immunofluorescent detection of myc-epitope-tagged proteins in transiently transfected BHK-21 cells (Dhrs2, Serhl, and Ehhadh). The final model has a superior specificity to both alternatives, and an accuracy better than PEROXIP and on par with PTS1 PREDICTOR. Thus, the model we present should prove invaluable for labeling PTS1 targeted proteins with high confidence. We use the predictor to screen several additional eukaryotic genomes to revise previously estimated numbers of peroxisomal proteins. Available at http://pprowler.itee.uq.edu.au.
过氧化物酶体是负责一系列重要代谢过程的小型亚细胞区室。预测过氧化物酶体蛋白导入的工作受到物种差异和实验确认定位的稀疏序列数据集的挑战。我们基于主要的过氧化物酶体靶向信号1(PTS1)的存在提出了一种过氧化物酶体导入预测器,PTS1是一个看似保守但特异性很低的基序。该信号似乎依赖于与前面残基的微妙依赖性。我们针对两种替代预测服务PEROXIP和PTS1 PREDICTOR评估预测准确性。我们在一系列缺乏过氧化物酶体的原核和真核蛋白质组上测试预测的完整性。同样,我们在已知与训练数据不重叠的过氧化物酶体蛋白上测试准确性。该模型在理化学研究所IPS7小鼠蛋白质数据集中鉴定出一些蛋白质可能是新的过氧化物酶体蛋白。其中三种通过在瞬时转染的BHK - 21细胞中对带有myc表位标签的蛋白质进行免疫荧光检测在体外得到证实(Dhrs2、Serhl和Ehhadh)。最终模型的特异性优于这两种替代方法,准确性优于PEROXIP且与PTS1 PREDICTOR相当。因此,我们提出的模型对于高置信度标记PTS1靶向蛋白应该是非常有价值的。我们使用该预测器筛选了几个其他真核基因组,以修订先前估计的过氧化物酶体蛋白数量。可在http://pprowler.itee.uq.edu.au获取。