Group Onco-Proteins, Institut de Recherche de l'Ecole de Biotechnologie de Strasbourg, Illkirch, France.
PLoS One. 2011;6(11):e25376. doi: 10.1371/journal.pone.0025376. Epub 2011 Nov 1.
PDZ domains recognise short sequence motifs at the extreme C-termini of proteins. A model based on microarray data has been recently published for predicting the binding preferences of PDZ domains to five residue long C-terminal sequences. Here we investigated the potential of this predictor for discovering novel protein interactions that involve PDZ domains. When tested on real negative data assembled from published literature, the predictor displayed a high false positive rate (FPR). We predicted and experimentally validated interactions between four PDZ domains derived from the human proteins MAGI1 and SCRIB and 19 peptides derived from human and viral C-termini of proteins. Measured binding intensities did not correlate with prediction scores, and the high FPR of the predictor was confirmed. Results indicate that limitations of the predictor may arise from an incomplete model definition and improper training of the model. Taking into account these limitations, we identified several novel putative interactions between PDZ domains of MAGI1 and SCRIB and the C-termini of the proteins FZD4, ARHGAP6, NET1, TANC1, GLUT7, MARCH3, MAS, ABC1, DLL1, TMEM215 and CYSLTR2. These proteins are localised to the membrane or suggested to act close to it and are often involved in G protein signalling. Furthermore, we showed that, while extension of minimal interacting domains or peptides toward tandem constructs or longer peptides never suppressed their ability to interact, the measured affinities and inferred specificity patterns often changed significantly. This suggests that if protein fragments interact, the full length proteins are also likely to interact, albeit possibly with altered affinities and specificities. Therefore, predictors dealing with protein fragments are promising tools for discovering protein interaction networks but their application to predict binding preferences within networks may be limited.
PDZ 结构域识别蛋白质 C 末端极短序列基序。最近发表了一个基于微阵列数据的模型,用于预测 PDZ 结构域与五残基长 C 末端序列的结合偏好。在这里,我们研究了该预测器发现涉及 PDZ 结构域的新蛋白质相互作用的潜力。当在从已发表文献中汇编的真实负数据上进行测试时,该预测器显示出高假阳性率(FPR)。我们预测并通过实验验证了源自人类蛋白 MAGI1 和 SCRIB 的四个 PDZ 结构域与源自人类和病毒蛋白 C 末端的 19 个肽之间的相互作用。测量的结合强度与预测得分无关,并且预测器的高 FPR 得到了确认。结果表明,预测器的限制可能源于模型定义不完整和模型训练不当。考虑到这些限制,我们确定了 MAGI1 和 SCRIB 的 PDZ 结构域与蛋白 FZD4、ARHGAP6、NET1、TANC1、GLUT7、MARCH3、MAS、ABC1、DLL1、TMEM215 和 CYSLTR2 的 C 末端之间的几个新的潜在相互作用。这些蛋白定位于膜或被认为靠近膜,并且经常参与 G 蛋白信号转导。此外,我们表明,虽然最小相互作用结构域或肽向串联构建体或更长肽的延伸从不抑制其相互作用能力,但测量的亲和力和推断的特异性模式经常发生显著变化。这表明,如果蛋白质片段相互作用,则全长蛋白质也很可能相互作用,尽管亲和力和特异性可能会发生改变。因此,处理蛋白质片段的预测器是发现蛋白质相互作用网络的有前途的工具,但其在预测网络内结合偏好的应用可能受到限制。