Papaloukas Costas, Granseth Erik, Viklund Håkan, Elofsson Arne
Stockholm Bioinformatics Center, Center for Biomembrane Research, Department of Biochemistry and Biophysics, Stockholm University, SE-106 91 Stockholm, Sweden.
Protein Sci. 2008 Feb;17(2):271-8. doi: 10.1110/ps.073036108. Epub 2007 Dec 20.
Zpred2 is an improved version of ZPRED, a predictor for the Z-coordinates of alpha-helical membrane proteins, that is, the distance of the residues from the center of the membrane. Using principal component analysis and a set of neural networks, Zpred2 analyzes data extracted from the amino acid sequence, the predicted topology, and evolutionary profiles. Zpred2 achieves an average accuracy error of 2.18 A (2.17 A when an independent test set is used), an improvement by 15% compared to the previous version. We show that this accuracy is sufficient to enable the predictions of helix lengths with a correlation coefficient of 0.41. As a comparison, two state-of-the-art HMM-based topology prediction methods manage to predict the helix lengths with a correlation coefficient of less than 0.1. In addition, we applied Zpred2 to two other problems, the re-entrant region identification and model validation. Re-entrants were able to be detected with a certain consistency, but not better than with previous approaches, while incorrect models as well as mispredicted helices of transmembrane proteins could be distinguished based on the Z-coordinate predictions.
Zpred2是ZPRED的改进版本,ZPRED是一种用于预测α-螺旋膜蛋白Z坐标的预测工具,即残基与膜中心的距离。Zpred2使用主成分分析和一组神经网络,分析从氨基酸序列、预测的拓扑结构和进化谱中提取的数据。Zpred2的平均预测误差为2.18埃(使用独立测试集时为2.17埃),与上一版本相比提高了15%。我们表明,这种精度足以以0.41的相关系数预测螺旋长度。作为对比,两种基于隐马尔可夫模型(HMM)的先进拓扑预测方法预测螺旋长度的相关系数小于0.1。此外,我们将Zpred2应用于另外两个问题,即折返区域识别和模型验证。折返区域能够以一定的一致性被检测到,但不比以前的方法更好,而基于Z坐标预测,可以区分错误的模型以及跨膜蛋白预测错误的螺旋。