Department of Biochemistry & Biophysics, Stockholm University, Stockholm, Sweden.
Bioinformatics. 2010 Dec 15;26(24):3067-74. doi: 10.1093/bioinformatics/btq581. Epub 2010 Oct 14.
Learning-based model quality assessment programs have been quite successful at discriminating between high- and low-quality protein structures. Here, we show that it is possible to improve this performance significantly by restricting the learning space to a specific context, in this case membrane proteins. Since these are among the most important structures from a pharmaceutical point-of-view, it is particularly interesting to resolve local model quality for regions corresponding, e.g. to binding sites.
Our new ProQM method uses a support vector machine with a combination of general and membrane protein-specific features. For the transmembrane region, ProQM clearly outperforms all methods developed for generic proteins, and it does so while maintaining performance for extra-membrane domains; in this region it is only matched by ProQres. The predictor is shown to accurately predict quality both on the global and local level when applied to GPCR models, and clearly outperforms consensus-based scoring. Finally, the combination of ProQM and the Rosetta low-resolution energy function achieve a 7-fold enrichment in selection of near-native structural models, at very limited computational cost.
ProQM is available as a server at +proqm.cbr.su.se+.
基于学习的模型质量评估程序在区分高质量和低质量蛋白质结构方面非常成功。在这里,我们表明,通过将学习空间限制在特定的环境中,特别是在膜蛋白的情况下,这种性能可以显著提高。由于这些结构从药物的角度来看非常重要,因此确定对应于结合部位等区域的局部模型质量尤其有趣。
我们的新 ProQM 方法使用支持向量机,结合了通用和膜蛋白特异性特征。对于跨膜区域,ProQM 明显优于为通用蛋白开发的所有方法,并且在保持额外膜区域性能的同时做到这一点;在这个区域,它仅与 ProQres 相匹配。当应用于 GPCR 模型时,该预测器在全局和局部水平上均显示出准确预测质量的能力,并且明显优于基于共识的评分。最后,ProQM 和 Rosetta 低分辨率能量函数的组合在选择近天然结构模型方面实现了 7 倍的富集,而计算成本非常有限。
ProQM 可作为服务器在 +proqm.cbr.su.se+ 上使用。