Jochmann Ramona, Holz Patrick, Sticht Heinrich, Stürzl Michael
Division of Molecular and Experimental Surgery, University Medical Center Erlangen, Friedrich-Alexander University of Erlangen-Nuremberg, Schwabachanlage 10, 91054 Erlangen, Germany.
Institute of Biochemistry, University of Erlangen-Nuremberg, Fahrstr. 17, 91054 Erlangen, Germany.
Biochim Biophys Acta. 2014 Feb;1844(2):416-21. doi: 10.1016/j.bbapap.2013.12.002. Epub 2013 Dec 9.
O-GlcNAcylation is an inducible, highly dynamic and reversible posttranslational modification, which regulates numerous cellular processes such as gene expression, translation, immune reactions, protein degradation, protein-protein interaction, apoptosis, and signal transduction. In contrast to N-linked glycosylation, O-GlcNAcylation does not display a strict amino acid consensus sequence, although serine or threonine residues flanked by proline and valine are preferred sites of O-GlcNAcylation. Based on this information, computational prediction tools of O-GlcNAc sites have been developed. Here, we retrospectively assessed the performance of two available O-GlcNAc prediction programs YinOYang 1.2 server and OGlcNAcScan by comparing their predictions for recently discovered experimentally validated O-GlcNAc sites. Both prediction programs efficiently identified O-GlcNAc sites situated in an environment resembling the consensus sequence P-P-V-[ST]-T-A. However, both prediction programs revealed numerous false negative O-GlcNAc predictions when the site of modification was located in an amino acid sequence differing from the known consensus sequence. By searching for a common sequence motif, we found that O-GlcNAcylation of nucleocytoplasmic proteins preferably occurs at serine and threonine residues flanked downstream by proline and valine and upstream by one to two alanines followed by a stretch of serine and threonine residues. However, O-GlcNAcylation of proteins located in the mitochondria or in the secretory lumen occurs at different sites and does not follow a distinct consensus sequence. Thus, our study indicates the limitations of the presently available computational prediction methods for O-GlcNAc sites and suggests that experimental validation is mandatory. Continuously update and further development of available databases will be the key to improve the performance of O-GlcNAc site prediction.
O-连接的N-乙酰葡糖胺化(O-GlcNAcylation)是一种可诱导的、高度动态且可逆的翻译后修饰,它调控众多细胞过程,如基因表达、翻译、免疫反应、蛋白质降解、蛋白质-蛋白质相互作用、细胞凋亡和信号转导。与N-连接糖基化不同,O-GlcNAcylation没有严格的氨基酸共有序列,尽管脯氨酸和缬氨酸侧翼的丝氨酸或苏氨酸残基是O-GlcNAcylation的优选位点。基于此信息,已开发出O-GlcNAc位点的计算预测工具。在此,我们通过比较两个可用的O-GlcNAc预测程序YinOYang 1.2服务器和OGlcNAcScan对最近发现的经实验验证的O-GlcNAc位点的预测,回顾性评估了它们的性能。两个预测程序都能有效地识别位于类似于共有序列P-P-V-[ST]-T-A环境中的O-GlcNAc位点。然而,当修饰位点位于与已知共有序列不同的氨基酸序列中时,两个预测程序都显示出大量假阴性的O-GlcNAc预测。通过搜索共同的序列基序,我们发现核质蛋白的O-GlcNAcylation优选发生在脯氨酸和缬氨酸侧翼下游、一到两个丙氨酸侧翼上游且随后是一段丝氨酸和苏氨酸残基的丝氨酸和苏氨酸残基处。然而,位于线粒体或分泌腔中的蛋白质的O-GlcNAcylation发生在不同位点,且不遵循明显的共有序列。因此,我们的研究表明了目前可用的O-GlcNAc位点计算预测方法的局限性,并表明实验验证是必不可少的。不断更新和进一步开发可用数据库将是提高O-GlcNAc位点预测性能的关键。