Rao R Shyama Prasad, Buus Ole Thomsen, Wollenweber Bernd
Aarhus University, Department of Genetics and Biotechnology, Forsøgsvej 1, Slagelse 4200, Denmark.
Bioinform Biol Insights. 2010 Feb 17;4:9-17. doi: 10.4137/bbi.s4337.
Many proteins contain a large number of NXS/T sequences (where X is any amino acid except proline), which are the potential sites of asparagine (N)-linked glycosylation. However, the patterns of occurrence of these N-glycosylation sequons in related proteins or groups of proteins, and their underlying causes, have remained largely unexplored. We computed the actual and probabilistic occurrence of NXS/T sequons in ABC protein superfamilies from eight diverse eukaryotic organisms. The ABC proteins contained significantly more NXS/T sequons than the respective genome-wide averages, but their sequon density was significantly lower owing to the increase in protein size and the decrease in sequon-specific amino acids. However, mammalian ABC proteins have significantly higher sequon density, and both serine- and threonine-containing sequons (NXS and NXT) have been positively selected, in contrast to recent findings of only threonine-specific Darwinian selection of sequons in proteins. The occurrence of sequons was positively correlated with the frequency of sequon-specific amino acids and negatively correlated with proline and with NPS/T sequences. Further, NPS/T sequences were significantly more frequent than expected in plant ABC proteins, which have the lowest number of NXS/T sequons. Accordingly, compared to proteins overall, N-glycosylation sequons in ABC protein superfamilies have a distinct pattern of occurrence; the results are discussed from an evolutionary perspective.
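The distinction between actual and probabilistic sequon occurrence can be illustrated with a minimal Python sketch. The abstract does not specify how the authors computed the expected values, so the composition-based estimate below is an assumption: it treats residues as independent and multiplies the observed frequencies of N, non-proline, and S or T over all possible tripeptide positions. The function names are hypothetical, and sequences are assumed to be uppercase one-letter amino acid strings.

```python
import re
from collections import Counter

# A lookahead keeps the match one residue wide, so overlapping
# sequons (for example "NNTS") are each counted.
SEQUON = re.compile(r"N(?=[^P][ST])")   # N-X-S/T with X != proline
NPST = re.compile(r"N(?=P[ST])")        # N-P-S/T, excluded from sequons


def count_sequons(seq):
    """Observed number of NXS/T sequons in a protein sequence."""
    return len(SEQUON.findall(seq))


def count_nps_t(seq):
    """Observed number of NPS/T tripeptides (proline at position X)."""
    return len(NPST.findall(seq))


def expected_sequons(seq):
    """Expected NXS/T count from amino acid composition alone.

    Assumption (not from the source): residues are independent, so the
    probability of a sequon at any position is
    f(N) * (1 - f(P)) * (f(S) + f(T)), applied to len(seq) - 2 positions.
    """
    n = len(seq)
    if n < 3:
        return 0.0
    counts = Counter(seq)
    f_n = counts["N"] / n
    f_p = counts["P"] / n
    f_st = (counts["S"] + counts["T"]) / n
    return (n - 2) * f_n * (1 - f_p) * f_st


def sequon_density(seq, per=1000):
    """Observed sequons per `per` residues; 0.0 for an empty sequence."""
    return count_sequons(seq) * per / len(seq) if seq else 0.0


if __name__ == "__main__":
    toy = "MNGSANPTNNTSK"  # toy peptide, not a real ABC protein
    print(count_sequons(toy), count_nps_t(toy), expected_sequons(toy))
```

Comparing `count_sequons` with `expected_sequons` across a protein set gives a rough sense of whether sequons are over- or under-represented relative to composition. A real analysis would need a proper statistical test and the genome-wide background that the abstract refers to.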