Ben-Dor Shifra, Esterman Nir, Rubin Eitan, Sharon Nathan
Department of Biological Services, Weizmann Institute of Science, Rehovot 76100, Israel.
Glycobiology. 2004 Feb;14(2):95-101. doi: 10.1093/glycob/cwh004. Epub 2003 Sep 26.
N-Glycosylation, the most common and most versatile protein modification reaction, occurs at the beta-amide of the aspargine of the Asn-Xaa-Ser/Thr sequon. For reasons that are unclear, not all such sequons are glycosylated. To find patterns that affect glycosylation, we examined the amino acid residues from the 20th preceding the sequon to the 20th residue following it, using bioinformatics tools. A clean data set of annotated, experimentally verified, glycosylated and nonglycosylated sequons derived from 617 well-defined nonredundant N- and N-,O-glycoproteins listed in SWISS-PROT (June 2002) was used. NXS and NXT sequons were analyzed separately. Although no overt patterns were found to explain sequon occupancy or nonoccupancy, trends for over- or underrepresentation of certain amino acids at particular positions were statistically significant and different in NXS and NXT sequons. In extension of earlier reports, none of the 80 Asn-Pro-Ser/Thr found were glycosylated, and a markedly low level of glycosylation was seen in sequons with Pro at the position following the Ser/Thr. In addition, a general observation was made that the considerable number of glycosylated sequons in the C-terminal 10 residues of glycoproteins suggests that N-glycosylation in these cases may be posttranslational and not cotranslational, as widely accepted.
N-糖基化是最常见且最具多样性的蛋白质修饰反应,发生在Asn-Xaa-Ser/Thr序列中天冬酰胺的β-酰胺基上。由于尚不清楚的原因,并非所有此类序列都会发生糖基化。为了找出影响糖基化的模式,我们使用生物信息学工具检查了序列前第20个氨基酸残基到序列后第20个氨基酸残基。使用了一个经过注释、实验验证的糖基化和非糖基化序列的纯净数据集,这些序列来自SWISS-PROT(2002年6月)中列出的617个明确的非冗余N-糖基化和N-、O-糖蛋白。NXS和NXT序列分别进行了分析。尽管未发现明显模式来解释序列的占据或未占据情况,但特定位置某些氨基酸的过表达或低表达趋势在统计学上具有显著性,且在NXS和NXT序列中有所不同。作为早期报告的延伸,所发现的80个Asn-Pro-Ser/Thr序列均未发生糖基化,并且在Ser/Thr之后位置为Pro的序列中糖基化水平明显较低。此外,还有一个普遍观察结果是,糖蛋白C末端10个残基中有相当数量的糖基化序列,这表明在这些情况下N-糖基化可能是翻译后修饰而非如广泛接受的那样是共翻译修饰。