Yang S X, Pollock H G, Rawitch A B
Department of Biochemistry and Molecular Biology, University of Kansas Medical Center, Kansas City, 66160-7421, USA.
Arch Biochem Biophys. 1996 Mar 1;327(1):61-70. doi: 10.1006/abbi.1996.0093.
The amino acid sequence established for human thyroglobulin (hTG) from its cDNA sequence contains 20 putative N-linked glycosylation sites. We have characterized the glycopeptides contained in a tryptic digest of hTG in order to determine which sites are actually linked to carbohydrate. In addition, the distribution of oligosaccharide type(s) at these confirmed sites of N-linked glycosylation has been examined. Glycopeptides were purified using gel permeation chromatography followed by several steps of HPLC. The purified tryptic glycopeptides were characterized by gas phase sequencing and carbohydrate analysis and located within the amino acid sequence of thyroglobulin. Each of the recovered glycopeptides contained a consensus sequence for N-linked glycosylation. Of the 20 putative N-linked glycosylation sites in the human thyroglobulin polypeptide chain, 16 were shown to be actually glycosylated in the mature protein. Eight of these confirmed glycosylation sites (at positions 57, 465, 510, 729, 797, 1696, 1754, and 2230) appear to be linked to complex-type oligosaccharide units containing fucose and galactose in addition to mannose and glucosamine. Five sites (at positions 1200, 1329, 1993, 2275, and 2562) contain high mannose type units and two sites (at positions 179 and 1345) are linked to oligosaccharide units containing galactose in addition to mannose and glucosamine but no fucose and may be either hybrid or complex structures. In addition, position 928 was found to be degenerate in oligosaccharide structure and very different oligosaccharide composition types were found associated with peptides containing the same amino acid sequence. A high probability of a beta turn which would include the glycosylated asparagine residue was predicted for the amino acid sequence found at 13 of the 16 sites. The glycosylation pattern in hTG was also compared with the data recently reported for bovine thyroglobulin (bTG) (27) and as has been recently reported for bTG, no oligosaccharides of the high mannose type were found in the N-terminal portion of hTG. Only four of the 20 putative sites the sequence of hTG, at asparagine residues 91, 477, 1849, and 2102 were not represented in the purified glycopeptide population and are presumed to escape significant glycosylation.
根据人甲状腺球蛋白(hTG)的cDNA序列确定的氨基酸序列含有20个推定的N-连接糖基化位点。我们对hTG胰蛋白酶消化产物中含有的糖肽进行了表征,以确定哪些位点实际上与碳水化合物相连。此外,还研究了这些已确认的N-连接糖基化位点处寡糖类型的分布。糖肽通过凝胶渗透色谱法纯化,随后进行几步高效液相色谱法。纯化后的胰蛋白酶糖肽通过气相测序和碳水化合物分析进行表征,并定位在甲状腺球蛋白的氨基酸序列内。每个回收的糖肽都含有N-连接糖基化的共有序列。在人甲状腺球蛋白多肽链中的20个推定的N-连接糖基化位点中,有16个在成熟蛋白中被证明实际发生了糖基化。这些已确认的糖基化位点中有8个(位于第57、465、510、729、797、1696、1754和2230位)似乎与除甘露糖和葡糖胺外还含有岩藻糖和半乳糖的复合型寡糖单元相连。5个位点(位于第1200、1329、1993、2275和2562位)含有高甘露糖型单元,2个位点(位于第179和1345位)除甘露糖和葡糖胺外还与含有半乳糖的寡糖单元相连,但不含岩藻糖,可能是杂合型或复合型结构。此外,发现第928位在寡糖结构上是简并的,并且在含有相同氨基酸序列的肽段中发现了非常不同的寡糖组成类型。在16个位点中的13个位点发现的氨基酸序列中,预测包含糖基化天冬酰胺残基的β-转角的可能性很高。还将hTG中的糖基化模式与最近报道的牛甲状腺球蛋白(bTG)的数据进行了比较,并且正如最近报道的bTG一样,在hTG的N端部分未发现高甘露糖型寡糖。在hTG序列的20个推定位点中,只有4个位点(天冬酰胺残基91、477、1849和2102)在纯化的糖肽群体中未出现,推测它们未发生显著糖基化。